Promoting Executions to Examples

This guide explains how to build high-quality example sets by promoting successful pipeline executions to ground truth examples.

Why Promote Executions?

Promoting executions offers several advantages:

Real-world data - Examples reflect actual production usage
Verified outputs - Only promote executions you’ve confirmed are correct
Efficient workflow - Build examples during normal review processes
Natural coverage - Organically captures the diversity of your use cases

Prerequisites

A pipeline with execution history
A review process where experts verify outputs

Workflow Overview

Execute Pipeline → Expert Reviews → Verified Correct? → Promote to Example

Step 1: Identify Good Executions

Find executions that represent correct behavior:

List successful executions

curl "https://api.catalyzed.ai/pipeline-executions?pipelineId=$PIPELINE_ID&status=succeeded" \
  -H "Authorization: Bearer $API_TOKEN"

const response = await fetch(
  `https://api.catalyzed.ai/pipeline-executions?pipelineId=${pipelineId}&status=succeeded`,
  { headers: { Authorization: `Bearer ${apiToken}` } }
);
const { executions } = await response.json();

for (const exec of executions) {
  console.log(`${exec.executionId}: ${exec.pipelineName}`);
  console.log(`  Input: ${JSON.stringify(exec.input).slice(0, 100)}...`);
  console.log(`  Output: ${JSON.stringify(exec.output).slice(0, 100)}...`);
}

response = requests.get(
    "https://api.catalyzed.ai/pipeline-executions",
    params={"pipelineId": pipeline_id, "status": "succeeded"},
    headers={"Authorization": f"Bearer {api_token}"}
)
executions = response.json()["executions"]

for exec in executions:
    print(f"{exec['executionId']}: {exec['pipelineName']}")
    print(f"  Input: {str(exec['input'])[:100]}...")
    print(f"  Output: {str(exec['output'])[:100]}...")

Step 2: Promote to Example

Option A: Add to Existing Example Set

If you already have an example set:

Promote to existing set

curl -X POST "https://api.catalyzed.ai/pipeline-executions/$EXECUTION_ID/promote-to-example" \
  -H "Authorization: Bearer $API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "existingExampleSetId": "KjR8I6rHBms3W4Qfa2-FN",
    "exampleName": "Financial report - Q4 2024",
    "rationale": "Correctly captured all key metrics in expected format"
  }'

const response = await fetch(
  `https://api.catalyzed.ai/pipeline-executions/${executionId}/promote-to-example`,
  {
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiToken}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      existingExampleSetId: "KjR8I6rHBms3W4Qfa2-FN",
      exampleName: "Financial report - Q4 2024",
      rationale: "Correctly captured all key metrics in expected format",
    }),
  }
);
const result = await response.json();
console.log(`Created example: ${result.example.exampleId}`);

response = requests.post(
    f"https://api.catalyzed.ai/pipeline-executions/{execution_id}/promote-to-example",
    headers={"Authorization": f"Bearer {api_token}"},
    json={
        "existingExampleSetId": "KjR8I6rHBms3W4Qfa2-FN",
        "exampleName": "Financial report - Q4 2024",
        "rationale": "Correctly captured all key metrics in expected format"
    }
)
result = response.json()
print(f"Created example: {result['example']['exampleId']}")

Option B: Create New Example Set

If this is your first example or you want a new set:

Promote with new set

curl -X POST "https://api.catalyzed.ai/pipeline-executions/$EXECUTION_ID/promote-to-example" \
  -H "Authorization: Bearer $API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "newExampleSet": {
      "name": "Production Examples - Document Summarizer",
      "description": "Ground truth from verified production executions"
    },
    "exampleName": "First production example",
    "rationale": "Verified by senior analyst"
  }'

const response = await fetch(
  `https://api.catalyzed.ai/pipeline-executions/${executionId}/promote-to-example`,
  {
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiToken}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      newExampleSet: {
        name: "Production Examples - Document Summarizer",
        description: "Ground truth from verified production executions",
      },
      exampleName: "First production example",
      rationale: "Verified by senior analyst",
    }),
  }
);
const result = await response.json();
console.log(`Created example set: ${result.exampleSet.exampleSetId}`);
console.log(`Created example: ${result.example.exampleId}`);

response = requests.post(
    f"https://api.catalyzed.ai/pipeline-executions/{execution_id}/promote-to-example",
    headers={"Authorization": f"Bearer {api_token}"},
    json={
        "newExampleSet": {
            "name": "Production Examples - Document Summarizer",
            "description": "Ground truth from verified production executions"
        },
        "exampleName": "First production example",
        "rationale": "Verified by senior analyst"
    }
)
result = response.json()
print(f"Created example set: {result['exampleSet']['exampleSetId']}")
print(f"Created example: {result['example']['exampleId']}")

Response:

{
  "example": {
    "exampleId": "ExR8I6rHBms3W4Qfa2-FN",
    "exampleSetId": "KjR8I6rHBms3W4Qfa2-FN",
    "name": "Financial report - Q4 2024",
    "input": { ... },
    "expectedOutput": { ... },
    "rationale": "Correctly captured all key metrics in expected format"
  },
  "exampleSet": {
    "exampleSetId": "KjR8I6rHBms3W4Qfa2-FN",
    "name": "Production Examples - Document Summarizer"
  },
  "exampleSetCreated": true
}

Option C: Override the Expected Output

Sometimes the execution output is close but not perfect. You can override it:

{
  "existingExampleSetId": "KjR8I6rHBms3W4Qfa2-FN",
  "exampleName": "Financial report - corrected",
  "expectedOutput": {
    "summary": "Q4 2024: Revenue up 15% YoY ($2.3M), stable churn at 3.2%."
  },
  "rationale": "Corrected missing churn metric from original output"
}

Batch Promotion

For promoting multiple executions at once:

async function batchPromote(executionIds: string[], exampleSetId: string) {
  const results = [];

  for (const executionId of executionIds) {
    try {
      const response = await fetch(
        `https://api.catalyzed.ai/pipeline-executions/${executionId}/promote-to-example`,
        {
          method: "POST",
          headers: {
            Authorization: `Bearer ${apiToken}`,
            "Content-Type": "application/json",
          },
          body: JSON.stringify({
            existingExampleSetId: exampleSetId,
            rationale: "Batch promoted - verified correct",
          }),
        }
      );

      if (response.ok) {
        const result = await response.json();
        results.push({ executionId, success: true, exampleId: result.example.exampleId });
      } else {
        const error = await response.json();
        results.push({ executionId, success: false, error: error.message });
      }
    } catch (error) {
      results.push({ executionId, success: false, error: String(error) });
    }
  }

  return results;
}

// Usage
const verifiedExecutions = ["exec1", "exec2", "exec3"];
const results = await batchPromote(verifiedExecutions, exampleSetId);
console.log(`Promoted ${results.filter(r => r.success).length}/${results.length} executions`);

Best Practices

1. Establish a Review Process

Create a clear workflow for reviewing and promoting:

1. Expert reviews execution output
2. Marks as "correct" or "needs improvement"
3. If correct → Promote to example with rationale
4. If incorrect → Create signal with feedback

2. Use Descriptive Names

Good example names help identify issues later:

Bad	Good
`Example 1`	`Financial report - Q4 2024 earnings`
`Test case`	`Edge case - empty document handling`
`Promoted execution`	`Customer support ticket - billing inquiry`

3. Document Rationale

Always include why the output is correct:

{
  "rationale": "Correctly identified all 3 key action items. Used proper formatting with bullet points. Excluded irrelevant meeting chitchat."
}

4. Cover Edge Cases

Actively seek diverse examples:

Different input types - Various document formats, lengths
Edge cases - Empty inputs, unusual formats
Error handling - Graceful failures
Boundary conditions - Maximum/minimum values

5. Review Before Batch Operations

When batch promoting, spot-check a sample first:

// Get a random sample of 5 to review before batch promotion
const sample = executions.sort(() => 0.5 - Math.random()).slice(0, 5);
for (const exec of sample) {
  console.log("Review this execution:");
  console.log(`Input: ${JSON.stringify(exec.input, null, 2)}`);
  console.log(`Output: ${JSON.stringify(exec.output, null, 2)}`);
  // Prompt for confirmation before including in batch
}

Common Errors

Error	Cause	Solution
`409 Conflict`	Execution hasn’t succeeded	Only promote `succeeded` executions
`400 Bad Request`	Missing required fields	Include either `existingExampleSetId` or `newExampleSet`
`403 Forbidden`	No access to example set	Verify team membership

Integration Ideas

Slack Bot Integration

// When expert reacts with :white_check_mark: to an execution notification
async function handleApprovalReaction(executionId: string, userId: string) {
  const result = await promoteExecution(executionId, exampleSetId);
  await sendSlackMessage(userId, `Promoted to example: ${result.example.exampleId}`);
}

Review Dashboard

Build a simple review UI:

Show recent executions with input/output
“Approve” button → Promote to example
“Reject” button → Create signal with feedback
Track approval rate over time

Evaluation Workflow - Use promoted examples in evaluations
Feedback Loops - Capture feedback on incorrect executions
Examples - Example API reference
Example Sets - Managing example sets