List evaluation results
GET /evaluations/{evaluationId}/results
List all evaluation results for a specific evaluation. Results are created as the evaluation runs and track per-example outcomes.
Authorizations
Section titled “Authorizations ”Parameters
Section titled “ Parameters ”Path Parameters
Section titled “Path Parameters ”ID of the parent evaluation to list results for
ID of the parent evaluation to list results for
Query Parameters
Section titled “Query Parameters ”Comma-separated list of result IDs to filter by
Comma-separated list of result IDs to filter by
Comma-separated list of statuses to filter by (pending, passed, failed, error, skipped)
Comma-separated list of statuses to filter by (pending, passed, failed, error, skipped)
Page number for pagination (1-indexed)
Page number for pagination (1-indexed)
Number of results per page (1-100, default: 20)
Number of results per page (1-100, default: 20)
Field to sort results by
Field to sort results by
Sort direction: ascending or descending
Sort direction: ascending or descending
Responses
Section titled “ Responses ”Evaluation results retrieved successfully
object
Array of evaluation results matching the query
object
Unique identifier for the evaluation result (nanoid format)
ID of the parent evaluation
ID of the example being evaluated
ID of the pipeline execution created for this example
Status: ‘pending’, ‘passed’, ‘failed’, ‘error’, or ‘skipped’
Example input mapped to pipeline input format
Output from pipeline execution
Evaluation score (0.0 to 1.0)
Evaluator explanation/feedback
Error details if status is ‘error’ or ‘skipped’
When processing started for this example (ISO 8601)
When processing completed for this example (ISO 8601)
When the result was created (ISO 8601)
Total number of evaluation results matching the query (before pagination)
Current page number
Number of results per page
Bad Request - Validation error or invalid input
object
Unauthorized - Authentication required or invalid token
object
Forbidden - Insufficient permissions
object
Not Found - Resource does not exist