Update Evaluation Run | Humanloop Docs

{
  "id": "id",
  "orchestrated": true,
  "added_at": "2024-01-15T09:30:00Z",
  "created_at": "2024-01-15T09:30:00Z",
  "status": "pending",
  "control": true,
  "dataset": {
    "path": "path",
    "id": "id",
    "name": "name",
    "version_id": "version_id",
    "created_at": "2024-01-15T09:30:00Z",
    "updated_at": "2024-01-15T09:30:00Z",
    "last_used_at": "2024-01-15T09:30:00Z",
    "datapoints_count": 1,
    "directory_id": "directory_id",
    "description": "description",
    "schema": {
      "key": "value"
    },
    "readme": "readme",
    "tags": [
      "tags"
    ],
    "type": "dataset",
    "environments": [
      {
        "id": "id",
        "created_at": "2024-01-15T09:30:00Z",
        "name": "name",
        "tag": "default"
      }
    ],
    "created_by": {
      "id": "id",
      "email_address": "email_address",
      "full_name": "full_name"
    },
    "version_name": "version_name",
    "version_description": "version_description",
    "datapoints": [
      {
        "id": "id"
      }
    ],
    "attributes": {
      "key": "value"
    }
  },
  "version": {
    "path": "path",
    "id": "id",
    "directory_id": "directory_id",
    "model": "model",
    "endpoint": "complete",
    "template": "template",
    "template_language": "default",
    "provider": "anthropic",
    "max_tokens": 1,
    "temperature": 1.1,
    "top_p": 1.1,
    "stop": "stop",
    "presence_penalty": 1.1,
    "frequency_penalty": 1.1,
    "other": {
      "key": "value"
    },
    "seed": 1,
    "response_format": {
      "type": "json_object",
      "json_schema": {
        "key": "value"
      }
    },
    "reasoning_effort": "high",
    "tools": [
      {
        "name": "name",
        "description": "description"
      }
    ],
    "linked_tools": [
      {
        "name": "name",
        "description": "description",
        "id": "id",
        "version_id": "version_id"
      }
    ],
    "attributes": {
      "key": "value"
    },
    "version_name": "version_name",
    "version_description": "version_description",
    "description": "description",
    "tags": [
      "tags"
    ],
    "readme": "readme",
    "name": "name",
    "schema": {
      "key": "value"
    },
    "version_id": "version_id",
    "type": "prompt",
    "environments": [
      {
        "id": "id",
        "created_at": "2024-01-15T09:30:00Z",
        "name": "name",
        "tag": "default"
      }
    ],
    "created_at": "2024-01-15T09:30:00Z",
    "updated_at": "2024-01-15T09:30:00Z",
    "created_by": {
      "id": "id",
      "email_address": "email_address",
      "full_name": "full_name"
    },
    "last_used_at": "2024-01-15T09:30:00Z",
    "version_logs_count": 1,
    "total_logs_count": 1,
    "inputs": [
      {
        "name": "name"
      }
    ],
    "evaluator_aggregates": [
      {
        "value": 1.1,
        "evaluator_id": "evaluator_id",
        "evaluator_version_id": "evaluator_version_id",
        "created_at": "2024-01-15T09:30:00Z",
        "updated_at": "2024-01-15T09:30:00Z"
      }
    ],
    "raw_file_content": "raw_file_content"
  },
  "created_by": {
    "id": "id",
    "email_address": "email_address",
    "full_name": "full_name"
  }
}

Update an Evaluation Run.

Specify control=true to use this Run as the control Run for the Evaluation. You can cancel a running/pending Run, or mark a Run that uses external or human Evaluators as completed.

Authentication

X-API-KEY string

API Key authentication via header

Path parameters

id string Required

Unique identifier for Evaluation.

run_id string Required

Unique identifier for Run.
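
As a rough illustration of how the two path parameters slot into the endpoint, the sketch below builds the request URL. The base URL is copied from the curl example on this page; the helper name `run_url` is hypothetical, and percent-encoding the IDs is a defensive assumption rather than a documented requirement.

```python
from urllib.parse import quote

# Base URL as it appears in the curl example on this page.
BASE_URL = "https://api.humanloop.com/v5"


def run_url(evaluation_id: str, run_id: str) -> str:
    """Build the PATCH target for one Run of one Evaluation (hypothetical helper)."""
    if not evaluation_id or not run_id:
        raise ValueError("both evaluation_id and run_id are required")
    # safe="" also percent-encodes "/", so an ID cannot change the path shape.
    evaluation_part = quote(evaluation_id, safe="")
    run_part = quote(run_id, safe="")
    return f"{BASE_URL}/evaluations/{evaluation_part}/runs/{run_part}"
```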

Request

This endpoint expects an object.

control boolean Optional

If True, this Run will be used as the control in the Evaluation. Stats for other Runs will be compared to this Run. This will replace any existing control Run.

status enum Optional

Used to set the Run to cancelled or completed. Can only be used if the Run is currently pending or running.

Allowed values: cancelled, completed
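
The request body described above could be assembled along these lines. This is a minimal sketch using only the Python standard library: `build_update_request` is a hypothetical helper, and the set of accepted `status` strings is an assumption taken from the field description ("cancelled or completed"), not from a published enum. The request is only constructed here, not sent.

```python
import json
import urllib.request
from typing import Optional

# Assumption: the two values named in the status field description.
ALLOWED_STATUS = {"cancelled", "completed"}


def build_update_request(
    url: str,
    api_key: str,
    control: Optional[bool] = None,
    status: Optional[str] = None,
) -> urllib.request.Request:
    """Prepare a PATCH request; fields left as None are omitted from the body."""
    body = {}
    if control is not None:
        body["control"] = control
    if status is not None:
        if status not in ALLOWED_STATUS:
            raise ValueError(f"status must be one of {sorted(ALLOWED_STATUS)}")
        body["status"] = status
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        method="PATCH",
        headers={"X-API-KEY": api_key, "Content-Type": "application/json"},
    )
```

Passing the returned object to `urllib.request.urlopen` would perform the call. Both fields are optional, so calling the helper with neither produces the same empty `{}` body as the curl example.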

Response

Successful Response

id string

Unique identifier for the Run.

orchestrated boolean

Whether the Run is orchestrated by Humanloop.

added_at datetime

When the Run was added to the Evaluation.

created_at datetime

When the Run was created.

status enum

The status of the Run.

control boolean

Whether this Run is the control Run for the Evaluation. Stats for other Runs will be displayed in comparison to the control Run.

dataset object

The Dataset used in the Run.

version object

The version used in the Run.

created_by any

The User who created the Run.
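
To make the response shape more concrete, here is one way a client might reduce the returned object to the fields it usually needs. This is a sketch, not an official model: `RunSummary` and `parse_run` are hypothetical names, and treating `added_at`, `dataset`, and `version` as possibly absent is an assumption, since this page does not say which response fields are nullable.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Any, Dict, Optional


def _parse_ts(value: str) -> datetime:
    # datetime.fromisoformat() accepts a trailing "Z" only on Python 3.11+,
    # so rewrite it as an explicit UTC offset first.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


@dataclass
class RunSummary:
    id: str
    status: str
    control: bool
    orchestrated: bool
    created_at: datetime
    added_at: Optional[datetime]
    dataset_id: Optional[str]
    version_id: Optional[str]


def parse_run(payload: Dict[str, Any]) -> RunSummary:
    """Pick the top-level Run fields out of the decoded JSON response."""
    added = payload.get("added_at")
    return RunSummary(
        id=payload["id"],
        status=payload["status"],
        control=payload["control"],
        orchestrated=payload["orchestrated"],
        created_at=_parse_ts(payload["created_at"]),
        added_at=_parse_ts(added) if added else None,
        # Nested objects are reduced to their identifiers (an assumption about
        # which of their fields a caller cares about).
        dataset_id=(payload.get("dataset") or {}).get("id"),
        version_id=(payload.get("version") or {}).get("version_id"),
    )
```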

Errors

422

Unprocessable Entity Error
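
A client will typically want to tell a 422 apart from other failures. The sketch below shows one way to do that with the standard library's `urllib.error.HTTPError`. The helper name `describe_http_error` is hypothetical, and because this page does not document the 422 response body, the body is passed through as raw text rather than parsed.

```python
import urllib.error


def describe_http_error(err: urllib.error.HTTPError) -> str:
    """Turn an HTTPError into a short message, singling out 422 responses."""
    # The error body format is not specified on this page, so keep it as text.
    body = err.read().decode("utf-8", errors="replace")
    if err.code == 422:
        return f"Request was rejected as unprocessable (422): {body}"
    return f"HTTP {err.code}: {body}"
```

In practice this would sit in an `except urllib.error.HTTPError as err:` clause around the `urllib.request.urlopen` call that sends the PATCH request.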

curl -X PATCH https://api.humanloop.com/v5/evaluations/id/runs/run_id \
  -H "X-API-KEY: <apiKey>" \
  -H "Content-Type: application/json" \
  -d '{}'