Create | Humanloop Docs

curl -X POST https://api.humanloop.com/v4/projects/project_id/evaluations \
     -H "X-API-KEY: <apiKey>" \
     -H "Content-Type: application/json" \
     -d '{
  "config_id": "config_id",
  "evaluator_ids": [
    "evaluator_ids"
  ],
  "dataset_id": "dataset_id"
}'

{
  "id": "id",
  "status": "pending",
  "config": {
    "type": "model",
    "id": "id",
    "model": "model",
    "chat_template": [
      {
        "role": "user"
      }
    ],
    "description": "description",
    "endpoint": "complete",
    "frequency_penalty": 1.1,
    "max_tokens": 1,
    "name": "name",
    "other": {
      "key": "value"
    },
    "presence_penalty": 1.1,
    "prompt_template": "prompt_template",
    "provider": "anthropic",
    "reasoning_effort": "high",
    "response_format": {
      "type": "json_object",
      "json_schema": {
        "key": "value"
      }
    },
    "seed": 1,
    "stop": "stop",
    "temperature": 1.1,
    "template_language": "default",
    "tools": [
      {
        "id": "id",
        "name": "name"
      }
    ],
    "top_p": 1.1,
    "tool_configs": [
      {
        "id": "id",
        "status": "status",
        "name": "name"
      }
    ]
  },
  "created_at": "2024-01-15T09:30:00Z",
  "updated_at": "2024-01-15T09:30:00Z",
  "evaluators": [
    {
      "name": "name",
      "description": "description",
      "arguments_type": "target_free",
      "return_type": "boolean",
      "type": "python",
      "id": "id",
      "created_at": "2024-01-15T09:30:00Z",
      "updated_at": "2024-01-15T09:30:00Z",
      "code": "code",
      "model_config": {
        "id": "id",
        "model": "model"
      },
      "logging_project": {
        "id": "id",
        "name": "name",
        "users": [
          {
            "id": "id",
            "email_address": "email_address"
          }
        ],
        "data_count": 1,
        "feedback_types": [
          {
            "type": "rating"
          }
        ],
        "team_id": "team_id",
        "created_at": "2024-01-15T09:30:00Z",
        "updated_at": "2024-01-15T09:30:00Z"
      }
    }
  ],
  "dataset": {
    "id": "id",
    "name": "name",
    "datapoint_count": 1,
    "created_at": "2024-01-15T09:30:00Z",
    "updated_at": "2024-01-15T09:30:00Z",
    "project_id": "project_id",
    "description": "description"
  },
  "dataset_version_id": "dataset_version_id",
  "dataset_snapshot": {
    "id": "id",
    "name": "name",
    "datapoint_count": 1,
    "created_at": "2024-01-15T09:30:00Z",
    "updated_at": "2024-01-15T09:30:00Z",
    "project_id": "project_id",
    "description": "description"
  },
  "evaluator_aggregates": [
    {
      "model_config_id": "model_config_id",
      "evaluator_id": "evaluator_id",
      "evaluator_version_id": "evaluator_version_id",
      "aggregate_value": 1.1
    }
  ]
}

Create an evaluation.

Authentication

X-API-KEYstring

API Key authentication via header

Path parameters

project_idstringRequired

String ID of project. Starts with pr_.

Request

This endpoint expects an object.

config_idstringRequired

ID of the config to evaluate. Starts with config_.

evaluator_idslist of stringsRequired

IDs of evaluators to run on the dataset. IDs start with evfn_

dataset_idstringRequired

ID of the dataset to use in this evaluation. Starts with evts_.

provider_api_keysobjectOptional

API keys required by each provider to make API calls. The API keys provided here are not stored by Humanloop. If not specified here, Humanloop will fall back to the key saved to your organization. Ensure you provide an API key for the provider for the model config you are evaluating, or have one saved to your organization.

hl_generatedbooleanOptionalDefaults to true

Whether the log generations for this evaluation should be performed by Humanloop. If False, the log generations should be submitted by the user via the API.

namestringOptional

Name of the Evaluation to help identify it.

Response

Successful Response

idstring

Unique ID for the evaluation. Starts with ev_.

statusenum

Status of an evaluation.

configobject

created_atdatetime

updated_atdatetime

evaluatorslist of objects

datasetobject

dataset_version_idstring

dataset_snapshotobject

evaluator_aggregateslist of objects

Errors

422

Evaluations Create Request Unprocessable Entity Error