Chat

Get a chat response by providing details of the model configuration in the request.
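For example, a minimal request made with Python's requests library might look like the sketch below. The endpoint path and the model_config fields shown are illustrative assumptions, not part of this reference; substitute your own values.

```python
# Minimal sketch of a Chat request. The endpoint path and the fields inside
# model_config are assumptions for illustration only.
import requests

response = requests.post(
    "https://api.humanloop.com/v4/chat",   # assumed endpoint path
    headers={"X-API-KEY": "YOUR_API_KEY"},
    json={
        "stream": False,
        "project": "my-project",           # created if it does not already exist
        "messages": [
            {"role": "user", "content": "What is the capital of France?"},
        ],
        "model_config": {                  # illustrative fields only
            "model": "gpt-4o",
            "temperature": 0.7,
        },
    },
)
response.raise_for_status()
print(response.json()["data"][0])
```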

Headers

X-API-KEY string Required

Request

This endpoint expects an object.
stream false Required

If true, tokens will be sent as data-only server-sent events. If num_samples > 1, samples are streamed back independently.
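If your deployment accepts stream: true on this route (some APIs expose a separate streaming route instead), consuming the data-only server-sent events might look like the following sketch; the endpoint path, model name, and event payload shape are assumptions.

```python
# Sketch of consuming data-only server-sent events ("data: {...}" lines).
# Endpoint path, model name, and event shape are assumptions.
import json
import requests

with requests.post(
    "https://api.humanloop.com/v4/chat",   # assumed endpoint path
    headers={"X-API-KEY": "YOUR_API_KEY"},
    json={
        "stream": True,
        "messages": [{"role": "user", "content": "Tell me a short joke."}],
        "model_config": {"model": "gpt-4o"},   # illustrative
    },
    stream=True,
) as resp:
    for line in resp.iter_lines():
        if line.startswith(b"data: "):
            event = json.loads(line[len(b"data: "):])
            print(event)
```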

messages list of objects Required
The messages passed to the provider chat endpoint.
model_config object Required
The model configuration used to create a chat response.
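As a sketch, a model_config might be shaped as follows; the field names (model, provider, temperature, max_tokens) are assumptions, not the authoritative schema.

```python
# Illustrative model_config; field names are assumptions, so consult the
# model_config object's own reference for the authoritative list.
model_config = {
    "model": "gpt-4o",      # provider model name (example value)
    "provider": "openai",   # assumed field
    "temperature": 0.7,
    "max_tokens": 256,
}
```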
project string Optional
Unique project name. If no project exists with this name, a new project will be created.
project_id string Optional

Unique ID of a project to associate with the log. Either this or project must be provided.

session_id string Optional
ID of the session to associate the datapoint with.
session_reference_id string Optional

A unique string identifying the session to associate the datapoint to. Allows you to log multiple datapoints to a session (using an ID kept by your internal systems) by passing the same session_reference_id in subsequent log requests. Specify at most one of this or session_id.
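For example, two requests can be grouped into one session by reusing an ID from your own system; the call_chat helper below is a hypothetical wrapper around the POST shown in the first example.

```python
import requests

def call_chat(body: dict) -> dict:
    """Hypothetical helper wrapping the Chat POST shown in the first example."""
    resp = requests.post(
        "https://api.humanloop.com/v4/chat",   # assumed endpoint path
        headers={"X-API-KEY": "YOUR_API_KEY"},
        json=body,
    )
    resp.raise_for_status()
    return resp.json()

SESSION_REF = "checkout-flow-42"   # an ID kept by your internal systems
CONFIG = {"model": "gpt-4o"}       # illustrative model_config

# Both datapoints land in the same session because they share the reference ID.
call_chat({"messages": [{"role": "user", "content": "First step"}],
           "model_config": CONFIG, "session_reference_id": SESSION_REF})
call_chat({"messages": [{"role": "user", "content": "Second step"}],
           "model_config": CONFIG, "session_reference_id": SESSION_REF})
```

The same pattern applies to parent_reference_id for nesting datapoints, provided the parent was logged in an earlier request.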

parent_id string Optional
ID of the parent datapoint in the session.
parent_reference_id string Optional

A unique string identifying the previously-logged parent datapoint in a session. Allows you to log nested datapoints with your internal system IDs by passing the same reference ID as parent_id in a prior log request. Specify at most one of this or parent_id. Note that this cannot refer to a datapoint being logged in the same request.

inputs map from strings to any Optional
The inputs passed to the prompt template.
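As a sketch, inputs fill placeholders in the prompt template carried by the model config; the chat_template field name and the {{variable}} placeholder syntax below are assumptions (see template_language).

```python
# Sketch of template inputs. The chat_template field name and the {{topic}}
# placeholder syntax are assumptions about how the template is defined.
body = {
    "model_config": {
        "model": "gpt-4o",
        "chat_template": [
            {"role": "system", "content": "You answer questions about {{topic}}."},
        ],
    },
    "messages": [{"role": "user", "content": "Give me a two-sentence overview."}],
    "inputs": {"topic": "photosynthesis"},   # substituted into the template
}
```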
source string Optional
Identifies where the model was called from.
metadata map from strings to any Optional
Any additional metadata to record.
save boolean Optional Defaults to true

Whether the request/response payloads will be stored on Humanloop.

source_datapoint_id string Optional
ID of the source datapoint if this is a log derived from a datapoint in a dataset.
provider_api_keys object Optional
API keys required by each provider to make API calls. The API keys provided here are not stored by Humanloop. If not specified here, Humanloop will fall back to the key saved to your organization.
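For example, extending the request body from the earlier sketch (the provider key names below are assumptions; use the identifiers your providers expect):

```python
# Illustrative provider_api_keys payload; per the description above, these keys
# are passed through per request and are not stored by Humanloop.
body["provider_api_keys"] = {
    "openai": "sk-...",          # assumed provider key name
    "anthropic": "sk-ant-...",   # assumed provider key name
}
```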
num_samples integer Optional Defaults to 1
The number of generations.
template_language enum Optional
The template language to use for rendering the template.
user string Optional

End-user ID passed through to provider call.

return_inputs boolean Optional Defaults to true
Whether to return the inputs in the response. If false, the response will contain an empty dictionary under inputs. This is useful for reducing the size of the response. Defaults to true.
tool_choice"none" or "auto" or "required" or objectOptional

Controls how the model uses tools. The following options are supported: 'none' forces the model not to call a tool; this is the default when no tools are provided as part of the model config. 'auto' lets the model decide whether to call one of the provided tools; this is the default when tools are provided as part of the model config. Providing {'type': 'function', 'function': {'name': <TOOL_NAME>}} forces the model to use the named function.
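The three accepted forms, written as Python values against the request body from the earlier sketch (the tool name get_weather is hypothetical):

```python
# The three forms described above; "get_weather" is a hypothetical tool name.
body["tool_choice"] = "none"   # never call a tool
body["tool_choice"] = "auto"   # let the model decide whether to call a tool
body["tool_choice"] = {        # force the named function to be called
    "type": "function",
    "function": {"name": "get_weather"},
}
```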

response_format object Optional

The format of the response. Only type json_object is currently supported for chat.
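For example:

```python
# Request a JSON response; per the description above, "json_object" is the
# only type currently supported for chat.
body["response_format"] = {"type": "json_object"}
```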

reasoning_effort enum or integer Optional

Guidance on how many reasoning tokens the model should generate before creating a response to the prompt. OpenAI reasoning models (o1, o3-mini) expect an OpenAIReasoningEffort enum. Anthropic reasoning models expect an integer, which specifies the maximum token budget.
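As a sketch of the two shapes (the enum value "medium" is an assumed OpenAIReasoningEffort value; the integer budget is an example figure):

```python
# reasoning_effort takes a different shape per provider, as described above.
body["reasoning_effort"] = "medium"   # OpenAI reasoning models (o1, o3-mini)
body["reasoning_effort"] = 4096       # Anthropic reasoning models: max token budget
```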

seed integer Optional Deprecated

Deprecated field: the seed is instead set as part of the request.config object.

tool_call string or map from strings to strings Optional Deprecated

NB: deprecated in favour of tool_choice. Controls how the model uses tools. The following options are supported: 'none' forces the model not to call a tool; this is the default when no tools are provided as part of the model config. 'auto' lets the model decide whether to call one of the provided tools; this is the default when tools are provided as part of the model config. Providing {'name': <TOOL_NAME>} forces the model to use the provided tool of the same name.

Response

data list of objects
Array containing the chat responses.
provider_responses list of any
The raw responses returned by the model provider.
project_id string or null
Unique identifier of the parent project. Will not be provided if the request was made without a project name or ID.
num_samples integer or null Defaults to 1
The number of chat responses.
logprobs integer or null

Include the log probabilities of the top n tokens in the provider_response.

suffix string or null
The suffix that comes after a completion of inserted text. Useful for completions that act like inserts.
user string or null

End-user ID passed through to provider call.

usage object or null
Counts of the number of tokens used and related stats.
metadata map from strings to any or null
Any additional metadata to record.
provider_request map from strings to any or null
The raw request sent to the model provider.
session_id string or null
ID of the session if it belongs to one.
tool_choice"none" or "auto" or "required" or object or null

Controls how the model uses tools. The following options are supported: 'none' forces the model not to call a tool; this is the default when no tools are provided as part of the model config. 'auto' lets the model decide whether to call one of the provided tools; this is the default when tools are provided as part of the model config. Providing {'type': 'function', 'function': {'name': <TOOL_NAME>}} forces the model to use the named function.
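Continuing the first example, a sketch of reading the response fields documented above; the structure of each entry in data is not specified here, so the entries are printed raw.

```python
# Sketch of reading the documented response fields, continuing from the first
# example above. The shape of each entry in `data` is not documented here.
result = response.json()
print(result.get("usage"))        # token counts and related stats, or null
print(result.get("session_id"))   # session ID, if the log belongs to a session
for sample in result["data"]:     # one chat response per generated sample
    print(sample)
```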

Errors