Capture user feedback

Collect feedback from your users to improve your AI product.

In this tutorial, we’ll show how you can gather valuable insights from your users to evaluate and improve your AI product.

We’ll deploy a simple chat app that allows users to interact with an AI model. Later, we’ll modify the source code to capture user feedback and show how these insights are used to improve the AI product.

Prerequisites

Capture user feedback

You can grab the source code used in this tutorial here: hl-chatgpt-clone-typescript

1

Clone and start a chat app server

$ git clone https://github.com/humanloop/hl-chatgpt-clone-typescript
> cd hl-chatgpt-clone-typescript
> # add Humanloop API key
> touch .env.local
> echo HUMANLOOP_API_KEY=YOUR_API_KEY >> .env.local
> # optionally add OpenAI key, if you haven't already in Humanloop app
> echo OPENAI_API_KEY=YOUR_API_KEY >> .env.local
> # run the app
> bun install
> bun run dev
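If a key is missing from .env.local, the failure may only show up later as an opaque request error. A minimal sketch of a startup check, assuming a hypothetical helper named requireEnv (it is not part of the tutorial repository) that receives the environment as a plain object:

```typescript
// Hypothetical helper, not part of the tutorial repository.
// The environment is passed in explicitly so the function is easy to test.
type Env = Record<string, string | undefined>;

function requireEnv(env: Env, name: string): string {
  const value = env[name];
  if (value === undefined || value.trim() === "") {
    throw new Error(`Missing environment variable: ${name}. Add it to .env.local.`);
  }
  return value;
}
```

In the app, this could be called as requireEnv(process.env, "HUMANLOOP_API_KEY") before the Humanloop client is created.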
2

Use the chat app

Open the chat app in your browser and start chatting with the AI model.

Chat Agent

Every time the user presses the Send button, Humanloop receives the request and calls the AI model. The response from the model is then stored as a Log.

Let’s check the api/chat/route.ts file to see how it works.

  • The path parameter is the path to the Prompt in the Humanloop workspace. If the Prompt doesn’t exist, it will be created.
  • The prompt parameter is the configuration of the Prompt. In this case we manage our Prompt in code; if the configuration of the Prompt changes, a new version of the Prompt will automatically be created on Humanloop. Prompts can alternatively be managed directly on Humanloop.
  • The messages parameter is the list of all messages exchanged between the Model and the User.

To learn more about calling Prompts with the Humanloop SDK, see the Prompt Call API reference.

api/chat/route.ts
const response = await humanloop.prompts.callStream({
  // if Prompt doesn't exist, it will be created
  path: "chatgpt-clone-tutorial/customer-support-agent",
  prompt: {
    model: "gpt-4",
    template: [
      {
        role: "system",
        content: "You are a helpful assistant.",
      },
    ],
  },
  // messages is a list of objects: [{role: string, content: string}, ...].
  // Role is either "user", "assistant", "system", or "tool".
  messages,
  providerApiKeys: {
    // OpenAI API key, if you haven't already set it in Humanloop app
    openai: process.env.OPENAI_API_KEY,
  },
});
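
The messages list passed above has to contain the whole conversation so far, not just the latest input. As an illustration only, here is a sketch of how a client might build that list; the ChatMessage type and the appendUserMessage helper are hypothetical names and may differ from what the tutorial repository actually does:

```typescript
// Message shape as described in the code comments above: a role plus text content.
type Role = "user" | "assistant" | "system" | "tool";

interface ChatMessage {
  role: Role;
  content: string;
}

// Hypothetical helper: returns a new history with the user's input appended.
// The previous array is left untouched, which suits React state updates.
function appendUserMessage(
  history: readonly ChatMessage[],
  input: string
): ChatMessage[] {
  const content = input.trim();
  if (content === "") {
    return [...history]; // ignore empty submissions
  }
  return [...history, { role: "user", content }];
}
```

The returned array is what would be sent to the chat route as messages each time the user presses Send.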
3

Review the logs in Humanloop

After chatting with the AI model, go to the Humanloop app and review the logs. Click on the chatgpt-clone-tutorial/customer-support-agent Prompt, then click on the Logs tab at the top of the page.

You can see that all the interactions with the AI model are logged here.

The code will generate a new Prompt chatgpt-clone-tutorial/customer-support-agent in the Humanloop app. To change the path, modify the variable PROMPT_HUMANLOOP_PATH in the api/chat/route.ts file.

Chat Agent
4

Modify the code to capture user feedback

Now, let’s modify the code to start getting user feedback! Go back to the code editor and uncomment lines 174-193 in the page.tsx file.

This snippet adds 👍 and 👎 buttons that users can press to give feedback on the model’s responses.

return (
  <div className="flex w-full pb-4 mb-4 border-b border-gray-300">
    <div className="min-w-[80px] uppercase text-xs text-gray-500 leading-tight pt-1">
      {message.role}
    </div>
    {message.content ? (
      <div className="flex w-full pl-4 whitespace-pre-line">
        {message.content as string}
      </div>
    ) : (
      <div className="flex w-full pl-4 whitespace-pre-line">...</div>
    )}
    {logId && (
      <div className="debug flex justify-end gap-4 max-h-8">
        <button
          className="px-3 font-medium text-gray-500 uppercase border border-gray-300 rounded dark:border-gray-100 dark:text-gray-200 hover:border-blue-500 hover:text-blue-500"
          onClick={() => {
            captureUserFeedback(logId, "good");
          }}
        >
          👍
        </button>
        <button
          className="px-3 font-medium text-gray-500 uppercase border border-gray-300 rounded dark:border-gray-100 dark:text-gray-200 hover:border-blue-500 hover:text-blue-500"
          onClick={() => {
            captureUserFeedback(logId, "bad");
          }}
        >
          👎
        </button>
      </div>
    )}
  </div>
);
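
The buttons call captureUserFeedback, whose body is not shown here. One plausible shape for it is a POST request to the app's feedback route. The sketch below only builds that request; the route URL, the JSON field names, and the buildFeedbackRequest helper are all assumptions for illustration, not the repository's actual code:

```typescript
type Judgment = "good" | "bad";

interface FeedbackRequest {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
}

// Hypothetical helper: describes the HTTP request that sends one rating
// for one Log to the app's own feedback route.
function buildFeedbackRequest(logId: string, judgment: Judgment): FeedbackRequest {
  return {
    url: "/api/feedback", // assumed route, matching api/feedback/route.ts
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ logId, judgment }), // assumed field names
    },
  };
}
```

A click handler could then destructure the result and call fetch(url, init). Keeping request construction separate from the network call makes the payload easy to check in a unit test.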

To understand how the feedback is captured and sent to Humanloop, let’s check the api/feedback/route.ts file.

We use the Humanloop TypeScript SDK to make calls to Humanloop. To attach user feedback, we only need three parameters:

  • parentId is the ID of the Log to which we want to attach feedback. The page.tsx file stores all Log IDs for model responses.
  • path is the path to the Evaluator. In this example, we’re using an example ‘rating’ Evaluator.
  • judgment is the user feedback.
api/feedback/route.ts
const response = await humanloop.evaluators.log({
  // Pass the `logId` of the Prompt Log to record feedback against.
  parentId: logId,
  // Here, we're recording feedback against an example "rating" Evaluator,
  // which is of type `select` and has two possible options: "good" and "bad".
  path: "Example Evaluators/Human/rating",
  // Alternatively, we advise specifying the Evaluator by id. This is more robust and less error-prone.
  // versionId: "ev_9WiSw2VYWjAb22duuQ",
  judgment: judgment, // user feedback
});
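
Because the rating Evaluator accepts only "good" and "bad", it can help to validate the incoming request body before forwarding it. The following is a sketch under stated assumptions: the body field names (logId, judgment) and the toEvaluatorLogRequest helper are hypothetical, and only the parentId, path, and judgment parameter names come from the call above.

```typescript
const RATING_EVALUATOR_PATH = "Example Evaluators/Human/rating";

type Rating = "good" | "bad";

interface EvaluatorLogRequest {
  parentId: string;
  path: string;
  judgment: Rating;
}

// Hypothetical helper: turns an untrusted JSON body into the three
// parameters used in the evaluators.log call, or throws on bad input.
function toEvaluatorLogRequest(body: unknown): EvaluatorLogRequest {
  if (typeof body !== "object" || body === null) {
    throw new Error("Request body must be a JSON object");
  }
  const { logId, judgment } = body as { logId?: unknown; judgment?: unknown };
  if (typeof logId !== "string" || logId === "") {
    throw new Error("logId must be a non-empty string");
  }
  if (judgment !== "good" && judgment !== "bad") {
    throw new Error('judgment must be "good" or "bad"');
  }
  return { parentId: logId, path: RATING_EVALUATOR_PATH, judgment };
}
```

The returned object could then be passed straight to humanloop.evaluators.log, so malformed ratings are rejected before they reach Humanloop.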
5

Capture user feedback

Refresh the page in your browser and give 👍 or 👎 to the model’s responses.

Chat Agent
6

Review the logs in Humanloop

With the user feedback captured, go back to the Humanloop app and review the logs. On the Performance tab, you can see all Evaluators and their values.

The user feedback is captured in the rating Evaluator (‘good’ for 👍 and ‘bad’ for 👎).

Chat Agent

Use the logs to improve your AI product

After you collect enough data, you can leverage the user feedback to improve your AI product.

Navigate back to the Logs view and filter all Logs that have a ‘bad’ rating to review the model’s responses that need improvement.

Logs with filter applied

Click on a Log, then click the Editor -> button in the top right corner to open the Prompt Editor. In the Prompt Editor, you can change the instructions and the model’s parameters to improve the model’s performance.

Once you’re happy with the changes, deploy the new version of the Prompt.

Prompt Editor

When users start interacting with the new version, compare the “good” to “bad” ratio to see if the changes have improved your users’ experience.
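
To make that comparison concrete, here is a small sketch that computes the share of "good" ratings per Prompt version, that is, good / (good + bad). It assumes the ratings have already been collected into a local array; the RatedLog shape and the goodShareByVersion helper are hypothetical and are not part of the Humanloop SDK.

```typescript
// Hypothetical record: one user rating attached to a Log of a given version.
interface RatedLog {
  versionId: string;
  rating: "good" | "bad";
}

// Returns, for each version, the fraction of its ratings that are "good".
function goodShareByVersion(logs: readonly RatedLog[]): Record<string, number> {
  const counts: Record<string, { good: number; total: number }> = {};
  for (const log of logs) {
    const entry = counts[log.versionId] ?? { good: 0, total: 0 };
    entry.total += 1;
    if (log.rating === "good") {
      entry.good += 1;
    }
    counts[log.versionId] = entry;
  }
  const shares: Record<string, number> = {};
  for (const versionId of Object.keys(counts)) {
    shares[versionId] = counts[versionId].good / counts[versionId].total;
  }
  return shares;
}
```

A higher share for the new version suggests the change helped, although with only a handful of ratings the difference may simply be noise.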

Next steps

Now that you’ve successfully captured user feedback, you can explore more ways to improve your AI product.
