For this quickstart, we’re going to evaluate Outreach Agent, which is designed to compose personalized outbound messages to potential customers. The Agent uses a Tool to research information about the lead before writing a message.
We will assess the quality of the Agent using a Dataset and Evaluators.
Create a Humanloop Account
If you haven’t already, create an account or log in to Humanloop
Add an OpenAI API Key
If you’re the first person in your organization, you’ll need to add an API key to a model provider.
Using the Prompt Editor will use your OpenAI credits in the same way that the OpenAI playground does. Keep your API keys for Humanloop and the model providers private.
In this quickstart, we will use a pre-configured Agent from the Humanloop Library.
Navigate to the Library by clicking the Library button in the upper-left corner. Select the Outreach Agent and click the Clone to Workspace button in the upper-right corner.

This will create an Outreach Agent folder in your workspace. Inside the folder, you’ll find:
The Outreach Agent looks up information about the lead on Hacker News and composes an outbound message to them.
Before we kick off the first evaluation, run the Agent in the Editor to get a feel for how it works:

Evaluations are an efficient way to improve your Agent iteratively. You can test versions of the Agent against a Dataset and see how changing the Agent’s configuration impacts the performance.
To test the Outreach Agent, navigate to the Evals tab and click on the + Evaluation button.
Create a new Run by clicking on the + Run button. Then, follow these steps:
The first two Evaluators will check if the message is friendly and if the Tool was used. The Message Length Evaluator will show the number of words in the output, providing a baseline value for all further evaluations.
Click Save. Humanloop will start generating Logs for the Evaluation.

The evaluation is most useful when you iteratively improve the Agent, as it allows you to compare several versions side by side. Follow our tutorial where we add the Google Search Tool to this Agent and compare the results with this version.
In this quick guide, you’ve created an Agent that can help your organization compose personalized messages to your leads. You’ve evaluated the initial version to see how this Agent performs across multiple inputs.
Now that you’ve successfully run your first Eval, you can explore how you can make your Agent more powerful: