September 15, 2023
Evaluations improvements
We’ve released a couple of minor useability improvements in the evaluations workflow.
Summary statistics for evaluation runs
When reviewing past runs of evaluations, you can now see summary statistics for each evaluator before clicking into the detail view, allowing for easier comparison between runs.
Re-running evaluations
To enable easier re-running of past evaluations, you can now click the Re-run button in the top-right of the evaluation detail view.
Editor - copy tools
Our Editor environment let’s users incorporate OpenAI function calling into their prompt engineering workflows by defining tools. Tools are made available to the model as functions to call using the same universal JSON schema format.
As part of this process it can be helpful to copy the full JSON definition of the tool for quickly iterating on new versions, or copy and pasting it into code. You can now do this directly from the tool definition in Editor:
Selecting the Copy button adds the full JSON definition of the tool to your clipboard:
Single sign on (SSO)
We’ve added support for SOO to our signup, login and invite flows. By default users can now use their Gmail accounts to access Humanloop.
For our enterprise customers, this also unlocks the ability for us to more easily support their SAML-based single sign-on (SSO) set ups.