LLM Evals Done Right - Lessons from Bryan Bischof of Hex AI
I recently sat down with Bryan Bischof, Head of AI at Hex, to understand how Hex approaches evals and what lessons other teams can take from their experience.
Raza Habib
Building Reliable Agents with Ironclad
Over 50% of contracts at Ironclad are now negotiated by AI. I sat down with Cai Gogwilt, Co-founder and Chief Architect, to understand what lessons product leaders can take from their success in building AI agents.
Raza Habib
Foundation Models: Explained
Explore what foundation models are, understand how they work, and learn why they’re different from traditional AI models.
Conor Kelly
The EU AI Act: Guide for Developers
If you’re building an AI product, this post will help you understand whether the EU AI Act will affect you, what you need to do to comply, and what the regulation likely means for the wider tech ecosystem.
Raza Habib
Retrieval Augmented Generation (RAG): Explained
Retrieval augmented generation is the leading AI framework for customizing and improving LLM-generated responses. Learn how it works and why it's important in our guide.
Conor Kelly
Evaluating LLM Applications
An overview of evaluating LLM applications: the emerging evaluation framework, parallels to traditional software testing, and some guidance on best practices.
Peter Hayes
Humanloop is SOC 2 Type II certified
Announcing that Humanloop is now certified as SOC 2 Type II compliant.
Raza Habib
How to Build the Right Team for Generative AI
Generative AI and Large Language Models (LLMs) are new to most companies. If you’re an engineering leader, it can be hard to know what skills and types of people are needed for AI projects. In this post I’d like to share what we’ve learned about the skills needed to build a great AI team.
Raza Habib
How to Maximize LLM Performance
An overview of the techniques that OpenAI recommends for getting the best performance from your LLM applications, covering best practices in prompt engineering, retrieval-augmented generation (RAG), and fine-tuning.
Jordan Burgess
OpenAI Fine-tuning: GPT-3.5-Turbo
A brief overview of fine-tuning: why it’s significant, how it works on OpenAI, and how Humanloop can help you fine-tune your own custom models.
Conor Kelly