Resources
Gpt-4

Evaluating LLM Applications

Peter Hayes

An overview of evaluating LLM applications. The emerging evaluation framework, parallels to traditional software testing and some guidance on best practices.

Evaluating LLM Applications

How to Maximize LLM Performance

Jordan Burgess

An overview of the techniques that OpenAI recommends to get the best performance of your LLM applications. Covering best-practices in prompt engineering, retrieval-augmented generation (RAG) and fine-tuning.

How to Maximize LLM Performance