Evaluating LLM Applications
Peter Hayes
An overview of evaluating LLM applications. The emerging evaluation framework, parallels to traditional software testing and some guidance on best practices.
An overview of evaluating LLM applications. The emerging evaluation framework, parallels to traditional software testing and some guidance on best practices.
An overview of the techniques that OpenAI recommends to get the best performance of your LLM applications. Covering best-practices in prompt engineering, retrieval-augmented generation (RAG) and fine-tuning.