LLM Evaluation - Compare Outputs

This guide explains how to compare outputs from different LLM models, such as GPT-3.5 and Claude, to determine the most suitable choice for your language model application.

Click this link to get started using Google Colab →

Or download the notebook directly from GitHub.

❓ Questions? Talk to a product expert or request a demo.

💡 Need help? Contact us at [email protected].

PreviousEvals SDK Advanced Guide NextLLM Evaluation - Prompt Specs Quick Start

Last updated 7 days ago

Was this helpful?