Embedding Visualizations for LLM Monitoring
Last updated
Last updated
© 2024 Fiddler Labs, Inc.
Embedding visualization is a powerful technique used to understand and interpret complex relationships in high-dimensional data. Reducing the dimensionality of custom features into a 2D or 3D space makes it easier to identify patterns, clusters, and outliers.
In Fiddler, high-dimensional data like embeddings and vectors are ingested as a Custom feature.
Our goal in this document is to show you how you can visualize and interact with these custom features in 3D to do analysis and expose patterns.
We utilize the UMAP (Uniform Manifold Approximation and Projection) technique for embedding visualizations. UMAP is a dimensionality reduction technique that is particularly good at preserving the local structure of the data, making it ideal for visualizing embeddings. We reduce the high-dimensional embeddings to a 3D space.
UMAP is supported for both Text and Image embeddings in a Custom feature and you can follow this guide on how to apply a UMAP visualization to your application data.
UMAP embedding visualizations are extremely helpful in understanding common themes and topics present in the data corpus for generative AI applications. When evaluating prompts and responses, it is paramount to see which concept clusters are emerging and which clusters are exhibiting the most problems. By further coloring these clusters with a variety of LLM and GenAI correctness and safety metrics, users can identify the clusters with the most issues.
📘 To create an embedding visualization chart, follow the UI Guide here.