Product Concepts

Learn the key concepts of the Fiddler product

What is ML Observability?

ML observability is the modern practice of gaining comprehensive insight into your AI application's performance throughout its lifecycle. It goes beyond simple indicators of good and bad performance by empowering all model stakeholders to understand why a model behaves the way it does and how to enhance its performance. ML observability starts with monitoring and alerting on performance issues, then goes deeper, guiding model owners toward the underlying root cause of those issues.

What is LLM Observability?

LLM observability is the practice of evaluating, monitoring, analyzing, and improving Generative AI or LLM-based applications across their lifecycle. Fiddler provides real-time monitoring of safety metrics like toxicity, bias, and PII, and correctness metrics like hallucination, faithfulness, and relevancy.

Project

A project within Fiddler represents one of your organization's distinct AI applications or use cases. It houses all of the model schemas that have been onboarded to Fiddler for AI observability and serves as a jumping-off point for Fiddler's model monitoring and explainability features.

Additionally, Fiddler projects serve as the main organizational structure within the platform and as the level at which authorization is controlled. Users and teams are granted access to view information within Fiddler at the project level.
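If you onboard models with the Fiddler Python client, creating a project is typically the first step. The snippet below is a minimal sketch assuming a 3.x-style client; the URL, token, and project name are placeholders, and exact method names may vary across client versions.

```python
import fiddler as fdl

# Placeholder connection details for your Fiddler deployment (assumptions).
fdl.init(url="https://your_company.fiddler.ai", token="YOUR_ACCESS_TOKEN")

# A project groups the model schemas for one AI application or use case.
project = fdl.Project(name="fraud_detection")
project.create()
```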

Model Schemas

In Fiddler, a model schema is the metadata about a model that is being observed. Model schemas are onboarded to Fiddler so that Fiddler understands the data it is observing. Fiddler does not require the model artifact itself to observe a model's performance (although model artifacts can be uploaded to unlock advanced explainability features). Instead, Fiddler only needs adequate information about the model's schema, or specification, in order to monitor the model.

📘

Working with Model Artifacts

You can upload your model artifacts to Fiddler to unlock high-fidelity explainability for your model. However, it is not required. If you do not wish to upload your artifact but want to explore explainability with Fiddler, we can build a surrogate model on the backend to be used in place of your artifact.
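Conceptually, the schema information Fiddler needs boils down to column names, data types, and the role each column plays (input, output, or target). The sketch below is plain Python, not the Fiddler client API, and the column names are purely illustrative.

```python
import pandas as pd

# A small sample of the data the model sees; in practice this could be a
# slice of the training set (column names here are illustrative).
sample = pd.DataFrame({
    "age":            [34, 61, 45],
    "income":         [52_000, 88_000, 61_000],
    "predicted_risk": [0.12, 0.67, 0.33],
    "defaulted":      [0, 1, 0],
})

# The kind of metadata a model schema captures: each column's type and role.
# No model artifact is needed to describe this.
model_schema = {
    "inputs":  {"age": "int", "income": "int"},
    "outputs": {"predicted_risk": "float"},
    "targets": {"defaulted": "int"},
}
print(sample.dtypes)
print(model_schema)
```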

Environments

Within Fiddler, each model has two environments: pre-production and production. Fiddler uses environments to assign a purpose to the data published to it and to distinguish between:

  1. Non-time-series data (pre-production datasets, e.g., training data)
  2. Time-series data (production data, e.g., inference logs)

Pre-Production Environment

The pre-production environment contains non-time-series chunks of data, called datasets. Datasets are used primarily for point-in-time analysis or as static baselines for comparison against production data.

Production Environment

The production environment contains time-series data, such as production or inference logs, which are the "digital exhaust" coming off each decision a model makes. This time-series data captures the inputs and outputs of each model inference, and it is what Fiddler analyzes and compares against the pre-production data to determine whether the model's performance is degrading.
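To make this concrete, a single production event typically carries the model's inputs, its output, a timestamp for time-series analysis, and an identifier so ground-truth labels can be joined later. The field names below are hypothetical, not a prescribed Fiddler format.

```python
from datetime import datetime, timezone

# Illustrative shape of one inference log record published to production.
inference_event = {
    "event_id":  "txn-00042",                             # used to join labels later
    "timestamp": datetime.now(timezone.utc).isoformat(),  # when the decision was made
    "inputs":    {"age": 34, "income": 52_000},           # what the model saw
    "outputs":   {"predicted_risk": 0.12},                # what the model decided
}
print(inference_event)
```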

Datasets

Datasets within Fiddler are static sets of data that have been uploaded to Fiddler for the purpose of establishing "baselines". A common dataset that is uploaded to Fiddler is the model's training data.

Baselines

Baselines are derived from datasets and are used by Fiddler to understand the data distributions a model is expected to encounter. A baseline in Fiddler is a set of reference data used to compare a model's behavior for monitoring purposes. The default baseline for all monitoring metrics is typically the model's training data. Additional baselines can be added to existing models, derived from other datasets, from historical inferences, or as rolling baselines that compare current data against the distributions of a trailing window of historical inferences.
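For intuition, comparing production data against a baseline often comes down to a drift statistic over matching feature distributions. The sketch below uses the population stability index (PSI) as a generic illustration; it is not Fiddler's internal drift implementation.

```python
import numpy as np

def psi(baseline: np.ndarray, production: np.ndarray, bins: int = 10) -> float:
    """Population Stability Index between two samples of the same feature."""
    # Derive bin edges from the baseline so both samples are bucketed identically.
    edges = np.histogram_bin_edges(baseline, bins=bins)
    base_counts, _ = np.histogram(baseline, bins=edges)
    prod_counts, _ = np.histogram(production, bins=edges)

    # Convert counts to proportions; epsilon avoids division by zero and log(0).
    eps = 1e-6
    base_pct = base_counts / max(base_counts.sum(), 1) + eps
    prod_pct = prod_counts / max(prod_counts.sum(), 1) + eps
    return float(np.sum((prod_pct - base_pct) * np.log(prod_pct / base_pct)))

# Baseline drawn from training data; production sample has drifted slightly.
rng = np.random.default_rng(0)
baseline_sample = rng.normal(loc=0.0, scale=1.0, size=10_000)
production_sample = rng.normal(loc=0.3, scale=1.1, size=5_000)
print(f"PSI: {psi(baseline_sample, production_sample):.3f}")  # roughly, > 0.1 hints at drift
```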

Metrics

Metrics are computations performed on data received by the platform. Fiddler supports custom, user-defined metrics (Custom Metrics) in addition to five out-of-the-box metric types (a small custom-metric sketch follows the list below):

  • Performance
  • Data Drift
  • Data Integrity
  • Statistic
  • Traffic
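As a rough illustration of the difference between an out-of-the-box performance metric and a Custom Metric, the snippet below computes plain accuracy alongside a business-specific value over a hypothetical labeled slice of production data (column names are illustrative, and this is not Fiddler's Custom Metric syntax).

```python
import pandas as pd

# Hypothetical labeled production slice: predictions joined with ground truth.
events = pd.DataFrame({
    "prediction":   [1, 0, 1, 0, 1, 0],
    "ground_truth": [1, 1, 0, 0, 1, 1],
    "loan_amount":  [5_000, 12_000, 3_000, 8_000, 20_000, 7_500],
})

# Performance-style metric: accuracy over the slice.
accuracy = (events["prediction"] == events["ground_truth"]).mean()

# Custom, business-specific metric: total loan value of false negatives
# (applications the model rejected that were actually good).
false_negatives = events[(events["prediction"] == 0) & (events["ground_truth"] == 1)]
missed_value = false_negatives["loan_amount"].sum()

print(f"accuracy={accuracy:.2f}, missed_loan_value={missed_value}")
```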

Alerts

Alerts are user-specified rules that trigger when some condition is met by data received on the Fiddler platform. For any alert rule, you can customize how you'd like to be notified (via email, Slack, or webhooks).
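Conceptually, an alert rule pairs a metric with a comparison and a threshold, evaluated over a time window. The sketch below illustrates that idea in plain Python; it is not how rules are defined in Fiddler, where conditions and notification channels are configured on the platform.

```python
def alert_condition_met(metric_value: float, threshold: float, comparison: str = "greater") -> bool:
    """Return True when the rule's condition holds for the evaluated window."""
    return metric_value > threshold if comparison == "greater" else metric_value < threshold

# e.g., the drift score computed for the last hour of production traffic.
window_drift = 0.23
if alert_condition_met(window_drift, threshold=0.1):
    # In Fiddler, the configured channel (email, Slack, or webhook) would be notified here.
    print("ALERT: data drift above 0.1 for the monitored window")
```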

Segments

Segments are "row filters" specified on your data. They let you compute metrics on a specific subset of the data population (e.g., "People under 50"). You can also set alerts on Segments.
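The row-filter idea maps directly onto a simple predicate over your data. The pandas sketch below is only an analogy (with hypothetical column names), since segments in Fiddler are defined on the platform itself.

```python
import pandas as pd

# Hypothetical production slice with an "age" feature.
events = pd.DataFrame({
    "age":          [34, 61, 45, 52, 29],
    "prediction":   [1, 0, 1, 1, 0],
    "ground_truth": [1, 0, 0, 1, 0],
})

# A segment is effectively a row filter, e.g. "People under 50".
under_50 = events[events["age"] < 50]

# Metrics (and alerts) can then be evaluated on just that segment.
segment_accuracy = (under_50["prediction"] == under_50["ground_truth"]).mean()
print(f"accuracy for age < 50: {segment_accuracy:.2f}")
```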

Enrichments

Users/Teams

Charts

Dashboards

Jobs

Bookmarks

Triggered Alerts

Check the UI Guide to visualize the project architecture in our user interface.

↪ Questions? Join our community Slack to talk to a product expert