- Data Collection: Organized storage of model input/output data
- Environment Separation: Distinct handling of production vs. pre-production data
- Baseline Source: Reference data for drift detection and monitoring
- Analysis Support: Data download and statistical analysis capabilities
- Model Integration: Tight coupling with specific models for context
- Automatic Creation: Created by Model.publish() operations
- Model-Scoped: Each dataset belongs to exactly one model
- Named Collections: Unique names within a model for identification
- Row Tracking: Automatic counting of data records
- Environment Typed: Classified as production or pre-production data
Example
Datasets cannot be created directly through the Dataset class. They are
automatically created when data is published to models using Model.publish().
Use the Dataset class for retrieval, listing, and analysis operations.
Parameters
Dataset name, must be unique within the model.
Should be descriptive of the data contents or purpose.
Identifier of the model this dataset belongs to.
Can be provided as UUID object or string representation.
Identifier of the parent project.
Can be provided as UUID object or string representation.
Example
Datasets are typically retrieved using Dataset.get(), Dataset.from_name(),
or Dataset.list() rather than created directly. Direct creation is mainly
used internally by the Fiddler client.
classmethod get()
Retrieve a dataset by its unique identifier. Fetches a dataset from the Fiddler platform using its UUID. This is the most direct way to retrieve a dataset when you know its ID.Parameters
The unique identifier (UUID) of the dataset to retrieve.
Can be provided as a UUID object or string representation.
Returns
The dataset instance with all metadata and row count information.
Raises
- NotFound – If no dataset exists with the specified ID.
- ApiError – If there’s an error communicating with the Fiddler API.
Example
This method makes an API call to fetch the latest dataset state from the server.
The returned dataset instance reflects the current state in Fiddler.
classmethod from_name()
Retrieve a dataset by name within a specific model. Finds and returns a dataset using its name and model context. Dataset names are unique within a model, making this a reliable lookup method when you know both the dataset name and model ID.Parameters
The name of the dataset to retrieve. Dataset names are unique
within a model and are case-sensitive.
The identifier of the model containing the dataset.
Can be provided as UUID object or string representation.
Returns
The dataset instance matching the specified name and model.
Raises
- NotFound – If no dataset exists with the specified name in the given model.
- ApiError – If there’s an error communicating with the Fiddler API.
Example
Dataset names are case-sensitive and must match exactly. This method
is useful when you know the dataset name from configuration or when
working with named datasets created during model training workflows.
classmethod list()
List all pre-production datasets for a specific model. Retrieves all datasets that have been published to a model in the pre-production environment. These datasets are typically used for baselines, training data analysis, and validation purposes.Parameters
The identifier of the model to list datasets for.
Can be provided as UUID object or string representation.
Yields
Dataset – Dataset instances for all pre-production datasets in the model.
Raises
ApiError – If there’s an error communicating with the Fiddler API.Returns
Iterator[Dataset]
Example
This method returns an iterator for memory efficiency and only includes
pre-production datasets. Production data is handled separately through
the monitoring system. Convert to a list with list(Dataset.list(…))
if you need to iterate multiple times.