Supervised machine learning involves identifying a predictive task, finding data to enable that task, and building a model using that data. Fiddler captures this workflow with project, dataset, and model entities.
A project represents a machine learning task (e.g. predicting house prices, assessing creditworthiness, or detecting fraud).
A project can contain one or more models for the ML task (e.g. LinearRegression-HousePredict, RandomForest-HousePredict).
Create a project by clicking on Projects and then clicking on Add Project.
- Create New Project — A window will pop up where you can enter the project name and click Create. Once the project is created, it will be displayed on the projects page.
You can access your projects from the Projects Page.
A dataset in Fiddler is a data table containing features, model outputs, and a target for machine learning models. Optionally, you can also upload metadata and “decision” columns, which can be used to segment the dataset for analyses, track business decisions, and work as protected attributes in bias-related workflows. For more details refer to Datasets in the Platform Guide.
Once you click on a particular project, you will be able to see if there are any datasets associated with the project. For example, the bank_churn project, in the following screenshot, has the bank_churn dataset. Datasets are uploaded via the Fiddler client.
A model in Fiddler represents a machine learning model. A project will have one or more models for the ML task (e.g. a project to predict house prices might contain LinearRegression-HousePredict and RandomForest-HousePredict). For further details refer to the Models section in the Platform Guide.
At its most basic level, a model in Fiddler is simply a directory that contains model artifacts such as:
- The model file (e.g.
package.py: A wrapper script containing all of the code needed to standardize the execution of the model.
You can collate specific visualizations under the Project Dashboard. After visualizations are created using the Model Analytics tool, you can pin them to the dashboard, which can then be shared with others.
[^1]: Join our community Slack to ask any questions
Updated 4 months ago