client.get_feature_importance

Get global feature importance for a model over a dataset or a slice.

Input ParameterTypeDefaultDescription
project_idstrNoneA unique identifier for the project.
model_idstrNoneA unique identifier for the model.
data_sourceUnion[fdl.DatasetDataSource, fdl.SqlSliceQueryDataSource]NoneType of data source for the input dataset to compute feature importance on (DatasetDataSource or SqlSliceQueryDataSource)
num_iterationsOptional[int]10000The maximum number of ablated model inferences per feature.
num_refsOptional[int]10000Number of reference points used in the explanation.
ci_levelOptional[float]0.95The confidence level (between 0 and 1).
overwrite_cacheOptional[bool]FalseWhether to overwrite the feature importance cached values or not
PROJECT_ID = 'example_project'
MODEL_ID = 'example_model'
DATASET_ID = 'example_dataset'


# Feature Importance - Dataset data source
feature_importance = client.get_feature_importance(
    project_id=PROJECT_ID,
    model_id=MODEL_ID,
    data_source=fdl.DatasetDataSource(dataset_id=DATASET_ID, num_samples=200),
    num_iterations=300,
    num_refs=200,
    ci_level=0.90,
)

# Feature Importance - Slice Query data source
query = f'SELECT * FROM {DATASET_ID}.{MODEL_ID} WHERE CreditScore > 700'
feature_importance = client.get_feature_importance(
    project_id=PROJECT_ID,
    model_id=MODEL_ID,
    data_source=fdl.SqlSliceQueryDataSource(query=query, num_samples=80),
    num_iterations=300,
    num_refs=200,
    ci_level=0.90,
)
Return TypeDescription
tupleA named tuple with the feature impact results.