Stores information about a dataset.

For information on how to customize these objects, see Customizing Your Dataset Schema.

Input Parameters

Type

Default

Description

display_name

str

None

A display name for the dataset.

columns

list

None

A list of fdl.Column objects containing information about the columns.

files

Optional [list]

None

A list of strings pointing to CSV files to use.

dataset_id

Optional [str]

None

The unique identifier for the dataset

**kwargs

Additional arguments to be passed.

columns = [
    fdl.Column(
        name='feature_1',
        data_type=fdl.DataType.FLOAT
    ),
    fdl.Column(
        name='feature_2',
        data_type=fdl.DataType.INTEGER
    ),
    fdl.Column(
        name='feature_3',
        data_type=fdl.DataType.BOOLEAN
    ),
    fdl.Column(
        name='output_column',
        data_type=fdl.DataType.FLOAT
    ),
    fdl.Column(
        name='target_column',
        data_type=fdl.DataType.INTEGER
    )
]

dataset_info = fdl.DatasetInfo(
    display_name='Example Dataset',
    columns=columns
)