Dataset Splits
A dataset version can have multiple splits, usually separating training, validation and test data.
Get all splits
splits = client.get_dataset_splits(my_dataset.id, new_dataset_version.id)
Parameter | Type | Description |
---|---|---|
dataset_id | str | The dataset id |
dataset_version_id | str | The dataset version id |
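Because `get_dataset_splits` returns every split of a version, finding one by name takes a small filter. The following is a minimal sketch, assuming the call returns an iterable of split objects that expose a `name` attribute. `find_split_by_name` is a hypothetical helper and not part of the client.

```python
from typing import Any, Optional


def find_split_by_name(client: Any, dataset_id: str, dataset_version_id: str,
                       name: str) -> Optional[Any]:
    """Return the first split called `name`, or None if there is none.

    Hypothetical helper: assumes get_dataset_splits returns an iterable of
    split objects that expose a `name` attribute.
    """
    splits = client.get_dataset_splits(dataset_id, dataset_version_id)
    for split in splits:
        if split.name == name:
            return split
    return None
```

With the objects used on this page, a call might look like `find_split_by_name(client, my_dataset.id, new_dataset_version.id, "train")`.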
Create a split
new_split = DatasetSplit(
    name="train",
    description="training data for our model to learn from"
)
new_split = client.create_dataset_split(my_dataset.id, new_dataset_version.id, new_split)
Parameter | Type | Description |
---|---|---|
dataset_id | str | The dataset id |
dataset_version_id | str | The dataset version id |
split | DatasetSplit | The split object |
Dataset split properties in more detail:
Property | Type | Description |
---|---|---|
id | str | Unique id for the dataset split |
created_at | str | The creation date |
updated_at | str | Last updated date |
name | str | The dataset split name |
description | str | The dataset split description |
user_id | str | The unique id of the dataset split creator |
version_id | str | The unique id of the dataset version the split belongs to |
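Since a version usually carries training, validation and test splits, creating all three in one pass can be convenient. The following is a minimal sketch, not part of the client: `create_default_splits` and `DEFAULT_SPLITS` are hypothetical names, the "valid" and "test" names and descriptions are assumptions, and the `DatasetSplit` class is passed in as `split_cls` so the example does not guess at an import path. It relies on `create_dataset_split` returning the created split, as in the call above.

```python
from typing import Any, Callable, Dict

# Assumed split names and descriptions; adjust them to your project.
DEFAULT_SPLITS: Dict[str, str] = {
    "train": "training data for our model to learn from",
    "valid": "validation data for tuning and model selection",
    "test": "held-out data for the final evaluation",
}


def create_default_splits(client: Any, split_cls: Callable[..., Any],
                          dataset_id: str, dataset_version_id: str) -> Dict[str, Any]:
    """Create one split per entry in DEFAULT_SPLITS and return them by name.

    Hypothetical helper: `split_cls` is expected to be the DatasetSplit class.
    """
    created: Dict[str, Any] = {}
    for name, description in DEFAULT_SPLITS.items():
        split = split_cls(name=name, description=description)
        # Keep whatever the client returns for the newly created split.
        created[name] = client.create_dataset_split(
            dataset_id, dataset_version_id, split
        )
    return created
```

A call might look like `splits = create_default_splits(client, DatasetSplit, my_dataset.id, new_dataset_version.id)`, after which `splits["train"]` holds the created training split.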
Get a split
my_split = client.get_dataset_split(my_dataset.id, new_dataset_version.id, new_split.id)
Parameter | Type | Description |
---|---|---|
dataset_id | str | The dataset id |
dataset_version_id | str | The dataset version id |
split_id | str | The split id |
Update a split
my_split.description = "Training data"
client.update_dataset_split(my_dataset.id, new_dataset_version.id, my_split)
Parameter | Type | Description |
---|---|---|
dataset_id | str | The dataset id |
dataset_version_id | str | The dataset version id |
split | DatasetSplit | The split object |
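The get and update calls combine naturally when only the split id is at hand. The following is a minimal sketch under that assumption; `set_split_description` is a hypothetical helper, and it returns the locally modified object because the return value of `update_dataset_split` is not documented here.

```python
from typing import Any


def set_split_description(client: Any, dataset_id: str, dataset_version_id: str,
                          split_id: str, description: str) -> Any:
    """Fetch a split, change its description, and send the update.

    Hypothetical helper built only on get_dataset_split and
    update_dataset_split as shown on this page.
    """
    split = client.get_dataset_split(dataset_id, dataset_version_id, split_id)
    if split.description != description:  # skip the call when nothing changes
        split.description = description
        client.update_dataset_split(dataset_id, dataset_version_id, split)
    return split
```

Skipping the update when the description is unchanged is a design choice of this sketch, meant to avoid an unnecessary request.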
Delete a split
client.delete_dataset_split(my_dataset.id, new_dataset_version.id, my_split)
Parameter | Type | Description |
---|---|---|
dataset_id | str | The dataset id |
dataset_version_id | str | The dataset version id |
split | DatasetSplit | The split object |