Dataset Splits
A dataset version can have multiple splits, usually separating training, validation and test data.
Get all splits
splits = client.get_dataset_splits(my_dataset.id, new_dataset_version.id)
| Parameter | Type | Description |
|---|
| dataset_id | str | The dataset id |
| dataset_version_id | str | The dataset version id |
Create a split
new_split = DatasetSplit(
name= "train",
description= "training data for our model to learn from"
)
new_split = client.create_dataset_split(my_dataset.id, new_dataset_version.id, new_split)
| Parameter | Type | Description |
|---|
| dataset_id | str | The dataset id |
| dataset_version_id | str | The dataset version id |
| split | DatasetSplit | The split object |
Dataset split properties in more detail:
| Property | Type | Description |
|---|
| id | str | Unique id for the dataset split |
| created_at | str | The creation date |
| updated_at | str | Last updated date |
| name | str | The dataset split name |
| description | str | The dataset split description |
| user_id | str | The unique id of the dataset split creator |
| version_id | The unique id of the dataset version the split belongs to | |
Get a split
my_split = client.get_dataset_split(my_dataset.id, new_dataset_version.id, new_split.id)
| Parameter | Type | Description |
|---|
| dataset_id | str | The dataset id |
| dataset_version_id | str | The dataset version id |
| split_id | str | The split id |
Update a split
my_split.description = "Training data"
client.update_dataset_split(my_dataset.id, new_dataset_version.id, my_split)
| Parameter | Type | Description |
|---|
| dataset_id | str | The dataset id |
| dataset_version_id | str | The dataset version id |
| split | DatasetSplit | The split object |
Delete a split
client.delete_dataset_split(my_dataset.id, new_dataset_version.id, my_split)
| Parameter | Type | Description |
|---|
| dataset_id | str | The dataset id |
| dataset_version_id | str | The dataset version id |
| split | DatasetSplit | The split object |