Datasets
Good data is the key to any AI solution. SeeMe.ai offers a simple, standardised way to interact with every part of the data lifecycle.
Get all datasets
Get a list of all your datasets:
datasets = client.get_datasets()
The get_datasets() method does not take any parameters.
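As a rough sketch of working with the returned list, the helper below builds a one-line summary per dataset. It assumes each returned item exposes the `id`, `name`, and `versions` properties documented for datasets; `summarise_datasets` and the stand-in objects are hypothetical and not part of the SDK.

```python
from types import SimpleNamespace


def summarise_datasets(datasets):
    """Return one summary line per dataset (hypothetical helper, not part of the SDK)."""
    lines = []
    for ds in datasets:
        # `versions` may be missing or None on some objects; treat that as zero versions.
        version_count = len(getattr(ds, "versions", None) or [])
        lines.append(f"{ds.name} ({ds.id}): {version_count} version(s)")
    return lines


# Stand-in objects so the sketch runs without a server; in real use, pass
# the result of client.get_datasets() instead.
example_datasets = [
    SimpleNamespace(id="abc", name="Cats & dogs dataset", versions=[object()]),
    SimpleNamespace(id="def", name="Empty dataset", versions=None),
]

for line in summarise_datasets(example_datasets):
    print(line)
```

With a real client, the call would be `summarise_datasets(client.get_datasets())`.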
Create a dataset
my_dataset = Dataset(
    name="Cats & dogs dataset",
    description="A dataset with labelled images of cats and dogs.",
    multi_label=False,
    notes="Cats and dogs is often used as a demo dataset.",
    default_splits=True,
    content_type=ContentType.IMAGES,
)
my_dataset = client.create_dataset(my_dataset)
Parameter | Type | Description |
---|---|---|
dataset | Dataset | The entire dataset object |
Properties in more detail:
Property | Type | Description |
---|---|---|
id | str | Unique id for the dataset |
created_at | str | The creation date |
updated_at | str | Last updated date |
name | str | The dataset name |
description | str | The dataset description |
user_id | str | The unique id of the dataset creator |
notes | str | More elaborate notes about the dataset |
versions | List[DatasetVersion] | A list of all the versions of the dataset (see below) |
multi_label | bool | Flag indicating whether items can have multiple labels |
default_splits | bool | Create default splits (“train”, “valid”, “test”) when creating the dataset. |
has_logo | bool | Flag indicating whether the dataset has a logo or not |
logo | str | Name and extension of the logo file |
content_type | DatasetContentType | Type of items in the dataset. Possible values: IMAGES, TEXT, TABULAR, NER. |
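When a script may run more than once, it can help to avoid creating the same dataset twice. The sketch below shows one possible "get or create" pattern built only on `get_datasets()` and `create_dataset()`. The helper `get_or_create_dataset`, the `FakeClient` stand-in, and the assumption that dataset names are unique within an account are all hypothetical and not part of the SDK.

```python
from types import SimpleNamespace


def get_or_create_dataset(client, dataset):
    """Return the existing dataset with the same name, or create it.

    Hypothetical helper: it relies only on client.get_datasets() and
    client.create_dataset(), and assumes names are unique per account.
    """
    for existing in client.get_datasets():
        if existing.name == dataset.name:
            return existing
    return client.create_dataset(dataset)


class FakeClient:
    """In-memory stand-in for the real client, used only to make the sketch runnable."""

    def __init__(self):
        self._datasets = []

    def get_datasets(self):
        return list(self._datasets)

    def create_dataset(self, dataset):
        # The real service assigns the id; here we fake one.
        dataset.id = f"fake-{len(self._datasets) + 1}"
        self._datasets.append(dataset)
        return dataset


client = FakeClient()
first = get_or_create_dataset(client, SimpleNamespace(name="Cats & dogs dataset"))
second = get_or_create_dataset(client, SimpleNamespace(name="Cats & dogs dataset"))
print(first.id == second.id, len(client.get_datasets()))
```

Matching on `name` is a design choice for the sketch; if names can repeat in your account, match on another property instead.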
Get dataset
my_dataset = client.get_dataset(my_dataset.id)
Parameter | Type | Description |
---|---|---|
dataset_id | str | The dataset id |
Update dataset
my_dataset.notes += " ~25k labelled images of cats and dogs; 22500 for training, 2000 for validation."
client.update_dataset(my_dataset)
Parameter | Type | Description |
---|---|---|
dataset | Dataset | The entire dataset object |
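Because `update_dataset` takes the entire dataset object, a small wrapper can keep repeated note edits tidy. The following is a minimal sketch, assuming `notes` may be empty or `None`; `append_note` and `RecordingClient` are hypothetical names used for illustration, not SDK functions.

```python
from types import SimpleNamespace


def append_note(client, dataset, note, separator="\n"):
    """Append `note` to dataset.notes and push the change (hypothetical helper)."""
    current = dataset.notes or ""
    # Insert a separator only when there is existing text, so notes do not run together.
    dataset.notes = f"{current}{separator}{note}" if current else note
    client.update_dataset(dataset)
    return dataset


class RecordingClient:
    """Stand-in that records update calls instead of contacting a server."""

    def __init__(self):
        self.updated = []

    def update_dataset(self, dataset):
        self.updated.append(dataset)


client = RecordingClient()
ds = SimpleNamespace(notes="Cats and dogs is often used as a demo dataset.")
append_note(client, ds, "~25k labelled images of cats and dogs.")
print(ds.notes)
```

With a real client, `append_note(client, my_dataset, "...")` would send the modified object through `client.update_dataset`.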
Delete dataset
client.delete_dataset(my_dataset.id)
Parameter | Type | Description |
---|---|---|
dataset_id | str | The dataset id |
Upload dataset logo
my_dataset = client.upload_dataset_logo(my_dataset.id, folder="directory/to/logo", filename="logo_filename.jpg")
Parameter | Type | Description |
---|---|---|
dataset_id | str | Unique id for the dataset |
folder | str | Name of the folder that contains the logo file (without a trailing ‘/’). Default value: “data”. |
filename | str | Name of the file to be uploaded. Default value: “logo.jpg”. Supported formats: jpg, jpeg, png. |
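Checking the file locally before uploading can catch a wrong path or an unsupported format early. The sketch below is one way to do that under the constraints in the table above. `check_logo_file` is a hypothetical helper, and treating extensions case-insensitively is an assumption rather than documented behaviour.

```python
import os
import tempfile

SUPPORTED_LOGO_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def check_logo_file(folder="data", filename="logo.jpg"):
    """Validate a logo path and return (folder, filename) ready for upload.

    Hypothetical helper; the defaults mirror the documented default values.
    """
    folder = folder.rstrip("/")  # the folder is expected without a trailing '/'
    # Assumption: extensions are compared case-insensitively.
    extension = os.path.splitext(filename)[1].lower()
    if extension not in SUPPORTED_LOGO_EXTENSIONS:
        raise ValueError(f"Unsupported logo format: {extension or '(none)'}")
    path = os.path.join(folder, filename)
    if not os.path.isfile(path):
        raise FileNotFoundError(path)
    return folder, filename


# Demonstration with a temporary file, so the sketch runs without real assets.
with tempfile.TemporaryDirectory() as tmp:
    with open(os.path.join(tmp, "logo.png"), "wb") as handle:
        handle.write(b"placeholder")
    folder, filename = check_logo_file(tmp + "/", "logo.png")
    print(filename)
    # With a real client, the upload would then be:
    # client.upload_dataset_logo(my_dataset.id, folder=folder, filename=filename)
```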
Download dataset logo
client.get_dataset_logo(my_dataset)
Parameter | Type | Description |
---|---|---|
dataset | Dataset | The entire dataset object |