Prepare Your Object Detection Dataset

Object detection requires images with annotated bounding boxes around objects. This guide covers data requirements and annotation workflows.

Data Requirements

Requirement	Recommendation
Minimum images	100+ per object class
Formats	JPG, PNG, WEBP
Annotation format	YOLO, COCO, or Pascal VOC
Image quality	Clear, well-lit, representative

Annotation Formats

SeeMe.ai supports multiple annotation formats:

YOLO Format

# One .txt file per image, same filename
# Each line: class_id x_center y_center width height (normalized 0-1)
0 0.5 0.5 0.2 0.3
1 0.25 0.75 0.1 0.15

COCO Format

{
  "images": [{"id": 1, "file_name": "image1.jpg", "width": 1920, "height": 1080}],
  "annotations": [
    {"id": 1, "image_id": 1, "category_id": 0, "bbox": [100, 200, 150, 300]}
  ],
  "categories": [{"id": 0, "name": "car"}]
}

Pascal VOC Format

<annotation>
  <object>
    <name>car</name>
    <bndbox>
      <xmin>100</xmin><ymin>200</ymin>
      <xmax>250</xmax><ymax>500</ymax>
    </bndbox>
  </object>
</annotation>

Using the Web Platform

Annotate Objects

Open an image in the annotation interface
Select the Bounding Box tool
Click and drag to draw boxes around objects
Select the label for each box
Save and move to next image

Keyboard shortcuts:

N - Next image
P - Previous image
D - Delete selected box
1-9 - Quick label selection

Using the Python SDK

from seeme import Client

client = Client()

# Create object detection dataset
dataset = client.create_dataset(
    name="Vehicle Detection",
    description="Cars, trucks, and motorcycles",
    content_type="object_detection"
)

# Import YOLO format dataset
# Structure:
# dataset/
#   images/
#     img1.jpg
#     img2.jpg
#   labels/
#     img1.txt
#     img2.txt
#   classes.txt

client.import_yolo_dataset(
    dataset_id=dataset.id,
    version_id=version.id,
    images_path="./dataset/images",
    labels_path="./dataset/labels",
    classes_path="./dataset/classes.txt"
)

# Import COCO format dataset
client.import_coco_dataset(
    dataset_id=dataset.id,
    version_id=version.id,
    images_path="./coco/images",
    annotations_path="./coco/annotations.json"
)

# Create object detection dataset
curl -X POST "https://api.seeme.ai/api/v1/datasets" \
  -H "Authorization: myusername:my-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "Vehicle Detection",
    "description": "Cars, trucks, and motorcycles",
    "content_type": "object_detection"
  }'

# Create dataset version
curl -X POST "https://api.seeme.ai/api/v1/datasets/{dataset_id}/versions" \
  -H "Authorization: myusername:my-api-key" \
  -H "Content-Type: application/json" \
  -d '{"name": "v1"}'

# Upload image with bounding box annotation
curl -X POST "https://api.seeme.ai/api/v1/versions/{version_id}/items" \
  -H "Authorization: myusername:my-api-key" \
  -F "file=@./image.jpg" \
  -F "split_id={split_id}"

# Add bounding box annotation
curl -X POST "https://api.seeme.ai/api/v1/items/{item_id}/annotations" \
  -H "Authorization: myusername:my-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "label_id": "your-label-id",
    "bbox": {
      "x": 0.2,
      "y": 0.3,
      "width": 0.4,
      "height": 0.3
    }
  }'

Best Practices

Annotate consistently - Same rules for all annotators
Include edge cases - Partially visible, occluded objects
Balance classes - Similar counts per object type
Tight boxes - Minimize background in boxes
Quality check - Review 10% of annotations

Next Step

2. Annotate Objects →

Annotate Objects

Prepare Your Object Detection Dataset

Data Requirements

Annotation Formats

YOLO Format

COCO Format

Pascal VOC Format

Using the Web Platform

Create Dataset

Upload Images

Annotate Objects

Using the Python SDK

Best Practices

Next Step