Prepare Your Object Detection Dataset

Prepare Your Object Detection Dataset

Object detection requires images with annotated bounding boxes around objects. This guide covers data requirements and annotation workflows.

Data Requirements

RequirementRecommendation
Minimum images100+ per object class
FormatsJPG, PNG, WEBP
Annotation formatYOLO, COCO, or Pascal VOC
Image qualityClear, well-lit, representative

Annotation Formats

SeeMe.ai supports multiple annotation formats:

YOLO Format

# One .txt file per image, same filename
# Each line: class_id x_center y_center width height (normalized 0-1)
0 0.5 0.5 0.2 0.3
1 0.25 0.75 0.1 0.15

COCO Format

{
  "images": [{"id": 1, "file_name": "image1.jpg", "width": 1920, "height": 1080}],
  "annotations": [
    {"id": 1, "image_id": 1, "category_id": 0, "bbox": [100, 200, 150, 300]}
  ],
  "categories": [{"id": 0, "name": "car"}]
}

Pascal VOC Format

<annotation>
  <object>
    <name>car</name>
    <bndbox>
      <xmin>100</xmin><ymin>200</ymin>
      <xmax>250</xmax><ymax>500</ymax>
    </bndbox>
  </object>
</annotation>

Using the Web Platform

Using the Python SDK

Best Practices

  1. Annotate consistently - Same rules for all annotators
  2. Include edge cases - Partially visible, occluded objects
  3. Balance classes - Similar counts per object type
  4. Tight boxes - Minimize background in boxes
  5. Quality check - Review 10% of annotations

Next Step

2. Annotate Objects →