.cbor
- Files in the Edge Impulse Data Acquisition format. The uploader will not resign these files, only upload them..json
- Files in the Edge Impulse Data Acquisition format. The uploader will not resign these files, only upload them..csv
- Files in the Edge Impulse Comma Separated Values (CSV) format. If you have configured the “CSV wizard”, the settings will be used to parse your CSV files..wav
- Lossless audio files. It’s recommended to use the same frequency for all files in your data set, as signal processing output might be dependent on the frequency..jpg
and .png
- Image files. It’s recommended to use the same ratio for all files in your data set..mp4
and .avi
- Video file. You can then from the studio split this video file into images at a configurable frame per second.info.labels
- JSON-like file (without the .json
extension). You can use it to add metadata and for custom labeling strategies (single-label vs multi-label, float values labels, etc…). See Edge Impulse exporter formatuploader icon
The visual uploader
training
, testing
or perform an 80/20 split between your data samples.
If needed, you can always perform a split later from your project’s dashboard.
^[a-zA-Z0-9\s-_]+
. For example: idle.01 will yield the label idle
.
Thus, if you want to use labels (string values) containing float values (e.g. “0.01”, “5.02”, etc…), automatic labeling won’t work.
To bypass this limitation, you can make an info.labels
JSON file containing your dataset files’ info. We also support adding metadata to your samples. See below to understand the Edge Impulse Exporter format.
info.labels
files)info.labels
file.
The info.labels
file (can be located in each subdirectory or at the folder root) provides detailed information about the labels. The file follows a JSON format, with the following structure:
version
: Indicates the version of the label format.files
: A list of objects, where each object represents a supported file format and its associated labels.
path
: The path or file name.category
: Indicates whether the image belongs to the training or testing set.label
(optional): Provides information about the labeled objects.
type
: Specifies the type of label - unlabeled
, label
, multi-label
label
(optional): The actual label or class name of the sample.labels
(optional): The labels in the multi-label format:
label
: Label for the given period.startIndex
: Timestamp in milliseconds.endIndex
: Timestamp in milliseconds.metadata
(Optional): Additional metadata associated with the image, such as the site where it was collected, the timestamp or any useful information.boundingBoxes
(Optional): A list of objects, where each object represents a bounding box for an object within the image.
label
: The label or class name of the object within the bounding box.x
, y
: The coordinates of the top-left corner of the bounding box.width
, height
: The width and height of the bounding box.info.labels
file:
Studio Uploader detected info.labels
Dataset annotation formats
Click upload data
Data uploaded
bounding_boxes.labels
file.
The bounding_boxes.labels
file in each subdirectory provides detailed information about the labeled objects and their corresponding bounding boxes. The file follows a JSON format, with the following structure:
version
: Indicates the version of the label format.files
: A list of objects, where each object represents an image and its associated labels.
path
: The path or file name of the image.category
: Indicates whether the image belongs to the training or testing set.label
: Provides information about the labeled objects.
type
: Specifies the type of label (e.g., a single label).label
: The actual label or class name of the object.metadata
: Additional metadata associated with the image, such as the site where it was collected, the timestamp or any useful information.boundingBoxes
: A list of objects, where each object represents a bounding box for an object within the image.
label
: The label or class name of the object within the bounding box.x
, y
: The coordinates of the top-left corner of the bounding box.width
, height
: The width and height of the bounding box.bounding_boxes.labels
example:
_annotations.coco.json
file in each subdirectory provides detailed information about the labeled objects and their corresponding bounding boxes. The file follows a JSON format, with the following structure:
Categories
The “categories” component defines the labels or classes of objects present in the dataset. Each category is represented by a dictionary containing the following fields:
id
: A unique integer identifier for the category.name
: The name or label of the category.supercategory
: A higher-level category that the current category belongs to, if applicable. This supercategory
is not used or imported by the Uploader.id
: A unique integer identifier for the image.width
: The width of the image in pixels.height
: The height of the image in pixels.file_name
: The file name or path of the image file.id
: A unique integer identifier for the annotation.image_id
: The identifier of the image to which the annotation belongs.category_id
: The identifier of the category that the annotation represents.bbox
: A list representing the bounding box coordinates in the format [x, y, width, height].area
: The area (in pixels) occupied by the annotated object.segmentation
: The segmentation mask of the object, represented as a list of polygons.iscrowd
: A flag indicating whether the annotated object is a crowd or group of objects.area
, segmentation
, iscrowd
fields.
_annotations.coco.json
example:
_annotations.csv
file is located in the same directory of the images it references. A class-descriptions.csv
mapping file can be used to give short description or human-readable classes from the MID LabelName
.
An OpenImage CSV dataset usually has this directory structure:
ImageID
: An identifier or filename for the image to which the annotation belongs.Source
: The source or origin of the annotation, indicating whether it was manually annotated or obtained from other sources.LabelName
: The class label of the object.Confidence
: The confidence score or probability associated with the annotation.XMin, YMin, XMax, YMax
: The coordinates of the bounding box that encloses the object, usually represented as the top-left (XMin, YMin) and bottom-right (XMax, YMax) corners.IsOccluded, IsTruncated, IsGroupOf, IsDepiction, IsInside
: Binary flags indicating whether the object is occluded, truncated, a group of objects, a depiction, or inside another object.LabelName
in the CSV file.LabelName
correspond to specific object categories defined in the OpenImage dataset’s ontology (MID).class-description.csv
mapping file to see your classes in Edge Impulse Studio.
Bounding Box Coordinates:
class-descriptions.csv
mapping file:
*class-descriptions.csv
_annotations.csv
example:
cubes.23im33f2.xml
:
file_name
: The filename of the image.class_name
: The class label or category of the image._annotations_single_label.csv
example:
file_name
: The filename of the image.classes
: The class label or category of the object.xmin
: The x-coordinate of the top-left corner of the bounding box.ymin
: The y-coordinate of the top-left corner of the bounding box.xmax
: The x-coordinate of the bottom-right corner of the bounding box.ymax
: The y-coordinate of the bottom-right corner of the bounding box._annotations_bounding_boxes.csv
example:
.txt
.class_label
, normalized bounding box coordinates (center_x
, center_y
, width
, height
).cubes-23im33f2.txt
cubes-23im33f2.jpg
image.
classes.txt
, classes.names
or data.yaml
(used by Roboflow YOLOv5 PyTorch export format) files contain configuration values used by the model to locate images and map class names to class_id
s.classes.txt
file: