Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Task: Object Detection
License: Apache 2.0
This dataset has been collected by Edge Impulse teams and is a variant of the "Object Detection - Dice" dataset. It images of dice on a white background labeled by color.
Feature extraction: Image
Learning block: Object Detection
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 85
Labeling Method: object detection
Train/Test Split: 80.00% / 20.00%
Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
Upload portals (Enterprise feature)
If you use this dataset in your research paper, please cite it using the following BibTeX:
Training Set
Testing Set
Total Data Items
68
17
Labels
blue, green, orange, purple, red, yellow
blue, green, orange, purple, red, yellow
Task: Image Classification
License: Apache 2.0
This dataset contains images of fire extinguisher safety pins. This dataset can be used to combine image classification and visual anomaly detection.
Label information:
rot-powder
: Pin for ROT powder fire extinguisher - Yellow - manufacturer reference: 06210409F
rot-water
: Pin for ROT water spray fire extinguisher - Blue - manufacturer reference: 06210410F
anomaly
: Other fire extinguisher safety pins
Feature extraction: Image
Learning block: Transfer Learning (Images), NVIDIA TAO, Classification
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 264
Labeling Method: single label
Train/Test Split: 79.92% / 20.08%
Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
Upload portals (Enterprise feature)
If you use this dataset in your research paper, please cite it using the following BibTeX:
This datasets collection contains publicly available datasets collected, generated or curated by Edge Impulse or its partners.
Each of these datasets highlights a specific use case or machine learning task, helping you understand the types of data commonly encountered in projects like object detection, audio classification, and visual anomaly detection. While not intended for production models, they serve as valuable resources for learning, experimentation, and prototyping. Whether you're a developer, researcher, or student, these datasets are here to support your exploration of edge AI by offering real-world scenarios and insights into data preparation and model development.
Task: Image Classification
License:
This dataset has been collected by Edge Impulse teams and contains images taken from a smartphone using the NATIONAL GEOGRAPHIC 40x-1280x Microscope.
Total Data Items: 246
Labeling Method: single label
Train/Test Split: 81.30% / 18.70%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
Feature extraction:
Learning block: , ,
Not sure what to choose? Try out this dataset with the .
Clone the .
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
(Enterprise feature)
Training Set
Testing Set
Total Data Items
211
53
Labels
anomaly, rot-powder, rot-water
anomaly, rot-powder, rot-water
Training Set | Testing Set |
Total Data Items | 200 | 46 |
Labels | cotton stem, epidermis onion, housefly leg, unknown, wood stem | cotton stem, epidermis onion, housefly leg, unknown, wood stem |
Task: Object Detection
License: Apache 2.0
This dataset has been collected by Edge Impulse teams and contains images of bottle racks on different backgrounds and different orientations. It can be used to count the number of beers in the rack. The two labels - empty or full - is used to distinguish if the beer has a cap or not.
Feature extraction: Image
Learning block: Object Detection, NVIDIA TAO,
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 130
Labeling Method: object detection
Train/Test Split: 74.62% / 25.38%
Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
Upload portals (Enterprise feature)
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Object Detection
License: Apache 2.0
This dataset has been synthetically generated by Edge Impulse teams using NVIDIA Omniverse and Edge Impulse Omniverse Extension.
You can also have a look at this tutorial created by Edge Impulse expert George Igwegbe to create a synthetic dataset using NVIDIA Omniverse Replicator.
Feature extraction: Image
Learning block: Object Detection, NVIDIA TAO,
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 101
Labeling Method: object detection
Train/Test Split: 78.22% / 21.78%
Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
Upload portals (Enterprise feature)
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Object Detection
License: Apache 2.0
This dataset has been collected by Edge Impulse teams and contains images of dice on a white background. It can be used to count the number of items in the frame.
Feature extraction: Image
Learning block: Object Detection
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 85
Labeling Method: object detection
Train/Test Split: 80.00% / 20.00%
Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
Upload portals (Enterprise feature)
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Object Detection
License: Apache 2.0
This dataset has been collected by Edge Impulse team and used extensively to design the FOMO (Faster Objects, More Objects) object detection architecture.
See FOMO documentation or the announcement blog post.
Feature extraction: Image
Learning block: Object Detection, NVIDIA TAO,
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 670
Labeling Method: object detection
Train/Test Split: 79.70% / 20.30%
Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
Upload portals (Enterprise feature)
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Object Detection
License: Apache 2.0
This dataset, collected by Edge Impulse team, is a variant of the Cubes on Conveyor Belt dataset. It has been used to perform spatial aware object detection using a micro transformer.
Have a look at this blog post for more information: FOMO Self-Attention
Total Data Items: 85
Labeling Method: object detection
Train/Test Split: 97.65% / 2.35%
Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
Upload portals (Enterprise feature)
If you use this dataset in your research paper, please cite it using the following BibTeX:
Training Set
Testing Set
Total Data Items
97
33
Labels
empty, full
empty, full
Training Set
Testing Set
Total Data Items
79
22
Labels
Can
Can
Training Set
Testing Set
Total Data Items
68
17
Labels
dice
dice
Training Set
Testing Set
Total Data Items
534
136
Labels
blue, green, red, yellow
blue, green, red, yellow
Training Set
Testing Set
Total Data Items
83
2
Labels
left, right
left, right
Task: Visual Anomaly Detection
License: Apache 2.0
This dataset has been collected by Edge Impulse teams and contains a single flat washer, randomly located in the frame. The dataset has been collected using two different backgrounds - a white background and textured dark-grey background. The training dataset only contains "nominal" (no anomaly) images whereas the testing dataset contains both nominal and anomalous images.
Feature extraction: Image
Learning block: Visual Anomaly Detection (FOMO-AD)
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 464
Labeling Method: single label
Train/Test Split: 58.84% / 41.16%
Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
Upload portals (Enterprise feature)
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Visual Anomaly Detection
License: Apache 2.0
This dataset has been collected by Edge Impulse teams and contains a single DHT11 sensor, centered in the frame, with a similar size and a uniform background.
The training dataset only contains "nominal" (no anomaly) images whereas the testing dataset contains both nominal and anomalous images.
The DHT11 have been used to teach IoT classes in the past and have been manipulated by students extensively. When not wiring the pins properly, it can cause an overheat which often lead to the plastic melting. Some other anomalous images are missing wiring pins.
Feature extraction: Image
Learning block: Visual Anomaly Detection (FOMO-AD)
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 195
Labeling Method: single label
Train/Test Split: 69.74% / 30.26%
Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
Upload portals (Enterprise feature)
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Visual Anomaly Detection
License: Apache 2.0
This dataset has been collected by Edge Impulse teams and contains a single fire extinguisher head thread, centered in the frame, with a similar size and a variable background.
The training dataset only contains "good" (no anomaly) images whereas the testing dataset contains both nominal and anomalous images.
Feature extraction: Image
Learning block: Visual Anomaly Detection (FOMO-AD)
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 204
Labeling Method: single label
Train/Test Split: 58.82% / 41.18%
Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
Upload portals (Enterprise feature)
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Visual Anomaly Detection
License: Apache 2.0
This dataset has been collected by Edge Impulse teams and contains a single thermostatic valves, centered in the frame, with a similar size and a uniform background.
The training dataset only contains "nominal" (no anomaly) images whereas the testing dataset contains both nominal and anomalous images.
Feature extraction: Image
Learning block: Visual Anomaly Detection (FOMO-AD)
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 195
Labeling Method: single label
Train/Test Split: 62.05% / 37.95%
Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
Upload portals (Enterprise feature)
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Visual Anomaly Detection
License:
This dataset has been collected by Edge Impulse teams and contains a single red-white capsule, centered in the frame, with a similar size and a uniform background.
The training dataset only contains "nominal" (no anomaly) images whereas the testing dataset contains both nominal and anomalous images.
Total Data Items: 149
Labeling Method: single label
Train/Test Split: 64.43% / 35.57%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Audio Classification
License:
This dataset has been generated by Edge Impulse teams to recognize the sound of glass breaking.
Total Data Items: 500
Total Data Length: 0h 21m 15s
Axis Summary: audio
Labeling Method: single label
Train/Test Split: 80.00% / 20.00%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Audio Classification
License:
This dataset has been collected by Edge Impulse teams to recognize the sound of water running from a faucet, even in the presence of other background noise.
Total Data Items: 18
Total Data Length: 0h 15m 40s
Axis Summary: audio
Labeling Method: single label
Train/Test Split: 88.89% / 11.11%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Audio Classification
License:
This dataset can be used to build an Edge AI project detecting the "Hello World" keyword phrase.
Total Data Items: 2062
Total Data Length: 0h 34m 22s
Axis Summary: audio
Labeling Method: single label
Train/Test Split: 79.97% / 20.03%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Motion and Vibration Classification
License:
Total Data Items: 11
Labeling Method: single label
Train/Test Split: 72.73% / 27.27%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
Feature extraction:
Learning block:
Not sure what to choose? Try out this dataset with the .
Clone the .
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
(Enterprise feature)
The synthetic data has been generated using the .
You can also have a look at the blog post .
Feature extraction: , ,
Learning block:
Not sure what to choose? Try out this dataset with the .
Clone the .
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
(Enterprise feature)
You can also follow learn how to collect audio data from microphones, use signal processing to extract the most important information, and train a deep neural network that can tell you whether the sound of running water can be heard in a given clip of audio. Finally, you'll deploy the system to an embedded device and evaluate how well it works.
Feature extraction: , ,
Learning block:
Not sure what to choose? Try out this dataset with the .
Clone the .
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
(Enterprise feature)
Have you ever wanted to make your own "Ok, Google" or "Alexa" keyword spotting model? The helloworld
class has been collected by Edge Impulse teams, the added noise
samples come from the and the unknown
samples are based on a subset of data in the .
You can also follow to guide you through building your keyword spotting model, from data collection to deployment on embedded devices.
Feature extraction: , , ,
Learning block: ,
Not sure what to choose? Try out this dataset with the .
Clone the .
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
(Enterprise feature)
Clone the .
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
(Enterprise feature)
Training Set
Testing Set
Total Data Items
273
191
Labels
no anomaly
anomaly, no anomaly
Training Set
Testing Set
Total Data Items
136
59
Labels
no anomaly
anomaly, no anomaly
Training Set
Testing Set
Total Data Items
120
84
Labels
good
good, rusty
Training Set
Testing Set
Total Data Items
121
74
Labels
no anomaly
anomaly, no anomaly
Training Set | Testing Set |
Total Data Items | 96 | 53 |
Labels | no anomaly | anomaly, no anomaly |
Training Set | Testing Set |
Total Data Items | 400 | 100 |
Labels | glass_breaking, noise | glass_breaking, noise |
Total Data Length | 0h 17m 37s | 0h 3m 37s |
Training Set | Testing Set |
Total Data Items | 16 | 2 |
Labels | faucet, noise | faucet, noise |
Total Data Length | 0h 13m 40s | 0h 2m 0s |
Training Set | Testing Set |
Total Data Items | 1649 | 413 |
Labels | helloworld, noise, unknown | helloworld, noise, unknown |
Total Data Length | 0h 27m 29s | 0h 6m 53s |
Training Set | Testing Set |
Total Data Items | 8 | 3 |
Labels | extract, grind, idle, pump | extract, grind, idle, pump |
Task: Sensor Fusion Classification
License: Apache 2.0
This dataset has been collected by Edge Impulse team to demonstrate how to perform Sensor Fusion Classification leveraging Neural Networks Embeddings.
Sensor fusion refers to the process of combining data from different types of sensors to give more information to the neural network. To extract meaningful information from this data, you can use the same DSP block, multiples DSP blocks, or use neural networks embeddings.
You can have a look at this tutorial for more information to see how to use this dataset.
This dataset combines audio data and accelerometer data.
Feature extraction: Audio Embeddings (Spectrogram + Convolution Neural Network Embeddings) + Spectral Analysis (accelerometer)
Learning block: Classification using a fully-connected network
Total Data Items: 36
Labeling Method: single label
Train/Test Split: 91.67% / 8.33%
Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
Upload portals (Enterprise feature)
If you use this dataset in your research paper, please cite it using the following BibTeX:
Training Set
Testing Set
Total Data Items
33
3
Labels
extract, grind, idle, pump
Task: Motion and Vibration Classification
License: Apache 2.0
This is a prebuilt dataset, collected by Edge Impulse teams, for a gesture recognition system based on continuous motion, for the Continous Motion Recognition tutorial. It contains 15 minutes of data sampled from a MEMS accelerometer at 62.5Hz over the following four classes:
Idle - board sits idly on your desk. There might be some movement detected, e.g. from typing while the board is present.
Snake - board moves over the desk as a snake.
Updown - board moves up and down in a continuous motion.
Wave - board moves left and right like you're waving to someone.
Feature extraction: Spectral Features (FFTs or Wavelets)
Learning block: Classification + optionally Anomaly Detection (K-Means) or Anomaly Detection (GMM)
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 101
Labeling Method: single label
Train/Test Split: 84.16% / 15.84%
Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
Upload portals (Enterprise feature)
If you use this dataset in your research paper, please cite it using the following BibTeX:
Training Set
Testing Set
Total Data Items
85
16
Labels
idle, snake, updown, wave
anomaly, idle, snake, updown, wave