Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Task: Image Classification
License: BSD 3-Clause Clear
This dataset has been collected by Edge Impulse teams and contains images taken from a smartphone using the NATIONAL GEOGRAPHIC 40x-1280x Microscope.
Feature extraction: Image
Learning block: Transfer Learning (Images), Classification
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 246
Labeling Method: single label
Train/Test Split: 81.30% / 18.70%
Training Set
Testing Set
Total Data Items
200
46
Labels
cotton stem, epidermis onion, housefly leg, unknown, wood stem
cotton stem, epidermis onion, housefly leg, unknown, wood stem
Clone the public project
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Image Classification
License: BSD 3-Clause Clear
This dataset contains images of fire extinguisher safety pins. This dataset can be used to combine image classification and visual anomaly detection.
Label information:
rot-powder
: Pin for ROT powder fire extinguisher - Yellow - manufacturer reference: 06210409F
rot-water
: Pin for ROT water spray fire extinguisher - Blue - manufacturer reference: 06210410F
anomaly
: Other fire extinguisher safety pins
Feature extraction: Image
Learning block: Transfer Learning (Images), Classification
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 264
Labeling Method: single label
Train/Test Split: 79.92% / 20.08%
Training Set
Testing Set
Total Data Items
211
53
Labels
anomaly, rot-powder, rot-water
anomaly, rot-powder, rot-water
Clone the public project
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
If you use this dataset in your research paper, please cite it using the following BibTeX:
This datasets collection contains publicly available datasets collected, generated or curated by Edge Impulse or its partners.
Each of these datasets highlights a specific use case or machine learning task, helping you understand the types of data commonly encountered in projects like object detection, audio classification, and visual anomaly detection. While not intended for production models, they serve as valuable resources for learning, experimentation, and prototyping. Whether you're a developer, researcher, or student, these datasets are here to support your exploration of edge AI by offering real-world scenarios and insights into data preparation and model development.
Task: Object Detection
License: BSD 3-Clause Clear
This dataset has been collected by Edge Impulse teams and contains images of dice on a white background. It can be used to count the number of items in the frame.
Feature extraction: Image
Learning block: Object Detection
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 85
Labeling Method: object detection
Train/Test Split: 80.00% / 20.00%
Training Set
Testing Set
Total Data Items
68
17
Labels
dice
dice
Clone the public project
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Object Detection
License: BSD 3-Clause Clear
This dataset has been collected by Edge Impulse teams and is a variant of the "Object Detection - Dice" dataset. It images of dice on a white background labeled by color.
Feature extraction: Image
Learning block: Object Detection
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 85
Labeling Method: object detection
Train/Test Split: 80.00% / 20.00%
Training Set
Testing Set
Total Data Items
68
17
Labels
blue, green, orange, purple, red, yellow
blue, green, orange, purple, red, yellow
Clone the public project
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Object Detection
License: BSD 3-Clause Clear
This dataset, collected by Edge Impulse team, is a variant of the Cubes on Conveyor Belt dataset. It has been used to perform spatial aware object detection using a micro transformer.
Have a look at this blog post for more information: FOMO Self-Attention
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 85
Labeling Method: object detection
Train/Test Split: 97.65% / 2.35%
Training Set
Testing Set
Total Data Items
83
2
Labels
left, right
left, right
Clone the public project
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Object Detection
License: BSD 3-Clause Clear
This dataset has been collected by Edge Impulse team and used extensively to design the FOMO (Faster Objects, More Objects) object detection architecture.
See FOMO documentation or the announcement blog post.
Feature extraction: Image
Learning block: Object Detection
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 670
Labeling Method: object detection
Train/Test Split: 79.70% / 20.30%
Training Set
Testing Set
Total Data Items
534
136
Labels
blue, green, red, yellow
blue, green, red, yellow
Clone the public project
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Object Detection
This dataset has been collected by Edge Impulse teams and contains images of bottle racks on different backgrounds and different orientations. It can be used to count the number of beers in the rack. The two labels - empty or full - is used to distinguish if the beer has a cap or not.
Total Data Items: 130
Labeling Method: object detection
Train/Test Split: 74.62% / 25.38%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Object Detection
Total Data Items: 101
Labeling Method: object detection
Train/Test Split: 78.22% / 21.78%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Visual Anomaly Detection
This dataset has been collected by Edge Impulse teams and contains a single red-white capsule, centered in the frame, with a similar size and a uniform background.
The training dataset only contains "nominal" (no anomaly) images whereas the testing dataset contains both nominal and anomalous images.
Total Data Items: 149
Labeling Method: single label
Train/Test Split: 64.43% / 35.57%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Visual Anomaly Detection
This dataset has been collected by Edge Impulse teams and contains a single fire extinguisher head thread, centered in the frame, with a similar size and a variable background.
The training dataset only contains "good" (no anomaly) images whereas the testing dataset contains both nominal and anomalous images.
Total Data Items: 204
Labeling Method: single label
Train/Test Split: 58.82% / 41.18%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Visual Anomaly Detection
This dataset has been collected by Edge Impulse teams and contains a single DHT11 sensor, centered in the frame, with a similar size and a uniform background.
The training dataset only contains "nominal" (no anomaly) images whereas the testing dataset contains both nominal and anomalous images.
The DHT11 have been used to teach IoT classes in the past and have been manipulated by students extensively. When not wiring the pins properly, it can cause an overheat which often lead to the plastic melting. Some other anomalous images are missing wiring pins.
Total Data Items: 195
Labeling Method: single label
Train/Test Split: 69.74% / 30.26%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Visual Anomaly Detection
This dataset has been collected by Edge Impulse teams and contains a single thermostatic valves, centered in the frame, with a similar size and a uniform background.
The training dataset only contains "nominal" (no anomaly) images whereas the testing dataset contains both nominal and anomalous images.
Total Data Items: 195
Labeling Method: single label
Train/Test Split: 62.05% / 37.95%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Visual Anomaly Detection
This dataset has been collected by Edge Impulse teams and contains a single flat washer, randomly located in the frame. The dataset has been collected using two different backgrounds - a white background and textured dark-grey background. The training dataset only contains "nominal" (no anomaly) images whereas the testing dataset contains both nominal and anomalous images.
Total Data Items: 464
Labeling Method: single label
Train/Test Split: 58.84% / 41.16%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
License:
Feature extraction:
Learning block:
Not sure what to choose? Try out this dataset with the .
Clone the
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
License:
This dataset has been synthetically generated by Edge Impulse teams using and .
You can also have a look at created by Edge Impulse expert George Igwegbe to create a synthetic dataset using NVIDIA Omniverse Replicator.
Feature extraction:
Learning block:
Not sure what to choose? Try out this dataset with the .
Clone the
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
License:
Feature extraction:
Learning block:
Not sure what to choose? Try out this dataset with the .
Clone the
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
License:
Feature extraction:
Learning block:
Not sure what to choose? Try out this dataset with the .
Clone the
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
License:
Feature extraction:
Learning block:
Not sure what to choose? Try out this dataset with the .
Clone the
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
License:
Feature extraction:
Learning block:
Not sure what to choose? Try out this dataset with the .
Clone the
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
License:
Feature extraction:
Learning block:
Not sure what to choose? Try out this dataset with the .
Clone the
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
Training Set
Testing Set
Total Data Items
97
33
Labels
empty, full
empty, full
Training Set
Testing Set
Total Data Items
79
22
Labels
Can
Can
Training Set
Testing Set
Total Data Items
96
53
Labels
no anomaly
anomaly, no anomaly
Training Set
Testing Set
Total Data Items
120
84
Labels
good
good, rusty
Training Set
Testing Set
Total Data Items
136
59
Labels
no anomaly
anomaly, no anomaly
Training Set
Testing Set
Total Data Items
121
74
Labels
no anomaly
anomaly, no anomaly
Training Set
Testing Set
Total Data Items
273
191
Labels
no anomaly
anomaly, no anomaly
Task: Audio Classification
License: BSD 3-Clause Clear
Have you ever wanted to make your own "Ok, Google" or "Alexa" keyword spotting model? The helloworld
class has been collected by Edge Impulse teams, the added noise
samples come from the Microsoft Scalable Noisy Speech Dataset and the unknown
samples are based on a subset of data in the Google Speech Commands Dataset.
This dataset can be used to build an Edge AI project detecting the "Hello World" keyword phrase.
You can also follow our tutorial to guide you through building your keyword spotting model, from data collection to deployment on embedded devices.
Feature extraction: Audio (MFCC), Audio (MFE), Spectrogram, Raw Data
Learning block: Classification, Transfer Learning (Keyword Spotting)
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 2062
Total Data Length: 0h 34m 22s
Axis Summary: audio
Labeling Method: single label
Train/Test Split: 79.97% / 20.03%
Training Set
Testing Set
Total Data Items
1649
413
Labels
helloworld, noise, unknown
helloworld, noise, unknown
Total Data Length
0h 27m 29s
0h 6m 53s
Clone the public project
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Visual Regression
License: BSD 3-Clause Clear
This dataset has been generated by Edge Impulse teams and contains images of vial tubes filled at different levels with a green liquid.
The images in the training set have been captured using 5 different backgrounds. The images in the testing set have been captured using 9 different backgrounds. The background
information is available in the metadata of the data samples.
Feature extraction: Image
Learning block: Regression
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 747
Labeling Method: single label
Train/Test Split: 66.67% / 33.33%
Training Set
Testing Set
Total Data Items
498
249
Labels
-1, 0, 0.5, 0.75, 1, 1.25, 1.5, 2
-1, 0, 0.4, 0.5, 0.7, 0.75, 1, 1.2, 1.25, 1.5, 2
Clone the public project
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Audio Classification
License: BSD 3-Clause Clear
This dataset has been collected by Edge Impulse teams to recognize the sound of water running from a faucet, even in the presence of other background noise.
You can also follow our tutorial learn how to collect audio data from microphones, use signal processing to extract the most important information, and train a deep neural network that can tell you whether the sound of running water can be heard in a given clip of audio. Finally, you'll deploy the system to an embedded device and evaluate how well it works.
Feature extraction: Audio (MFE), Spectrogram, Raw Data
Learning block: Classification
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 18
Total Data Length: 0h 15m 40s
Axis Summary: audio
Labeling Method: single label
Train/Test Split: 88.89% / 11.11%
Training Set
Testing Set
Total Data Items
16
2
Labels
faucet, noise
faucet, noise
Total Data Length
0h 13m 40s
0h 2m 0s
Clone the public project
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Audio Classification
License: BSD 3-Clause Clear
This dataset has been generated by Edge Impulse teams to recognize the sound of glass breaking.
The synthetic data has been generated using the Edge Impulse's ElevenLabs.io integration.
You can also have a look at the blog post That Sounds Great: Create Ultra-Realistic Audio Datasets with ElevenLabs.io.
Feature extraction: Audio (MFE), Spectrogram, Raw Data
Learning block: Classification
Not sure what to choose? Try out this dataset with the EON Tuner.
Total Data Items: 500
Total Data Length: 0h 21m 15s
Axis Summary: audio
Labeling Method: single label
Train/Test Split: 80.00% / 20.00%
Training Set
Testing Set
Total Data Items
400
100
Labels
glass_breaking, noise
glass_breaking, noise
Total Data Length
0h 17m 37s
0h 3m 37s
Clone the public project
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Sensor Fusion Classification
License: BSD 3-Clause Clear
This dataset has been collected by Edge Impulse team to demonstrate how to perform Sensor Fusion Classification leveraging Neural Networks Embeddings.
Sensor fusion refers to the process of combining data from different types of sensors to give more information to the neural network. To extract meaningful information from this data, you can use the same DSP block, multiples DSP blocks, or use neural networks embeddings.
You can have a look at this tutorial for more information to see how to use this dataset.
This dataset combines audio data and accelerometer data.
Feature extraction: Audio Embeddings (Spectrogram + Convolution Neural Network Embeddings) + Spectral Analysis (accelerometer)
Learning block: Classification using a fully-connected network
Total Data Items: 36
Labeling Method: single label
Train/Test Split: 91.67% / 8.33%
Training Set
Testing Set
Total Data Items
33
3
Labels
extract, grind, idle, pump
Clone the public project
To clone and use this project, visit the Edge Impulse Studio link, click on the Clone button on the top-right corner and follow the cloning instructions.
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels
). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Motion and Vibration Classification
Idle - board sits idly on your desk. There might be some movement detected, e.g. from typing while the board is present.
Snake - board moves over the desk as a snake.
Updown - board moves up and down in a continuous motion.
Wave - board moves left and right like you're waving to someone.
Total Data Items: 101
Labeling Method: single label
Train/Test Split: 84.16% / 15.84%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
Task: Motion and Vibration Classification
Total Data Items: 11
Labeling Method: single label
Train/Test Split: 72.73% / 27.27%
Download
HuggingFace - Soon
Kaggle - Soon
Import this dataset to your Edge Impulse project
If you use this dataset in your research paper, please cite it using the following BibTeX:
License:
This is a prebuilt dataset, collected by Edge Impulse teams, for a gesture recognition system based on continuous motion, for the . It contains 15 minutes of data sampled from a MEMS accelerometer at 62.5Hz over the following four classes:
Feature extraction: (FFTs or Wavelets)
Learning block: + optionally or
Not sure what to choose? Try out this dataset with the .
Clone the
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
License:
Clone the
To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.
This project uses the Edge Impulse Exporter Format (info.labels
). See this for more info.
Edge Impulse also supports different and that you can import into your project to build your edge AI models:
Training Set
Testing Set
Total Data Items
85
16
Labels
idle, snake, updown, wave
anomaly, idle, snake, updown, wave
Training Set
Testing Set
Total Data Items
8
3
Labels
extract, grind, idle, pump
extract, grind, idle, pump