Audio Classification

Keyword Spotting

Task: Audio Classification

License: Apache 2.0

Description

Have you ever wanted to make your own "Ok, Google" or "Alexa" keyword spotting model? This dataset combines three sources: the helloworld class was collected by Edge Impulse teams, the added noise samples come from the Microsoft Scalable Noisy Speech Dataset, and the unknown samples are based on a subset of the Google Speech Commands Dataset.

This dataset can be used to build an Edge AI project detecting the "Hello World" keyword phrase.

You can also follow our tutorial to guide you through building your keyword spotting model, from data collection to deployment on embedded devices.

Compatible Blocks

Feature extraction: Audio (MFCC), Audio (MFE), Spectrogram, Raw Data
Learning block: Classification, Transfer Learning (Keyword Spotting)

Not sure what to choose? Try out this dataset with the EON Tuner.

Dataset Details

Total Data Items: 2062
Total Data Length: 0h 34m 22s
Axis Summary: audio
Labeling Method: single label
Train/Test Split: 79.97% / 20.03%
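
The split percentages above can be converted back into approximate item counts. The following is a minimal sketch; the function name `split_counts` is hypothetical, and it assumes the percentages were computed over the number of data items and rounded to two decimals.

```python
def split_counts(total_items: int, train_percent: float) -> tuple[int, int]:
    """Estimate (train, test) item counts from a total and a train percentage."""
    # Round to the nearest whole item, since the published percentage is rounded.
    train = round(total_items * train_percent / 100)
    return train, total_items - train


train, test = split_counts(2062, 79.97)
print(train, test)
```

For the figures listed here (2062 items, 79.97% training), this gives 1649 training items and 413 test items.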

Usage

Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click the Clone button in the top-right corner, and follow the cloning instructions.
Download
- Direct link
- HuggingFace - Soon
- Kaggle - Soon
Import this dataset to your Edge Impulse project
This project uses the Edge Impulse Exporter Format (info.labels). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
- Studio uploader
- CLI uploader
- CSV Wizard
- Python SDK
- Ingestion API
- Import from S3 buckets
- Upload portals (Enterprise feature)
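
After downloading the dataset, it can be useful to check how samples are distributed across labels before importing them. The sketch below assumes that info.labels is a JSON file with a top-level "files" list whose entries carry a "category" and a nested "label" object; that layout, the file names in the sample, and the helper `count_labels` are assumptions for illustration, so check the documentation page for the authoritative schema.

```python
import json
from collections import Counter

# Hypothetical excerpt mirroring the assumed info.labels layout.
SAMPLE = """
{
  "version": 1,
  "files": [
    {"path": "training/helloworld.1.wav", "category": "training",
     "label": {"type": "label", "label": "helloworld"}},
    {"path": "training/helloworld.2.wav", "category": "training",
     "label": {"type": "label", "label": "helloworld"}},
    {"path": "training/noise.1.wav", "category": "training",
     "label": {"type": "label", "label": "noise"}},
    {"path": "testing/unknown.1.wav", "category": "testing",
     "label": {"type": "label", "label": "unknown"}}
  ]
}
"""


def count_labels(info_labels_text: str) -> Counter:
    """Count samples per (category, label) pair in an info.labels document."""
    info = json.loads(info_labels_text)
    counts = Counter()
    for entry in info.get("files", []):
        # Fall back to placeholders when a field is missing (assumed optional).
        label = entry.get("label", {}).get("label", "<unlabeled>")
        category = entry.get("category", "<unknown>")
        counts[(category, label)] += 1
    return counts


for (category, label), n in sorted(count_labels(SAMPLE).items()):
    print(f"{category:10s} {label:12s} {n}")
```

To run this against a real download, read the info.labels file from the extracted dataset folder and pass its text to `count_labels` instead of `SAMPLE`.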

Citation

If you use this dataset in your research paper, please cite it using the following BibTeX:

@misc{edgeimpulse_dataset_499022,
    title = {Audio Classification - Keyword Spotting},
    author = {Edge Impulse},
    year = {2024},
    url = {https://studio.edgeimpulse.com/public/499022/latest},
    note = {Apache 2.0}
}

Glass breaking

Task: Audio Classification

License: Apache 2.0

Description

This dataset has been generated by Edge Impulse teams to recognize the sound of glass breaking.

The synthetic data has been generated using the .

You can also have a look at the blog post .

Compatible Blocks

Feature extraction: , ,
Learning block:

Not sure what to choose? Try out this dataset with the EON Tuner.

Dataset Details

Total Data Items: 500
Total Data Length: 0h 21m 15s
Axis Summary: audio
Labeling Method: single label
Train/Test Split: 80.00% / 20.00%
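
The totals above also give a rough idea of how long each sample is. This is a minimal sketch with hypothetical helper names (`parse_duration`, `mean_item_seconds`); it assumes the length is always written in the "0h 21m 15s" form shown here.

```python
import re


def parse_duration(text: str) -> int:
    """Convert a string such as '0h 21m 15s' to a number of seconds."""
    match = re.fullmatch(r"(\d+)h (\d+)m (\d+)s", text.strip())
    if match is None:
        raise ValueError(f"unrecognized duration: {text!r}")
    hours, minutes, seconds = (int(group) for group in match.groups())
    return hours * 3600 + minutes * 60 + seconds


def mean_item_seconds(total_length: str, total_items: int) -> float:
    """Average length of one data item, in seconds."""
    return parse_duration(total_length) / total_items


print(mean_item_seconds("0h 21m 15s", 500))
```

With 500 items over 21 minutes and 15 seconds (1275 seconds), the average sample is about 2.55 seconds long.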

Usage

Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click the Clone button in the top-right corner, and follow the cloning instructions.
Download
- HuggingFace - Soon
- Kaggle - Soon
Import this dataset to your Edge Impulse project
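This project uses the Edge Impulse Exporter Format (info.labels). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
- Studio uploader
- CLI uploader
- CSV Wizard
- Python SDK
- Ingestion API
- Import from S3 buckets
- Upload portals (Enterprise feature)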

Citation

If you use this dataset in your research paper, please cite it using the following BibTeX:

@misc{edgeimpulse_dataset_497425,
    title = {Audio Classification - Glass breaking},
    author = {Edge Impulse},
    year = {2024},
    url = {https://studio.edgeimpulse.com/public/497425/latest},
    note = {Apache 2.0}
}

Faucet vs noise

Task: Audio Classification

License: Apache 2.0

Description

This dataset has been collected by Edge Impulse teams to recognize the sound of water running from a faucet, even in the presence of other background noise.

You can also follow our tutorial to learn how to collect audio data from microphones, use signal processing to extract the most important information, and train a deep neural network that can tell you whether the sound of running water can be heard in a given clip of audio. Finally, you'll deploy the system to an embedded device and evaluate how well it works.

Compatible Blocks

Feature extraction: , ,
Learning block:

Not sure what to choose? Try out this dataset with the EON Tuner.

Dataset Details

Total Data Items: 18
Total Data Length: 0h 15m 40s
Axis Summary: audio
Labeling Method: single label
Train/Test Split: 88.89% / 11.11%

Usage

Clone the public project.
To clone and use this project, visit the Edge Impulse Studio link, click the Clone button in the top-right corner, and follow the cloning instructions.
Download
- HuggingFace - Soon
- Kaggle - Soon
Import this dataset to your Edge Impulse project
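This project uses the Edge Impulse Exporter Format (info.labels). See this documentation page for more info.
Edge Impulse also supports different data sample formats and dataset annotation formats that you can import into your project to build your edge AI models:
- Studio uploader
- CLI uploader
- CSV Wizard
- Python SDK
- Ingestion API
- Import from S3 buckets
- Upload portals (Enterprise feature)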

Citation

If you use this dataset in your research paper, please cite it using the following BibTeX:

@misc{edgeimpulse_dataset_497635,
    title = {Audio Classification - Faucet vs noise},
    author = {Edge Impulse},
    year = {2024},
    url = {https://studio.edgeimpulse.com/public/497635/latest},
    note = {Apache 2.0}
}