Keyword Spotting

Task: Audio Classification

License: BSD 3-Clause Clear

Dataset Screenshot

Description

Have you ever wanted to make your own "Ok, Google" or "Alexa" keyword spotting model? The helloworld class has been collected by Edge Impulse teams, the added noise samples come from the Microsoft Scalable Noisy Speech Dataset and the unknown samples are based on a subset of data in the Google Speech Commands Dataset.

This dataset can be used to build an Edge AI project detecting the "Hello World" keyword phrase.

You can also follow our tutorial to guide you through building your keyword spotting model, from data collection to deployment on embedded devices.

Compatible Blocks

Not sure what to choose? Try out this dataset with the EON Tuner.

Dataset Details

  • Total Data Items: 2062

  • Total Data Length: 0h 34m 22s

  • Axis Summary: audio

  • Labeling Method: single label

  • Train/Test Split: 79.97% / 20.03%

Training Set

Testing Set

Total Data Items

1649

413

Labels

helloworld, noise, unknown

helloworld, noise, unknown

Total Data Length

0h 27m 29s

0h 6m 53s

Usage

Citation

If you use this dataset in your research paper, please cite it using the following BibTeX:

@misc{edgeimpulse_dataset_499022,
    title = {Audio Classification - Keyword Spotting},
    author = {Edge Impulse},
    year = {2024},
    url = {https://studio.edgeimpulse.com/public/499022/latest},
    note = {Apache 2.0}
}

Last updated