LogoLogo
HomeDocumentationProjectsForumStudio
  • Edge Impulse Datasets
  • Image
    • Image Classification
      • Fire extinguisher safety pin
      • Microscope
    • Object Detection
      • Bottles rack
      • Cans on conveyor belt
      • Cubes colors on conveyor belt
      • Dice
      • Dice colors
      • Self Attention - Cubes on conveyor belt
    • Visual Anomaly Detection
      • Capsule
      • DHT11
      • Fire extinguisher head thread
      • Flat washers
      • Thermostatic valves
    • Visual Regression
      • Vial tubes
  • Audio
    • Audio Classification
      • Faucet vs noise
      • Glass breaking
      • Keyword Spotting
  • Time-series
    • Motion and Vibration Classification
      • Coffee machine stages
      • Continuous motion recognition
    • Sensor Fusion Classification
      • Coffee machine stages
Powered by GitBook
On this page
  • Description
  • Compatible Blocks
  • Dataset Details
  • Usage
  • Citation
Export as PDF
  1. Audio
  2. Audio Classification

Keyword Spotting

PreviousGlass breakingNextMotion and Vibration Classification

Last updated 1 month ago

Task: Audio Classification

License:

Description

This dataset can be used to build an Edge AI project detecting the "Hello World" keyword phrase.

Compatible Blocks

Dataset Details

  • Total Data Items: 2062

  • Total Data Length: 0h 34m 22s

  • Axis Summary: audio

  • Labeling Method: single label

  • Train/Test Split: 79.97% / 20.03%

Training Set

Testing Set

Total Data Items

1649

413

Labels

helloworld, noise, unknown

helloworld, noise, unknown

Total Data Length

0h 27m 29s

0h 6m 53s

Usage

  • Download

    • HuggingFace - Soon

    • Kaggle - Soon

  • Import this dataset to your Edge Impulse project

Citation

If you use this dataset in your research paper, please cite it using the following BibTeX:

@misc{edgeimpulse_dataset_499022,
    title = {Audio Classification - Keyword Spotting},
    author = {Edge Impulse},
    year = {2024},
    url = {https://studio.edgeimpulse.com/public/499022/latest},
    note = {Apache 2.0}
}

Have you ever wanted to make your own "Ok, Google" or "Alexa" keyword spotting model? The helloworld class has been collected by Edge Impulse teams, the added noise samples come from the and the unknown samples are based on a subset of data in the .

You can also follow to guide you through building your keyword spotting model, from data collection to deployment on embedded devices.

Feature extraction: , , ,

Learning block: ,

Not sure what to choose? Try out this dataset with the .

Clone the

To clone and use this project, visit the , click on the Clone button on the top-right corner and follow the cloning instructions.

This project uses the Edge Impulse Exporter Format (info.labels). See this for more info.

Edge Impulse also supports different and that you can import into your project to build your edge AI models:

Microsoft Scalable Noisy Speech Dataset
Google Speech Commands Dataset
our tutorial
Audio (MFCC)
Audio (MFE)
Spectrogram
Raw Data
Classification
Transfer Learning (Keyword Spotting)
EON Tuner
public project
Edge Impulse Studio link
Direct link
documentation page
data sample formats
dataset annotation formats
Studio uploader
CLI uploader
CSV Wizard
Python SDK
Ingestion API
Import from S3 buckets
Upload portals
BSD 3-Clause Clear
Dataset Screenshot