
MFCC parameters overview
Feature output format
The “Processed features” array has the following format:- Column major, from low cepstrum to high.
- Number of rows will be equal to the parameter “Number of coefficients” (or number of cepstra)
- Each column represents a single frame
Audio MFCC parameters
Compatible with the DSP Autotuner
Picking the right parameters for DSP algorithms can be difficult. It often requires a lot of experience and experimenting. The autotuning function makes this process easier by looking at the entire dataset and recommending a set of parameters that is tuned for your dataset.- Number of coefficients: Number of cepstral coefficients to keep after applying Discrete Cosine Transform
- Frame length: The length of each frame in seconds
- Frame stride: The step between successive frame in seconds
- Filter number: The number of triangular filters applied to the spectrogram
- FFT length: The FFT size
- Low frequency: Lowest band edge of Mel-scale filterbanks
- High frequency: Highest band edge of Mel-scale filterbanks
- Window size: The size of sliding window for local cepstral mean normalization. Windows size must be odd.
- Coefficient: The pre-emphasizing coefficient to apply to the input signal (0 equals to no filtering)
- Note: Shift has been removed and set to 1 for all future projects. Older & existing projects can still change this value or use an existing value.