Authors: | Anthony Larcher & Kong Aik Lee & Sylvain Meignier |
---|---|
Version: | 1.0 of 2014/10/29 |
When using SIDEKIT for research, please cite:
Kong Aik Lee and Anthony Larcher, Title of the paper to come, in IEEE Transaction on Audio, Speech and Language Processing, issue, year, pages...
Acoustic features extraction
- Linear-Frequency Cepstral Coefficients (LFCC)
- Mel-Frequency Cepstral Coefficients (MFCC)
- RASTA filtering
- Energy-based Voice Activity Detection (VAD)
- normalization (CMS, CMVN, Short Term Gaussianization)
Modeling and classification
- Gaussian Mixture Models (GMM)
- i - vectors
- Probabilistic Linear Discriminant Analysis (PLDA)
- Joint Factor Analysis (JFA)
- Support Vector Machine (SVM)
Presentation of the results (DET plot)