This paper investigates the contribution of frequency bands for automatic voice pathology detection. First, the input voice signal is passed through a number of time-domain band-pass filters. The center frequencies are spaced on an octave scale. Each filter output is then divided into overlapping frames. Auto-correlation function is applied to each block to find the first largest peak, in areas other than near the dc value, and its corresponding lag. Therefore, each frame is having only these two features (peak value and lag). As classifier, we use Gaussian mixture models (GMM) and support vector machine (SVM), separately. Two well-known available databases, one in English (MEEI) and the other one in German (SVD), are used in the investigation. The results demonstrate that the most significant frequency range to detect voice pathology is between 1500 Hz and 3500 Hz. Using this filter band and with only two features, the accuracy is above 97% in case of the MEEI database.
|Number of pages||6|
|Publication status||Published - 2 Apr 2015|
|Event||2014 11th IEEE/ACS International Conference on Computer Systems and Applications, AICCSA 2014 - Doha, Qatar|
Duration: 10 Nov 2014 → 13 Nov 2014
|Conference||2014 11th IEEE/ACS International Conference on Computer Systems and Applications, AICCSA 2014|
|Period||10/11/14 → 13/11/14|
- voice pathology detection