Farsi Accent Recognition based on speech signal using efficient features extraction and Combining of Classifiers

sharif noughabi, mojtaba; marvi, hossein; darabian, danial

Volume 13, Issue 2 (9-2016) JSDP 2016, 13(2): 91-103 | Back to browse issues page

Mendeley

Zotero

RefWorks

sharif noughabi M, marvi H, darabian D. Farsi Accent Recognition based on speech signal using efficient features extraction and Combining of Classifiers. JSDP 2016; 13 (2) :91-103
URL: http://jsdp.rcisp.ac.ir/article-1-315-en.html

Farsi Accent Recognition based on speech signal using efficient features extraction and Combining of Classifiers

Mojtaba Sharif noughabi ^*

, Hossein Marvi

, Danial Darabian

Abstract: (7753 Views)

Speech recognition has achieved great improvements recently. However, robustness is still one of the big problems, e.g. performance of recognition fluctuates sharply depending on the speaker, especially when the speaker has strong accent and difference Accents dramatically decrease the accuracy of an ASR system. In this paper we apply three new methods of feature extraction including Spectral Centroid Magnitude (SCM), its first order difference (∆SCM ) and Zak transformation to the original speech signal using accents selected from FARSDAT corpus then their performance of these methods have been compared with some common methods such as MFCC. Moreover a new feature based on MFCC algorithm have been proposed in order to use in noisy environments. Five different classifications, including MLP, KNN, PNN, RBF and SVM and their combination have been used to evaluate the performance of each feature extraction methods. Experimental results demonstrate improvement in the recognition rates in our proposed method.

Keywords: Spectral Centroid Magnitude, classifiers combination, Farsi accents, support vector machine, Improved Mel Frequency Cepstral Coefficient

Full-Text [PDF 1595 kb] (2780 Downloads)

Type of Study: Research | Subject: Paper
Received: 2015/01/12 | Accepted: 2016/01/15 | Published: 2016/09/18 | ePublished: 2016/09/18

Send email to the article author

Rights and permissions
	This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Signal and Data Processing

Vote