Volume 13, Issue 3 (12-2016)                   JSDP 2016, 13(3): 51-62 | Back to browse issues page



DOI: 10.18869/acadpub.jsdp.13.3.51

XML Persian Abstract Print


Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

BabaAli B. پایه‌گذاری بستری نو و کارآمد در حوزه بازشناسی گفتار فارسی. JSDP. 2016; 13 (3) :51-62
URL: http://jsdp.rcisp.ac.ir/article-1-348-en.html

Assistant Prof University of Tehran
Abstract:   (602 Views)
Although researches in the field of Persian speech recognition  claim  a  thirty-year-old  history in Iran  which has achieved considerable progresses, due to the lack of well-defined experimental framework, outcomes from many of these researches are not comparable to each other and their accurate assessment won’t be possible. The experimental framework includes ASR toolkit and speech database which consists of training, development and test datasets. In recent years,   as a state-of-the-art open-source ASR toolkit; Kaldi has been very well-received and welcomed in the community of the world-ranked speech researchers around the world. considering all aspects, Kaldi is the best option among all of the other ASR toolkits to establish a framework to do research in all languages, including Persian.
In this paper, we chose Fardat as the speech database which is the counterpart of TIMIT for Persian language because not only it has got a standard form  but it’s also accessible for all researchers around the world. Similar to the recipe on TIMIT database, we defined these three sets on the Farsdat: Training, Development and Test sets. After a survey on Kaldi’s components and features, we applied most of state-of-the-art ASR techniques in the Kaldi on the Farsdat based on three sets definition. The best phone error rate on development and test set have been 20.3% and 19.8%. All of the codes and the recipe that was written by author have been submitted to Kaldi repository and they are accessible  for free, so all the reported results  will be easily replicable if you have access to Farsdat database.
Full-Text [PDF 2790 kb]   (222 Downloads)    
Type of Study: Research | Subject: Paper
Received: 2015/03/26 | Accepted: 2016/03/5 | Published: 2017/04/23 | ePublished: 2017/04/23

Add your comments about this article : Your username or Email:
Write the security code in the box

Send email to the article author


© 2015 All Rights Reserved | Signal and Data Processing

Designed & Developed by : Yektaweb