Volume 12, Issue 3 (12-2015)                   JSDP 2015, 12(3): 3-14 | Back to browse issues page

XML Persian Abstract Print


university of Tehran
Abstract:   (7027 Views)

Nowadays, a large volume of documents is generated daily. These documents generated by different persons, thus, the documents contain spelling errors. These spelling errors cause quality of the documents are decrease. Therefore, existence of automatic writing assistance tools such as spell checker/corrector can help to improve their quality. Context-sensitive are misspelled words that have been wrongly converted into another word of the language. Thus, detection of real-word errors requires discourse analysis. In this paper, we propose a language independent discourse-aware discriminative ranker and use information of whole document and a log-linear model for ranking. To evaluate our method, we augment it into two context-sensitive spellchecker systems one is based on Statistical Machine Translation (SMT) and the other is based on language model. For more evaluation, we also use two different tests. Proposed method cause outperform about 17% over the SMT base approach with respect to detection and correction recall.

Full-Text [PDF 1740 kb]   (1768 Downloads)    
Type of Study: Research | Subject: Paper
Received: 2014/02/26 | Accepted: 2015/09/8 | Published: 2016/01/4 | ePublished: 2016/01/4

Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.