In this paper, we will introduce an intelligent system to edit and spell check Persian texts. The goal is editing and preprocessing Persian texts for natural language processing tasks. This system is based on an expandable and engineering approach and is composed of three subsystems: Persian text editor, spell checker and stemmer. These parts interact with each other to edit texts. To do this, the stemmer subsystem process each word in the text if the subsystem could not find a stem in the lexicon, the word will be recognized as an incorrect word. Then, the spell checker provides a list of suggestions to correct the wrong word. Subsequently, the editor subsystem edits the text based on the standards of the Academy of Persian Language and Literature. Our evaluation shows nearly 92%, 95% and 96% precision numbers for editor, stemmer and spell checker subsystems, respectively.
Rights and permissions | |
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. |