TY  - JOUR
T1  - Improved Clustering Persian Text Based on Keyword Using Linguistic and Thesaurus Knowledge
TT  - A New Method for Automatic Indexing and Keyword Extraction for Information Retrieval and Text Clustering
JF  - jsdp
JO  - jsdp
VL  - 13
IS  - 1
UR  - http://jsdp.rcisp.ac.ir/article-1-139-en.html
Y1  - 2016
SP  - 87
EP  - 100
KW  - Keyword Extraction
KW  - Thesaurus
KW  - Computational Linguistics
KW  - Information Retrieval
N2  - Persian words take diverse written forms, and a fixed set of rules cannot cover every grammatical variant of a word, so automatically extracting keywords from Persian texts is difficult and complex. This work attempts to provide more meaningful keywords by using linguistic information and a thesaurus. Because the thesaurus is a structured network of terms, the keyword set can be extended with equivalent words, related words, and words in hierarchical relationships. This increases the agreement between users' search terms and the keywords of a text, and therefore improves search recall. In the first stage, unimportant and general words are removed. To express the relative importance of the words that remain, each word is assigned a numerical weight that indicates how strongly it relates to the subject compared with the other words used in the text. Finally, a combined operation that makes use of the thesaurus extracts more meaningful keywords that indicate the hierarchical subject category of the text for information retrieval. Test results on several texts covering different topics show the accuracy of the proposed method and its ability to extract keywords that match user needs.
M3  - 
ER  - 