Persian XML Documents Metaheuristic Clustering Based on Structure and Content Similarity

Moradi, Ali; Shahbahrami, Asadollah; Ebrahimi Atani, Reza; Alidoust Nia, Mehran

Signal and Data Processing Journal. A scientific journal officially licensed by the Commission for Scientific Publications of the (MSRT). Publisher: Research Center for Development of Technologies


Volume 13, Issue 2 (9-2016) JSDP 2016, 13(2): 11-23

Moradi A, Shahbahrami A, Ebrahimi Atani R, Alidoust Nia M. Persian XML Documents Metaheuristic Clustering Based on Structure and Content Similarity. JSDP 2016; 13 (2) :11-23
URL: http://jsdp.rcisp.ac.ir/article-1-29-en.html

Ali Moradi, Asadollah Shahbahrami, Reza Ebrahimi Atani*, Mehran Alidoust Nia

University of Guilan

Abstract:

As the number of XML documents grows, organizing them effectively is essential for retrieving useful information from them. One possible solution is to cluster XML documents in order to discover knowledge. A key issue in clustering XML documents is how to measure the similarity between them. Conventional text-document clustering uses similarity measures based only on content, so the structural information contained in XML documents is ignored. This paper proposes a new model, named the matrix space model, that represents both the structural and the content features of XML documents. Based on this model, a Jaccard similarity measure is defined, and the colonial competitive algorithm is used to cluster XML documents. Experimental results show that the proposed model is effective at identifying similar documents, that is, documents that share the same structure and content information. The method can improve clustering accuracy and can make the use of XML data more productive.
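To make the idea of a combined structure-and-content similarity concrete, the following is a minimal sketch, not the authors' matrix space model. It assumes each document can be reduced to the set of (element path, term) pairs that would be the nonzero cells of a binary path-by-term matrix, and it applies the standard Jaccard coefficient to those sets. The function names `path_term_pairs` and `jaccard` and the sample documents are hypothetical, chosen only for illustration.

```python
import xml.etree.ElementTree as ET


def path_term_pairs(xml_text):
    """Return the set of (element path, term) pairs found in a document.

    Assumption: a pair is the nonzero cell of a binary path-by-term matrix.
    Only the text directly inside each element is used; tail text is ignored.
    """
    root = ET.fromstring(xml_text)
    pairs = set()

    def walk(node, prefix):
        path = prefix + "/" + node.tag
        # Whitespace tokenization also works for Persian text.
        for term in (node.text or "").split():
            pairs.add((path, term.lower()))
        for child in node:
            walk(child, path)

    walk(root, "")
    return pairs


def jaccard(a, b):
    """Standard Jaccard coefficient: |A ∩ B| / |A ∪ B|."""
    if not a and not b:
        return 1.0  # convention chosen here for two empty documents
    return len(a & b) / len(a | b)


# Hypothetical sample documents.
doc_a = "<book><title>xml clustering</title><author>ali</author></book>"
doc_b = "<book><title>xml retrieval</title><author>ali</author></book>"
doc_c = "<note><body>xml clustering</body></note>"

pairs_a, pairs_b, pairs_c = map(path_term_pairs, (doc_a, doc_b, doc_c))
sim_ab = jaccard(pairs_a, pairs_b)  # same structure, partly shared content
sim_ac = jaccard(pairs_a, pairs_c)  # shared words, different structure
print(sim_ab, sim_ac)
```

In this sketch `doc_a` and `doc_c` contain the same words but under different element paths, so they share no (path, term) pair. That is the behavior a content-only measure would miss. A clustering step, such as the colonial competitive algorithm named in the abstract, would then operate on these pairwise similarities; it is not shown here.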

Keywords: Clustering, Persian, colonial competitive algorithm


Type of Study: Applicable | Subject: Paper
Received: 2013/04/27 | Accepted: 2016/06/15 | Published: 2016/09/18 | ePublished: 2016/09/18


Rights and permissions
	This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.