A fuzzy clustering approach for finding similar documents using a novel similarity measure
dc.contributor.author | Saracoglu, Ridvan | |
dc.contributor.author | Tutuncu, Kemal | |
dc.contributor.author | Allahverdi, Novruz | |
dc.date.accessioned | 2020-03-26T17:16:55Z | |
dc.date.available | 2020-03-26T17:16:55Z | |
dc.date.issued | 2007 | |
dc.department | Selçuk Üniversitesi | en_US |
dc.description.abstract | Searching for similar documents has a crucial role in document management. This paper aims for developing a fast and high quality method of searching similar documents based on fuzzy clustering in large document collections. In order to perform these requirements, a two layers structure is proposed. Formerly, finding the similarity in documents is based on the strategy that uses word-by-word comparison. The proposed method in this study uses two layers structure and lets the documents pass through it to find the similarities. In this system, predefined fuzzy clusters are used to extract feature vectors of related documents for finding similar documents of them. Similarity measure is estimated based on these vectors. To do this, a distance based similarity measure is proposed. It has been seen in empirical results that the proposed system uses new similarity measure and has better performance compared with conventional similarity measurement systems. (c) 2006 Elsevier Ltd. All rights reserved. | en_US |
dc.identifier.doi | 10.1016/j.eswa.2006.06.002 | en_US |
dc.identifier.endpage | 605 | en_US |
dc.identifier.issn | 0957-4174 | en_US |
dc.identifier.issn | 1873-6793 | en_US |
dc.identifier.issue | 3 | en_US |
dc.identifier.scopusquality | Q1 | en_US |
dc.identifier.startpage | 600 | en_US |
dc.identifier.uri | https://dx.doi.org/10.1016/j.eswa.2006.06.002 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12395/21168 | |
dc.identifier.volume | 33 | en_US |
dc.identifier.wos | WOS:000245754400007 | en_US |
dc.identifier.wosquality | Q1 | en_US |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.language.iso | en | en_US |
dc.publisher | PERGAMON-ELSEVIER SCIENCE LTD | en_US |
dc.relation.ispartof | EXPERT SYSTEMS WITH APPLICATIONS | en_US |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.selcuk | 20240510_oaig | en_US |
dc.subject | text mining | en_US |
dc.subject | document similarity | en_US |
dc.subject | fuzzy clustering | en_US |
dc.subject | fuzzy similarity measure | en_US |
dc.subject | distance based similarity | en_US |
dc.title | A fuzzy clustering approach for finding similar documents using a novel similarity measure | en_US |
dc.type | Article | en_US |