📞 +91-7667918914 | ✉️ iarjset@gmail.com
International Advanced Research Journal in Science, Engineering and Technology
International Advanced Research Journal in Science, Engineering and Technology A Monthly Peer-Reviewed Multidisciplinary Journal
ISSN Online 2393-8021ISSN Print 2394-1588Since 2014
IARJSET aligns to the suggestive parameters by the latest University Grants Commission (UGC) for peer-reviewed journals, committed to promoting research excellence, ethical publishing practices, and a global scholarly impact.
← Back to VOLUME 4, ISSUE 7, JULY 2017

LABELING DOCUMENT CLUSTERS WITH THEMATIC PHRASES

Dr. Y. Sri Lalitha, Dr. N. V. Ganapathi Raju, Dr. O. Srinivasa Rao

👁 4 views📥 0 downloads
Share: 𝕏 f in

Abstract: Document clustering is a powerful technique to detect topics and their relations for information browsing, analysis, and organization. However, clustered documents require post-assignment of descriptive titles to help users interpret the results. Existing techniques often assign labels to clusters based only on the terms that the clustered documents contain, which may not be sufficient for some applications more over term labeling will not give clear meaning of the clustered contents. To solve the problem, a phrase based cluster labeling is considered in this work. The work considers embedding external knowledge to terms using WordNet and provides an approach to derive a theme in the group of documents and label that group with the most appropriate Phrase. Number of experiments conducted on benchmark datasets and observed that results produced are very accurate to the clusters formed.

Keywords: Thematic Phrases, Document clustering, Information Browsing, Analysis, and Organization.

How to Cite:

[1] Dr. Y. Sri Lalitha, Dr. N. V. Ganapathi Raju, Dr. O. Srinivasa Rao, “LABELING DOCUMENT CLUSTERS WITH THEMATIC PHRASES,” International Advanced Research Journal in Science, Engineering and Technology (IARJSET), DOI: 10.17148/IARJSET.2017.4703

Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 International License.