Publikationsansicht

Wordnet improves Text Document Clustering (2003)

Abstract
Text document clustering plays an important role in providing intuitive navigation and browsing mechanisms by organizing large amounts of information into a small number of meaningful clusters. The bag of words representation used for these clustering methods is often unsatisfactory as it ignores relationships between important terms that do not co-occur literally. In order to deal with the problem, we integrate background knowledge --- in our application Wordnet --- into the process of clustering text documents.

Details der Publikation
Download http://citeseer.ist.psu.edu/595803.html
Quelle http://www.aifb.uni-karlsruhe.de/WBS/sst/Research/Publications/sw_sigir2003_submit.pdf
Herausgeber unknown
Mitarbeiter The Pennsylvania State University CiteSeer Archives
Archiv CiteSeer (United States)
Keywords Andreas Hotho,Steffen Staab,Gerd Stumme Wordnet improves Text Document Clustering
Sprache Englisch