Hierarchical Document Clustering

Author(s): Benjamin C.M. Fung (Simon Fraser University, Canada), Ke Wang (Simon Fraser University, Canada), and Martin Ester (Simon Fraser University, Canada)
Copyright: 2005
Pages: 5
Source title: Encyclopedia of Data Warehousing and Mining
Source Author(s)/Editor(s): John Wang (Montclair State University, USA)
DOI: 10.4018/978-1-59140-557-3.ch105

Keywords: Data Mining and Databases / Data Warehousing / Information Science Reference / Library & Information Science

Abstract

Document clustering is the automatic grouping of text documents into clusters such that documents within a cluster are highly similar to one another but dissimilar to documents in other clusters. Unlike document classification (Wang, Zhou, & He, 2001), clustering is given no labeled documents; hence, it is also known as unsupervised learning. Hierarchical document clustering organizes the clusters into a tree, or hierarchy, that facilitates browsing. The parent-child relationship among the nodes in the tree can be viewed as a topic-subtopic relationship in a subject hierarchy such as the Yahoo! directory.
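
To make the idea of a cluster tree concrete, the following is a minimal sketch of generic bottom-up (agglomerative) clustering. It is not the method described in the chapter; it assumes bag-of-words term-frequency vectors, cosine similarity, and merging by summing term counts, and the names tf_vector, cosine, and build_hierarchy are hypothetical helpers introduced only for illustration.

```python
import math
from collections import Counter


def tf_vector(text):
    """Bag-of-words term-frequency vector for one document (assumed representation)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def build_hierarchy(docs):
    """Repeatedly merge the two most similar clusters until one root remains.

    Each cluster is a pair (tree, vector). A tree is either a document
    index (leaf) or a (left, right) tuple (internal node, i.e. a parent
    topic with two subtopics). The vector is the summed term counts of
    all documents in the cluster.
    """
    clusters = [(i, tf_vector(d)) for i, d in enumerate(docs)]
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                sim = cosine(clusters[i][1], clusters[j][1])
                if best is None or sim > best[0]:
                    best = (sim, i, j)
        _, i, j = best
        merged = (
            (clusters[i][0], clusters[j][0]),
            clusters[i][1] + clusters[j][1],
        )
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return clusters[0][0]


# Toy corpus (illustrative data, not from the chapter).
docs = [
    "stock market trading prices",
    "stock prices fall market",
    "football match goal score",
    "football team score goal",
]
tree = build_hierarchy(docs)
print(tree)
```

On this toy corpus the two finance documents merge into one subtree and the two sports documents into another, so the root has two children that play the role of topics, each with two documents beneath it. The pairwise search is quadratic per merge, which is acceptable for a sketch but would need a more efficient strategy for a real collection.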
