Survey Of Text Mining: Clustering, Classification, And Retrieval

E-Book Overview

Extracting content from text continues to be an important research problem for information processing and management. Approaches to capture the semantics of text-based document collections may be based on Bayesian models, probability theory, vector space models, statistical models, or even graph theory. As the volume of digitized textual media continues to grow, so does the need for designing robust, scalable indexing and search strategies (software) to meet a variety of user needs. Knowledge extraction or creation from text requires systematic yet reliable processing that can be codified and adapted for changing needs and environments. This book will draw upon experts in both academia and industry to recommend practical approaches to the purification, indexing, and mining of textual information. It will address document identification, clustering and categorizing documents, cleaning text, and visualizing semantic models of text.

E-Book Content

Springer New York Berlin Heidelberg Hong Kong London Milan Paris Tokyo Michael W. Berry Editor Survey of Text Mining Clustering, Classification, and Retrieval Scanned by Velocity With 57 Illustrations Springer Michael W. Berry Department of Computer Science University of Tennessee 203 Claxton Complex Knoxville, TN 37996-3450, USA [email protected] Cover illustration: Visualization of three major clusters in the L.A. Times news database when document vectors are projected into the 3-D subspace spanned by the three most relevant axes determined using COV rescale. This figure appears on p. 118 of the text. Library of Congress Cataloging-in-Publication Data Survey of text mining : clustering, classification, and retrieval / editor, Michael W. Berry. p
You might also like

Spatial Data Modeling For 3d Gis
Authors: Alias Abdul-Rahman , Morakot Pilouk    134    0



A Practical Theory Of Programming
Authors: Eric C.R. Hehner    148    0


Object-oriented Programming Via Fortran 90-95
Authors: Ed Akin    141    0


Introduction To Robust Estimation And Hypothesis Testing
Authors: Rand R. Wilcox    105    0


Linux For Embedded And Real-time Applications
Authors: Doug Abbott    118    0


A Service Creation Environment Based On End To End Composition Of Web Services
Authors: Agarwal V. , Dasgupta K. , Karnik N.    144    0



Protocols For High Efficiency Wireless Networks
Authors: Andreadis A. , Giambene G.    130    0