Text Mining
Monday, November 26, 2018 by dev
Text Mining
By: Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred Damerau
Published on 2010-01-08 by Springer Science & Business Media
Data mining is a mature technology. The prediction problem, looking for predictive patterns in data, has been widely studied. Strong methods are available to the practitioner. These methods process structured numerical information, where uniform measurements are taken over a sample of data. Text is often described as unstructured information. So, it would seem, text and numerical data are different, requiring different methods. Or are they? In our view, a prediction problem can be solved by the same methods, whether the data are structured numerical measurements or unstructured text. Text and documents can be transformed into measured values, such as the presence or absence of words, and the same methods that have proven successful for predictive data mining can be applied to text. Yet, there are key differences. Evaluation techniques must be adapted to the chronological order of publication and to alternative measures of error. Because the data are documents, more specialized analytical methods may be preferred for text. Moreover, the methods must be modified to accommodate very high dimensions: tens of thousands of words and documents. Still, the central themes are similar.
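The transformation the description mentions, turning documents into measured values such as the presence or absence of words, can be illustrated with a minimal sketch. This is not code from the book: the function names, the simple regular-expression tokenizer, and the two sample documents are all assumptions made for illustration.

```python
import re

def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())

def build_vocabulary(documents):
    """Return a sorted list of every distinct word seen in the documents."""
    vocab = set()
    for doc in documents:
        vocab.update(tokenize(doc))
    return sorted(vocab)

def to_binary_vector(text, vocabulary):
    """Encode one document as 1 (word present) or 0 (word absent) per vocabulary entry."""
    present = set(tokenize(text))
    return [1 if word in present else 0 for word in vocabulary]

# Hypothetical sample documents, invented for this illustration.
documents = [
    "Stock prices rose sharply",
    "Prices fell as stock markets closed",
]

vocabulary = build_vocabulary(documents)
matrix = [to_binary_vector(doc, vocabulary) for doc in documents]

print(vocabulary)
for row in matrix:
    print(row)
```

Each row of `matrix` is a fixed-length numerical record, so it could in principle be passed to the same kind of prediction method used for structured data. With a realistic collection the vocabulary would grow to tens of thousands of words, which is the high-dimensionality issue the description raises.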
This book was ranked 16th by Google Books for the keyword "mining".
The book, written by Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, and Fred Damerau, has the Book ID NZteXd4qf9sC and the ETag "tOC05pfpjgM".
Published by Springer Science & Business Media on 2010-01-08, the book has the ISBN-13 9780387345550 and the ISBN-10 0387345558.
Reading mode for text is not available (false); reading mode for image is available (true).
The book has 237 pages, has the print type BOOK, and is listed under the category Computers.
The book was rated by 3 raters, with an average rating of 3.5.
The eBook maturity (adult content) status is NOT_MATURE.
The book is written in English (en).
The eBook is available as PDF (true) but not as ePub (false).