Text mining can be broadly defined as a knowledge-intensive process in which a user interacts with a document collection over time by using a suite of analysis tools. In a manner analogous to data mining, text mining seeks to extract useful information from data sources through the identification and exploration of interesting patterns. In the case of text mining, however, the data sources are document collections, and interesting patterns are found not among formalized database records but in the unstructured textual data in the documents in these collections.
Text mining can be broadly defined as a knowledge-intensive process in which a user interacts with a document collection over time by using a suite of analysis tools from data sources. More specifically, text mining is the extraction of useful information from a collection of documents similar to data mining processes. And it seeks to extract useful information from the data sources through the identification and exploration of interesting patterns. Furthermore, it involves the preprocessing of document collections (text categorization, information extraction, term extraction), the storage of the intermediate representations, the techniques to analyze these intermediate representations (such as distribution analysis, clustering, trend analysis, and association rules), and visualization of the results.
The objective of text mining is to exploit hidden information contained in textual documents in various ways and it is the discovery of valuable knowledge in text documents.
Text mining is a new and exciting research area that tries to solve the information overload problem by using techniques from data mining, machine learning, natural language processing (NLP), information extraction and knowledge management.
Text mining is the extraction of useful information from a collection of documents. The objective of text mining is to exploit hidden information contained in textual documents in various ways and it is the discovery of valuable knowledge in text documents. However, it is a challenging issue to find accurate knowledge (or features) in text documents to help users find what they want.
Text mining, which is also referred to as text data mining and roughly equivalent to text analytics, is the process of deriving high-quality information from text. Besides, high-quality information is typically derived through the devising of patterns and trends through means such as statistical pattern learning.