Building re-usable dictionary repositories for real-world text mining.
Shantanu GodboleIndrajit BhattacharyaAjay GuptaAshish VermaPublished in: CIKM (2010)
Keyphrases
- text mining
- real world
- data mining
- wide range
- natural language processing
- text classification
- web mining
- information extraction
- digital libraries
- metadata
- biomedical literature
- sparse representation
- synthetic data
- genia corpus
- textual data
- data repositories
- data sets
- knowledge representation
- website
- machine learning
- software engineering
- real life
- expert systems
- case study
- web services
- document clustering
- knowledge base
- information retrieval
- textual documents
- text categorisation
- record keeping
- neural network