Extraction and search of chemical formulae in text documents on the web.
Bingjun Sun, Qingzhao Tan, Prasenjit Mitra, C. Lee Giles
Published in: WWW (2007)
Keyphrases
- text documents
- information extraction
- extraction patterns
- text mining
- text categorization
- textual data
- topic models
- text classification
- web documents
- document clustering
- keywords
- document classification
- news articles
- wordnet
- textual information
- text data
- user queries
- automatic text categorization
- k nearest neighbor
- bag of words
- named entities
- computer vision
- neural network
- databases
- knn
- probabilistic model
- object recognition
- natural language
- information retrieval
- real world
- database