A Language Modeling Text Mining Approach to the Annotation of Protein Community.
Xiaodan Zhang, Daniel Duanqing Wu, Xiaohua Zhou, Xiaohua Hu
Published in: BIBE (2006)
Keyphrases
- language modeling
- text mining
- information retrieval
- language model
- text classification
- query expansion
- retrieval model
- cross lingual
- probabilistic model
- relevance model
- n gram
- information extraction
- natural language processing
- active learning
- web mining
- text documents
- metadata
- protein sequences
- image annotation
- improvements in retrieval effectiveness
- document clustering
- information retrieval systems
- data analysis
- biomedical literature
- named entities
- word segmentation
- statistical language models
- dirichlet prior
- test collection
- vector space model
- document retrieval
- databases
- text categorization
- knn
- machine learning