Bridging the Language Gap: Topic Adaptation for Documents with Different Technicality.
Shuang-Hong YangSteven P. CrainHongyuan ZhaPublished in: AISTATS (2011)
Keyphrases
- document content
- expert finding
- document set
- multi document summarization
- topic modeling
- topic segmentation
- programming language
- topic discovery
- document collections
- multilingual documents
- information retrieval
- textual content
- natural language
- information retrieval systems
- topic hierarchy
- text documents
- document retrieval
- language learning
- related documents
- text corpora
- automatic summarization
- indian languages
- keywords
- topic specific
- web documents
- document clustering
- wikipedia articles
- latent topics
- topic models
- document level
- xml documents
- concept space
- document classification
- word frequency
- extensible markup language
- relevant documents
- semantic information
- query topic
- number of relevant documents
- cross document
- parallel corpus
- writing style
- focused crawling
- source language
- logical structure
- topic detection
- retrieved documents
- document representation
- vector space model
- retrieval systems
- text mining
- probabilistic model
- digital libraries
- metadata