Topic-Based Vietnamese News Document Filtering in the BioCaster Project.
Vu Cong Duy Hoang, Nguyen Le Nguyen, Dien Dinh, Nigel Collier
Published in: ALPIT (2007)
Keyphrases
- news articles
- document representation
- topic detection and tracking
- news stories
- document content
- keywords
- topic tracking
- text documents
- textual content
- blog posts
- topic detection
- topic discovery
- information filtering
- scientific papers
- document set
- document clustering
- case study
- hot topics
- news items
- topic models
- information retrieval
- news topics
- document retrieval
- online news
- document images
- web news
- automatic summarization
- topic hierarchy
- document corpus
- news corpus
- concept space
- latent topics
- document classification
- document level
- named entity recognition
- project management
- vector space
- relevant documents
- retrieval systems
- language model
- search engine
- probabilistic model
- text mining
- information retrieval systems
- opinion retrieval
- software development
- short texts
- named entities
- latent dirichlet allocation
- text content