Aggregating skip bigrams into key phrase-based vector space model for web person disambiguation.
Jian Xu, Qin Lu, Zhengzhong Liu
Published in: KONVENS (2012)
Keyphrases
- vector space model
- web documents
- language model
- web people search
- information retrieval
- document retrieval
- retrieval model
- web pages
- n-gram
- tf-idf
- latent semantic indexing
- semantic similarity
- web directories
- document representation
- vector space
- co-occurrence
- document collections
- index terms
- document clustering
- statistical machine translation
- natural language processing
- information extraction
- semantic information
- named entities
- database
- query terms
- bayesian networks
- machine translation
- text retrieval
- database systems
- word sense disambiguation
- search engine
- principal component analysis
- knowledge discovery
- cross-language information retrieval
- part of speech
- domain knowledge
- natural language
- keywords
- document space
- similarity measure