CEN@Amrita: Information Retrieval on CodeMixed Hindi-English Tweets Using Vector Space Models.
Shivkaran SinghM. Anand KumarSoman K. PPublished in: FIRE (Working Notes) (2016)
Keyphrases
- vector space model
- information retrieval
- machine translation
- language identification
- language model
- retrieval model
- tf idf
- latent semantic indexing
- vector space
- document clustering
- indian languages
- information retrieval systems
- semantic similarity
- web documents
- cross lingual
- target language
- term weighting
- search engine
- semantic information
- language modeling
- natural language
- document retrieval
- information extraction
- agglomerative hierarchical clustering
- named entities
- term weighting schemes
- ir models
- trec collections
- text mining
- relevance model
- query expansion
- question answering
- cross language information retrieval
- test collection
- retrieval systems
- retrieval effectiveness