Uncovering Languages from written documents.
Nikitas N. KaranikolasPanagiotis OuranosPublished in: Panhellenic Conference on Informatics (2014)
Keyphrases
- multilingual documents
- arabic language
- document collections
- document retrieval
- information retrieval
- document classification
- databases
- information retrieval systems
- linguistic resources
- database
- indian languages
- manually constructed
- expressive power
- language independent
- document clustering
- text retrieval
- multilingual information retrieval
- keywords
- text documents
- web documents
- metadata
- cross lingual
- relevant documents
- parallel corpora
- retrieval systems
- semantic information
- vector space model
- source language
- comparable corpora
- xml documents
- digital libraries