Identification of Document Language is Not yet a Completely Solved Problem.
Joaquim Ferreira da SilvaGabriel Pereira LopesPublished in: CIMCA/IAWTIC (2006)
Keyphrases
- language learning
- programming language
- retrieval systems
- document collections
- intended meaning
- document clustering
- document retrieval
- document images
- web documents
- information retrieval
- information retrieval systems
- natural language
- keywords
- database
- document classification
- vector space model
- modeling language
- user queries
- object oriented
- machine learning
- tf idf
- target language
- specification language
- source language
- document content