Character Profiling in Low-Resource Language Documents.
Tak-sum Wong, John Lee
Published in: ADCS (2019)
Keyphrases
- information retrieval
- document collections
- printed documents
- text documents
- multilingual documents
- language learning
- web documents
- programming language
- natural language
- parallel corpus
- information retrieval systems
- resource allocation
- document classification
- document retrieval
- xml documents
- linguistic analysis
- metadata
- database
- relevant documents
- keywords
- resource management
- optical character recognition
- probabilistic model
- linguistic resources
- indian languages
- character n-grams
- vector space
- source language
- test collection
- digital libraries
- search engine
- chinese text retrieval