Word-Level Representation From Bytes For Language Modeling.
Chu-Tak Lee, Qipeng Guo, Xipeng Qiu
Published in: CoRR (2022)
Keyphrases
- language modeling
- word level
- language model
- n-gram
- word segmentation
- information retrieval
- retrieval model
- Chinese text retrieval
- language independent
- cross-lingual
- query expansion
- document images
- machine translation
- probabilistic model
- word recognition
- document analysis
- document level
- document retrieval
- character recognition
- text classification
- relevance model
- similarity measure
- speech recognition
- translation model
- information retrieval systems
- retrieval effectiveness