Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model.
Runzhe ZhanXinyi YangDerek F. WongLidia S. ChaoYue ZhangPublished in: CoRR (2024)
Keyphrases
- language model
- word level
- n gram
- statistical machine translation
- information retrieval
- multiword
- language modeling
- word alignment
- cross language retrieval
- document retrieval
- text retrieval
- document level
- machine translation system
- retrieval model
- translation model
- language independent
- query expansion
- language modelling
- probabilistic model
- context sensitive
- cross language
- cross lingual
- text mining
- machine translation
- test collection
- speech recognition
- smoothing methods
- keywords
- ad hoc information retrieval
- mixture model
- word segmentation
- bayesian networks
- cross language information retrieval
- query terms
- document images
- natural language processing
- natural language
- pseudo relevance feedback
- vector space model
- semantic roles
- text documents
- broadcast news
- topic models
- web documents
- text classification
- word clouds