Signal in Noise: Exploring Meaning Encoded in Random Character Sequences with Character-Aware Language Models.
Mark ChuBhargav Srinivasa DesikanEthan O. NadlerDonald Ruggerio Lo SardoElise Darragh-FordDouglas GuilbeaultPublished in: CoRR (2022)
Keyphrases
- language model
- language modeling
- language modelling
- n gram
- document retrieval
- speech recognition
- probabilistic model
- information retrieval
- statistical language models
- retrieval model
- optical character recognition
- hidden markov models
- language models for information retrieval
- ad hoc information retrieval
- document images
- query expansion
- collaborative filtering
- natural language
- feature selection
- search engine