Utilizing social media data through similarity-based text normalization for LVCSR language modeling.
Ananlada ChotimongkolKwanchiva ThangthaiChai WutiwiwatchaiPublished in: O-COCOSDA (2014)
Keyphrases
- language modeling
- social media data
- information retrieval
- language model
- finite state transducers
- retrieval model
- social media
- query expansion
- probabilistic model
- cross lingual
- n gram
- text classification
- anchor text
- keywords
- text mining
- massive amounts
- information retrieval systems
- word segmentation
- web documents
- semantic information
- test collection
- retrieval effectiveness
- information extraction
- relevance model
- active learning