Corpus Synthesis for Zero-Shot ASR Domain Adaptation Using Large Language Models.
Hsuan SuTing-Yao HuHema Swetha KoppulaRaviteja VemulapalliJen-Hao Rick ChangKarren D. YangGautam Varma MantenaOncel TuzelPublished in: ICASSP (2024)
Keyphrases
- domain adaptation
- language model
- speech recognition
- word error rate
- automatic speech recognition
- language modeling
- document level
- document retrieval
- probabilistic model
- n gram
- information retrieval
- cross domain
- query expansion
- semi supervised learning
- semi supervised
- multiple sources
- labeled data
- transfer learning
- test collection
- speech signal
- sentiment classification
- target domain
- retrieval model
- document classification
- test data
- test set
- pseudo relevance feedback
- cross lingual
- unlabeled data
- search engine
- relevance model
- prior knowledge
- data model
- high dimensional
- training data
- learning algorithm
- machine learning