Harnessing the Zero-Shot Power of Instruction-Tuned Large Language Model in End-to-End Speech Recognition.
Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi
Published in: CoRR (2023)
Keyphrases
- end-to-end
- speech recognition
- language model
- language modeling
- retrieval model
- n-gram
- document retrieval
- information retrieval
- probabilistic model
- automatic speech recognition
- mixture model
- query expansion
- test collection
- query terms
- speech signal
- word error rate
- statistical machine translation
- speech recognition systems
- handwriting recognition
- multimedia
- relevance model
- translation model