Cross-Modal ASR Post-Processing System for Error Correction and Utterance Rejection.
Jing Du, Shiliang Pu, Qinbo Dong, Chao Jin, Xin Qi, Dian Gu, Ru Wu, Hongwei Zhou
Published in: CoRR (2022)
Keyphrases
- error correction
- post processing
- cross modal
- speech recognition
- automatic speech recognition
- multi modal
- preprocessing
- multimedia retrieval
- image retrieval
- visual recognition
- error detection
- language model
- multimedia databases
- human experts
- watermarking scheme
- visual data
- visual similarity
- image annotation
- databases
- visual features
- multimedia
- information retrieval