Khmer Speech Translation Corpus of the Extraordinary Chambers in the Courts of Cambodia (ECCC).
Soky KakMasato MimuraTatsuya KawaharaSheng LiChenchen DingChenhui ChuSethserey SamPublished in: O-COCOSDA (2021)
Keyphrases
- statistical machine translation
- spontaneous speech
- speech recognition
- parallel corpus
- machine translation
- lexical features
- parallel corpora
- conversational speech
- machine translation system
- spoken language
- cross language information retrieval
- speech signal
- automatic speech recognition
- manually annotated
- english words
- sentence pairs
- test set
- speech processing
- endpoint detection
- chinese english
- spanish language
- language model
- open domain
- speech synthesis
- comparable corpora
- multiword
- recognition engine
- query translation
- text classification
- broadcast news
- training corpus
- linguistic features
- language resources
- noisy environments
- dialogue system
- audio visual
- mono lingual
- cross language