A Multimodal French Corpus of Aligned Speech, Text, and Pictogram Sequences for Speech-to-Pictogram Machine Translation.
Cécile Macaire, Chloé Dion, Jordan Arrigo, Claire Lemaire, Emmanuelle Esperança-Rodier, Benjamin Lecouteux, Didier Schwab
Published in: LREC/COLING (2024)
Keyphrases
- machine translation
- spontaneous speech
- machine translation system
- finite state transducers
- monolingual
- natural language generation
- cross-lingual
- statistical machine translation
- conversational speech
- speech recognition
- audio visual
- automatic speech recognition
- parallel corpora
- speech signal
- natural language processing
- language independent
- cross-language information retrieval
- target language
- dialogue system
- information extraction
- Chinese-English
- word level
- text mining
- hidden Markov models
- POS tagging
- language processing
- parallel corpus
- training corpus
- information retrieval
- word sense disambiguation
- natural language
- broadcast news
- keywords