ELITR Minuting Corpus: A Novel Dataset for Automatic Minuting from Multi-Party Meetings in English and Czech.
Anna NedoluzhkoMuskaan SinghMarie HledíkováTirthankar GhosalOndrej BojarPublished in: LREC (2022)
Keyphrases
- multi party
- privacy preserving
- link grammar
- open domain
- cross language
- statistical machine translation
- person names
- broad coverage
- wide coverage
- floor control
- description language
- virtual humans
- english words
- cl sr
- artificial intelligence
- language independent
- manually generated
- automatically generated
- penn treebank
- training corpus
- mental states
- parallel corpora
- spoken language
- machine translation system
- speech retrieval
- cross lingual
- intelligent agents