The GV-LEx corpus of tales in French - Text and speech corpora enriched with lexical, discourse, structural, phonemic and prosodic annotations.
David Doukhan, Sophie Rosset, Albert Rilliard, Christophe d'Alessandro, Martine Adda-Decker
Published in: Lang. Resour. Evaluation (2015)
Keyphrases
- lexical features
- text to speech synthesis
- text corpus
- text to speech
- text corpora
- spontaneous speech
- word frequency
- speech recognition
- prosodic features
- topic segmentation
- anaphora resolution
- natural language processing
- document corpus
- keywords
- text data
- linguistic information
- recognizing textual entailment
- natural language text
- training corpus
- speech synthesis
- natural language
- word pairs
- syntactic features
- text documents
- synthesized speech
- text classification
- reference resolution
- text collections
- spoken language
- automatic speech recognition
- information retrieval
- discourse structure
- annotated corpus
- textual entailment
- word sense
- linguistic features
- lexical resources
- automatic summarization
- text mining
- statistical machine translation
- multiword