Europarl-ASR: A Large Corpus of Parliamentary Debates for Streaming ASR Benchmarking and Speech Data Filtering/Verbatimization.
Gonçal V. Garcés Díaz-MuníoJoan Albert Silvestre-CerdàJavier JorgeAdrià Giménez-PastorJavier Iranzo-SánchezPau Baquero-ArnalNahuel RosellóAlejandro Pérez González de MartosJorge CiveraAlbert SanchísAlfons JuanPublished in: Interspeech (2021)
Keyphrases
- data sets
- raw data
- data distribution
- speech recognition
- data processing
- database
- synthetic data
- data collection
- image data
- original data
- probability distribution
- knowledge discovery
- data sources
- data analysis
- pattern recognition
- missing data
- data points
- sensor data
- data quality
- noisy environments
- spontaneous speech
- statistical analysis
- hidden markov models
- prior knowledge
- databases
- real time