PARADISE: Exploiting Parallel Data for Multilingual Sequence-to-Sequence Pretraining.
Machel ReidMikel ArtetxePublished in: CoRR (2021)
Keyphrases
- input data
- data sets
- database
- data sources
- sensor data
- statistical analysis
- knowledge discovery
- data points
- data collection
- data analysis
- prior knowledge
- sequential data
- data structure
- high quality
- missing data
- training data
- event sequences
- genomic sequences
- cross language information retrieval
- data quality
- raw data
- data mining techniques
- image data
- hidden markov models
- databases