AutoPrep: An Automatic Preprocessing Framework for In-The-Wild Speech Data.
Jianwei YuHangting ChenYanyao BianXiang LiYi LuoJinchuan TianMengyang LiuJiayi JiangShuai WangPublished in: ICASSP (2024)
Keyphrases
- data sets
- preprocessing
- high quality
- data sources
- data processing
- data analysis
- data collection
- knowledge discovery
- data distribution
- statistical analysis
- database
- image data
- high dimensional data
- computer systems
- probability distribution
- probabilistic model
- training data
- machine learning
- small number
- end users
- xml documents
- data structure
- main contribution
- synthetic data
- experimental data
- raw data
- original data
- large scale data sets