Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation.
Chunyu Qiang, Peng Yang, Hao Che, Jinba Xiao, Xiaorui Wang, Zhongyuan Wang
Published in: CoRR (2022)
Keyphrases
- data sets
- training data
- data analysis
- data structure
- original data
- data collection
- raw data
- high quality
- database
- data sources
- synthetic data
- image data
- data processing
- data quality
- probability distribution
- co-occurrence
- computer systems
- statistical analysis
- attribute values
- complex data
- data points
- natural language
- database systems
- knowledge base
- feature selection
- information retrieval