Contextual Embeddings for Arabic-English Code-Switched Data.
Caroline SabtyMohamed IslamSlim AbdennadherPublished in: WANLP@COLING (2020)
Keyphrases
- data sets
- raw data
- data collection
- data processing
- data quality
- synthetic data
- data sources
- high quality
- database
- spatial data
- missing data
- training data
- high dimensional data
- data points
- probability distribution
- principal component analysis
- language model
- data mining techniques
- data structure
- image data
- statistical analysis
- similarity search
- natural language
- feature space
- data distribution
- vector space
- learning algorithm
- data analysis