Lessons Learned from Creating a Balanced Corpus from Online Data.
Roberts DargisKristine Levane-PetrovaIlmars PoikansPublished in: Baltic HLT (2020)
Keyphrases
- lessons learned
- data sets
- case study
- data analysis
- data collection
- future directions
- training data
- synthetic data
- data sources
- experimental data
- statistical analysis
- database
- image data
- knowledge discovery
- raw data
- data processing
- original data
- data objects
- spatial data
- missing data
- small number
- data structure
- high quality
- sensor data
- data distribution
- clustering algorithm
- real time