Building a Data Infrastructure for a Mid-Resource Language: The Case of Catalan.
Aitor Gonzalez-Agirre, Montserrat Marimon, Carlos Rodríguez Penagos, Javier Aula-Blasco, Irene Baucells de la Peña, Carme Armentano-Oller, Jorge Palomar-Giner, Baybars Kulebi, Marta Villegas
Published in: LREC/COLING (2024)
Keyphrases
- data sets
- statistical analysis
- database
- data collection
- data analysis
- missing data
- raw data
- data quality
- data structure
- high quality
- input data
- computer systems
- complex data
- data sources
- probability distribution
- training data
- data processing
- website
- databases
- data distribution
- sensor data
- data transfer
- infrared
- synthetic data
- programming language
- image data
- end users
- computer science
- artificial intelligence