Unmasking the Myth of Effortless Big Data - Making an Open Source Multi-lingual Infrastructure and Building Language Resources from Scratch.
Linda WiechetekKatri Hiovain-AsikainenInga Lill Sigga MikkelsenSjur N. MoshagenFlammie PirinenTrond TrosterudBørre GaupPublished in: LREC (2022)
Keyphrases
- big data
- multi lingual
- language resources
- open source
- machine translation
- cross lingual
- language independent
- cloud computing
- information access
- data analysis
- social media
- data management
- data processing
- language identification
- metadata
- business intelligence
- knowledge discovery
- information retrieval
- broadcast news
- databases
- natural language processing
- artificial intelligence
- machine learning
- data mining