Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks.
Colin LeongJoshua NemecekJacob MansdorferAnna FiligheraAbraham OwodunniDaniel WhitenackPublished in: CoRR (2022)
Keyphrases
- wide variety
- supply chain
- data sets
- amazon mechanical turk
- benchmark datasets
- wide range
- neural network
- digital libraries
- text classification
- multi modal
- databases
- expressive power
- multi task
- language independent
- database systems
- search engine
- information retrieval
- data mining tasks
- synthetic and real datasets
- grammatical inference
- multimodal interaction
- database