Login / Signup
Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks.
Colin Leong
Joshua Nemecek
Jacob Mansdorfer
Anna Filighera
Abraham Owodunni
Daniel Whitenack
Published in:
EMNLP (2022)
Keyphrases
</>
wide variety
wide range
amazon mechanical turk
expressive power
multi modal
language independent
multimedia
database systems
mobile robot
supply chain
data mining tasks
data sets
digital libraries
machine learning
neural network
audio visual
training dataset
synthetic and real datasets
multi lingual