SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.

Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James V. Miranda, Jennifer Santoso, Elyanah Aco, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno Pepijn Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Railey Montalan, Ryan Ignatius, Joanito Agili Lopo, William Nixon, Börje F. Karlsson, James Jaya, Ryandito Diandaru, Yuze Gao, Patrick Amadeus Irawan, Bin Wang, Jan Christian Blaise Cruz, Chenxi Whitehouse, Ivan Halim Parmonangan, Maria Khelli, Wenyu Zhang, Lucky Susanto, Reynard Adha Ryanda, Sonny Lazuardi Hermawan, Dan John Velasco, Muhammad Dehan Al Kautsar, Willy Fitra Hendria, Yasmin Moslem, Noah Flynn, Muhammad Farid Adilazuarda, Haochen Li, Johanes Lee, R. Damanhuri, Shuo Sun, Muhammad Reza Qorib, Amirbek Djanibekov, Wei Qi Leong, Quyet V. Do, Niklas Muennighoff, Tanrada Pansuwan, Ilham Firdausi Putra, Yan Xu, Ngee Chia Tai, Ayu Purwarianti, Sebastian Ruder, William-Chandra Tjhi, Peerat Limkonchotiwat, Alham Fikri Aji, Sedrick Keh, Genta Indra Winata, Ruochen Zhang, Fajri Koto, Zheng Xin Yong, Samuel Cahyawijaya
Published in: CoRR (2024)
Keyphrases
  • multimodal data
  • benchmark suite
  • cross lingual
  • multimedia databases
  • cross modal
  • similarity search
  • machine translation
  • kernel trick
  • data sets
  • visual features
  • document clustering