Indic-TEDST: Datasets and Baselines for Low-Resource Speech to Text Translation.
Nivedita Sethiya, Saanvi Nair, Chandresh Kumar Maurya
Published in: LREC/COLING (2024)
Keyphrases
- resource allocation
- neural network
- database
- high levels
- machine translation
- experimental results on real world
- query translation
- web resources
- benchmark datasets
- resource management
- information retrieval systems
- cross language information retrieval
- digital libraries
- resource constraints
- website
- synthetic and real datasets
- word recognition
- uci machine learning repository
- resource selection
- web pages