Indic-TEDST: Datasets and Baselines for Low-Resource Speech to Text Translation.

Nivedita Sethiya Saanvi Nair Chandresh Kumar Maurya

Published in: LREC/COLING (2024)

Keyphrases

resource allocation
neural network
database
high levels
machine translation
experimental results on real world
query translation
web resources
benchmark datasets
resource management
information retrieval systems
cross language information retrieval
digital libraries
resource constraints
website
synthetic and real datasets
word recognition
uci machine learning repository
resource selection
web pages