100,000 Podcasts: A Spoken English Document Corpus.
Ann Clifton, Sravana Reddy, Yongze Yu, Aasish Pappu, Rezvaneh Rezapour, Hamed R. Bonab, Maria Eskevich, Gareth J. F. Jones, Jussi Karlgren, Ben Carterette, Rosie Jones
Published in: COLING (2020)
Keyphrases
- document corpus
- spoken language
- broadcast news
- document clustering
- information retrieval
- natural language
- machine translation
- speech recognition
- language learning
- keywords
- higher education
- automatic speech recognition
- speech retrieval
- topic detection
- query translation
- professional development
- cross language
- cross lingual
- blended learning
- language model
- information retrieval systems
- data mining