Novel Language Resources for Hindi: An Aesthetics Text Corpus and a Comprehensive Stop Lemma List.
Gayatri Venugopal-WairagadeJatinderkumar R. SainiDhanya PramodPublished in: CoRR (2020)
Keyphrases
- text corpus
- language resources
- machine translation
- text corpora
- cross language information retrieval
- text documents
- cross lingual
- query translation
- named entities
- natural language processing
- metadata
- broadcast news
- parallel corpora
- information extraction
- statistical machine translation
- wikipedia articles
- digital libraries
- natural language
- web search
- target language
- computational linguistics