JESC: Japanese-English Subtitle Corpus.
Reid PryzantYoungjoo ChungDan JurafskyDenny BritzPublished in: LREC (2018)
Keyphrases
- link grammar
- person names
- open domain
- statistical machine translation
- parallel corpus
- wide coverage
- native speakers
- broad coverage
- english words
- english language
- sentence pairs
- linguistic features
- mono lingual
- language learning
- penn treebank
- machine translation
- unknown words
- semantic roles
- natural language
- multiword
- cross lingual
- parallel corpora
- lexical units
- cross language information retrieval
- training corpus
- computer assisted language learning
- japanese language
- pos tagging
- chinese characters
- word sense
- machine learning
- machine translation system
- spontaneous speech
- stop words
- foreign language
- natural language text
- comparable corpora
- tree bank
- manually annotated
- word sense disambiguation
- sentence level
- spoken language