The Natural Stories corpus: a reading-time corpus of English texts containing rare syntactic constructions.
Richard FutrellEdward GibsonHarry J. TilyIdan BlankAnastasia VishnevetskySteven T. PiantadosiEvelina FedorenkoPublished in: Lang. Resour. Evaluation (2021)
Keyphrases
- natural language text
- training corpus
- natural language
- english words
- link grammar
- recognizing textual entailment
- open domain
- broad coverage
- person names
- newspaper articles
- linguistic patterns
- manually annotated
- statistical machine translation
- linguistic features
- information extraction systems
- word sense
- wide coverage
- multiword
- story link detection
- parallel corpus
- keywords
- syntactic analysis
- text corpus
- semantic roles
- machine translation
- information extraction
- world knowledge
- dependency parser
- linguistic information
- syntactic structures
- machine translation system
- text corpora
- parse tree
- cross lingual
- language learning
- probabilistic context free grammars
- penn treebank
- text classification