A Dataset of Syntactic-Ngrams over Time from a Very Large Corpus of English Books.
Yoav GoldbergJon OrwantPublished in: *SEM@NAACL-HLT (2013)
Keyphrases
- natural language
- semantic roles
- link grammar
- broad coverage
- natural language text
- parse tree
- recognizing textual entailment
- statistical machine translation
- wide coverage
- person names
- open domain
- semantic role labeling
- dependency parser
- syntactic analysis
- english language
- parallel corpus
- word alignment
- syntactic features
- english words
- n gram
- parallel corpora
- machine translation
- language learning
- unknown words
- syntactic semantic
- highly ambiguous
- probabilistic context free grammars
- question answering
- multiword
- linguistic patterns
- semantic analysis
- manually annotated
- penn treebank
- automatically acquiring
- sentence pairs
- co occurrence
- training corpus
- machine learning