Characterizing Text Complexity with Core Vocabulary Distributional Patterns: Corpus-based Approach.
Marina SolnyshkinaVladimir IvanovValery D. SolovyevPublished in: AIST (Supplement) (2018)
Keyphrases
- linguistic patterns
- supervised machine learning
- broad coverage
- open domain
- keywords
- plain text
- co occurrence
- text data
- newspaper articles
- sentence level
- text corpora
- word sense
- english words
- world knowledge
- noun phrases
- data mining techniques
- text collections
- scientific papers
- text mining
- natural language text
- recognizing textual entailment
- lexico syntactic
- natural language processing
- text corpus
- database
- multiword
- text retrieval
- document corpus
- information extraction systems
- named entity disambiguation
- document level
- anaphora resolution
- textual features
- training corpus
- free text
- semantic information
- information extraction
- search engine