Revisiting the idea of a representative linguistic corpus.
Alexandru DinuAdriana VladPublished in: COMM (2022)
Keyphrases
- natural language text
- linguistic features
- natural language processing
- linguistic patterns
- reference resolution
- manually annotated
- machine learning
- linguistic information
- case study
- natural language
- knowledge base
- information systems
- information retrieval
- supervised machine learning
- training data
- probabilistic model
- information extraction
- test set
- linguistic knowledge
- open domain
- neural network
- real time