Review of Practices of Collecting and Annotating Texts in the Learner Corpus REALEC.
Olga VinogradovaOlga LyashevskayaPublished in: TSD (2022)
Keyphrases
- training corpus
- natural language text
- manually annotated
- information extraction systems
- world knowledge
- text corpus
- english words
- newspaper articles
- manual annotation
- word sense
- data collection
- learning materials
- linguistic patterns
- semantic annotation
- learning process
- natural language generation
- scientific papers
- learning environment
- metadata
- e learning
- text classification
- co occurrence
- machine learning
- case study
- writing style
- text corpora
- linguistic information
- linguistic features
- coreference resolution
- text documents
- test set
- search engine