A corpus of Persian literary text.
Shahab RajiMalihe AlikhaniGerard de MeloMatthew StonePublished in: Lang. Resour. Evaluation (2024)
Keyphrases
- text retrieval
- supervised machine learning
- text data
- text analysis
- open domain
- text classification
- broad coverage
- newspaper articles
- natural language text
- text collections
- text corpus
- text mining
- noun phrases
- plain text
- sentence level
- text corpora
- textual data
- anaphora resolution
- linguistic information
- information retrieval
- temporal expressions
- information extraction systems
- lexical features
- training corpus
- keywords
- scientific papers
- named entity disambiguation
- machine learning
- machine translation system
- manually annotated
- text documents
- world knowledge
- document level
- multiword
- text processing
- english words
- spontaneous speech
- web documents
- query expansion
- information retrieval systems