A Standardized Project Gutenberg Corpus for Statistical Analysis of Natural Language and Quantitative Linguistics.
Martin Gerlach, Francesc Font-Clos
Published in: Entropy (2020)
Keyphrases
- natural language
- statistical analysis
- natural language processing
- natural language text
- case study
- natural language interface
- machine learning
- knowledge representation
- natural language understanding
- clinical data
- european project
- neural network
- natural language generation
- semantic representation
- language processing
- qualitative and quantitative
- statistical methods
- data analysis
- project management
- software projects
- machine translation
- dialogue system
- question answering
- test set
- information extraction
- computer science