Word frequency and text type: Some observations based on the LOB corpus of British English texts.
Stig Johansson
Published in: Comput. Humanit. (1985)
Keyphrases
- word frequency
- word order
- english words
- natural language generation
- automatic summarization
- keywords
- training corpus
- word sense
- bag of words
- text categorization
- natural language text
- text documents
- semantic relatedness
- natural language
- text corpus
- statistical machine translation
- text mining
- machine translation
- wordnet
- web documents
- language independent
- text classification
- image classification
- multiword
- cross lingual
- document frequency
- term frequency
- vector space model
- search engine
- word sense disambiguation
- document retrieval
- question answering
- knn
- feature selection