Analyzing Word Frequencies in Large Text Corpora Using Inter-arrival Times and Bootstrapping.
Jefrey Lijffijt, Panagiotis Papapetrou, Kai Puolamäki, Heikki Mannila
Published in: ECML/PKDD (2) (2011)
Keyphrases
- text corpora
- text corpus
- word frequencies
- arrival times
- text mining
- document collections
- computational linguistics
- text analysis
- information extraction
- topic models
- text documents
- text classifiers
- single server
- topic modeling
- concept hierarchy
- artificial intelligence
- named entities
- information retrieval
- document retrieval
- knowledge discovery
- classification accuracy