PRETO: a high-performance text mining tool for preprocessing Turkish texts.
Volkan TunaliTurgay Tugay BilginPublished in: CompSysTech (2012)
Keyphrases
- text mining
- preprocessing
- text documents
- post processing
- natural language text
- knowledge discovery
- textual documents
- data sets
- biomedical literature
- web mining
- named entities
- text classification
- information extraction
- natural language
- learning algorithm
- topic modeling
- information retrieval
- machine learning
- scientific literature
- database
- text categorisation
- genia corpus