Automatically Annotated Turkish Corpus for Named Entity Recognition and Text Categorization using Large-Scale Gazetteers.
H. Bahadir Sahin, Caglar Tirkaz, Eray Yildiz, Mustafa Tolga Eren, Omer Ozan Sonmez
Published in: CoRR (2017)
Keyphrases
- text categorization
- named entity recognition
- annotated corpus
- named entities
- GENIA corpus
- information extraction
- natural language processing
- semi-supervised
- text classification
- feature selection
- conditional random fields
- text documents
- maximum entropy
- semi-supervised learning
- relation extraction
- text summarization
- unlabeled data
- kNN
- k-nearest neighbor
- multi-label
- naive Bayes
- question answering
- text mining
- co-occurrence
- tf-idf
- learning algorithm
- information retrieval