An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks.
Kyubyong Park, Joohong Lee, Seongbo Jang, Dawoon Jung
Published in: CoRR (2020)
Keyphrases
- named entities
- natural language processing
- natural language
- keywords
- knowledge discovery
- information extraction
- text mining
- question answering
- text processing
- optimal strategy
- field of natural language processing
- database
- morphological analysis
- text analysis
- named entity recognition
- free text
- search strategies
- semi-supervised
- machine learning