HuSpaCy: an industrial-strength Hungarian natural language processing toolkit.
György OroszZsolt SzántóPéter BerkeczGergo SzabóRichárd FarkasPublished in: CoRR (2022)
Keyphrases
- industrial strength
- natural language processing
- text mining
- information extraction
- computational linguistics
- machine learning
- computational biology
- language processing
- natural language
- named entities
- text processing
- free text
- machine translation
- wordnet
- text classification
- knowledge representation
- text summarization
- linguistic knowledge
- named entity recognition
- semantic relations
- semantic analysis
- language independent
- artificial intelligence
- question answering
- dynamic time warping
- data mining
- real world
- textual data
- part of speech
- databases
- text to speech
- word sense disambiguation
- sentiment analysis
- data sets
- expert systems
- multiscale
- similarity measure
- case study
- information systems
- learning algorithm
- statistical and machine learning methods