Investigating the Effectiveness of Representations Based on Word-Embeddings in Active Learning for Labelling Text Datasets.
Jinghui Lu, Maeve Henchion, Brian Mac Namee
Published in: CoRR (2019)
Keyphrases
- active learning
- keywords
- english text
- text data
- text corpus
- string matching
- data sets
- word counts
- sentence level
- machine learning
- linguistic information
- text mining
- text input
- english words
- related words
- natural language text
- chinese text
- text documents
- syntactic categories
- learning process
- information retrieval
- spoken documents
- named entity recognizer
- word recognition
- noun phrases
- text retrieval
- vector space
- labeled data
- dimensionality reduction
- co-occurrence
- learning algorithm