A Hybrid Information Extraction Approach using Transfer Learning on Richly-Structured Documents.
Arnab Ghosh Chowdhury, Nils Schut, Martin Atzmüller
Published in: LWDA (2021)
Keyphrases
- transfer learning
- structured documents
- information extraction
- web documents
- text mining
- information retrieval
- machine learning
- cross domain
- information retrieval systems
- xml documents
- reinforcement learning
- active learning
- semi supervised learning
- labeled data
- collaborative filtering
- named entities
- machine learning algorithms
- natural language processing
- relevant documents
- transfer knowledge
- learning algorithm
- text classification
- query language
- text documents
- cross lingual
- semi supervised
- learning process
- nearest neighbor
- unlabeled data
- knowledge representation
- feature extraction
- machine translation
- document clustering
- document representation
- data mining
- data sets