Data Centric Domain Adaptation for Historical Text with OCR Errors.
Luisa März, Stefan Schweter, Nina Pörner, Benjamin Roth, Hinrich Schütze
Published in: CoRR (2021)
Keyphrases
- domain adaptation
- data centric
- data management
- multiple sources
- data driven
- business processes
- labeled data
- semi supervised
- cross domain
- semi supervised learning
- transfer learning
- information retrieval
- text mining
- optical character recognition
- database
- information management
- routing protocol
- distributed systems
- target domain
- xml schema
- data representation
- test data
- sentiment classification
- training data
- web documents
- keywords
- supervised learning
- text documents
- wireless sensor networks
- document classification
- application development
- domain specific
- unlabeled data
- face recognition
- web services
- information extraction
- sensor networks
- general purpose
- data mining