Automated Transcription for Pre-modern Japanese Kuzushiji Documents by Random Lines Erasure and Curriculum Training.
Anh Duc LePublished in: DAS (2020)
Keyphrases
- document collections
- web documents
- information retrieval systems
- straight line
- relevant documents
- hough transform
- text documents
- information retrieval
- xml documents
- training set
- metadata
- user queries
- keywords
- document retrieval
- vector space model
- neural network
- document classification
- feature selection
- training corpus
- cooperative learning
- training process
- query terms
- information extraction
- online learning
- language model