Semi-Automatic LaTeX-Based Labeling of Mathematical Objects in PDF Documents: MOP Data Set.
Donald BeyetteZelun WangJason LinJyh-Charn LiuPublished in: DocEng (2019)
Keyphrases
- semi automatic
- data sets
- fully automatic
- mathematical expressions
- gold standard
- semi automatically
- pdf documents
- ontology mapping
- landmark extraction
- multi objective
- labor intensive
- semantic annotation
- wrapper generation
- multi objective optimization
- ontology construction
- active learning
- automatic construction
- similarity measure
- image segmentation