Conversion of Multi-lingual STEM Documents in E-Born PDF into Various Accessible E-Formats.
Masakazu SuzukiKatsuhito YamaguchiPublished in: ICCHP-AAATE (1) (2022)
Keyphrases
- multi lingual
- pdf documents
- information retrieval
- scientific documents
- information access
- metadata
- language independent
- electronic documents
- document collections
- pdf files
- language identification
- digital libraries
- cross lingual
- relevant documents
- information retrieval systems
- document retrieval
- multiple information sources
- document clustering
- text retrieval
- text classification
- natural language processing
- web documents
- mixture model