Harvesting Textual and Structured Data from the HAL Publication Repository.
Francis KulumbaWissam AntounGuillaume VimontLaurent RomaryPublished in: CoRR (2024)
Keyphrases
- structured data
- metadata
- digital libraries
- textual data
- unstructured data
- free text
- semi structured
- information extraction
- relational data
- semi structured data
- structured databases
- xml documents
- structured information
- unstructured text
- semistructured data
- linked data
- keywords
- keyword search
- keyword queries
- structured and unstructured data
- structured queries
- text data
- data sources
- graph structures
- big data
- database
- text mining
- natural language processing
- knowledge discovery
- natural language
- genetic algorithm