A Lightweight Approach to Extract Interschema Properties from Structured, Semi-Structured and Unstructured Sources in a Big Data Scenario.
Francesco Cauteruccio, Paolo Lo Giudice, Lorenzo Musarella, Giorgio Terracina, Domenico Ursino, Luca Virgili
Published in: Int. J. Inf. Technol. Decis. Mak. (2020)
Keyphrases
- lightweight
- unstructured data
- semi structured
- big data
- structured data
- web data sources
- web sources
- information extraction
- unstructured text
- data model
- semi structured data
- data sources
- structured knowledge
- cloud computing
- data extraction
- web data
- data collections
- data processing
- data management
- wrapper generation
- database
- big data analytics
- wrapper induction
- social media
- knowledge discovery
- web documents
- business intelligence
- free text
- data analysis
- xml documents
- wireless sensor networks
- text mining
- decision support system
- semistructured data
- textual data
- rfid tags
- multiple sources
- keyword search
- machine learning
- data mining