Py_ape: Text Data Acquiring, Extracting, Cleaning and Schema Matching in Python.
Bich-Ngan T. NguyenPhuong N. H. PhamVu Thanh NguyenPhan Quoc VietLe Dinh TuanVáclav SnáselPublished in: FDSE (CCIS Volume) (2020)
Keyphrases
- text data
- schema matching
- data extraction
- data integration
- text mining
- text classification
- information integration
- web sources
- structured data
- high dimensional
- deep web
- query interface
- semi structured
- text documents
- data sources
- document collections
- domain ontology
- web pages
- high dimensional data
- schema mappings
- text categorization
- object oriented
- web databases
- heterogeneous data sources
- data sets
- information sources
- data management
- data points
- query language
- feature extraction
- metadata
- databases