PubSqueezer: A Text-Mining Web Tool to Transform Unstructured Documents into Structured Data.
Alberto CalderonePublished in: CoRR (2020)
Keyphrases
- structured data
- unstructured documents
- free text
- semi structured
- information extraction
- textual data
- unstructured text
- text data
- semi structured data
- xml documents
- data sources
- relational data
- keyword search
- unstructured data
- web documents
- text mining
- structured information
- linked data
- semistructured data
- text databases
- metadata
- tree structured data
- data sets
- machine learning
- text classification
- natural language processing
- tree kernels
- structured databases
- structured and unstructured data