An Extensible Parsing Pipeline for Unstructured Data Processing.
Shubham Jain, Amy de Buitléir, Enda Fallon
Published in: ICACT (2021)
Keyphrases
- data processing
- computer systems
- data model
- semi-structured
- data management
- natural language
- data acquisition
- data analysis
- natural language processing
- structured data
- markup language
- big data
- data types
- pipeline architecture
- natural language parsing
- database systems
- object-oriented
- shallow parsing
- intermediate representation
- databases
- data mining
- speech understanding
- error recovery
- pattern matching
- unstructured data
- dependency parsing
- context-free grammars
- unsupervised learning
- syntactic analysis
- application specific
- wide coverage
- treebank