TASTI: Semantic Indexes for Machine Learning-based Queries over Unstructured Data.
Daniel KangJohn GuibasPeter D. BailisTatsunori HashimotoMatei ZahariaPublished in: SIGMOD Conference (2022)
Keyphrases
- unstructured data
- machine learning
- structured data
- query processing
- textual data
- data sources
- query language
- big data
- information extraction
- semi structured
- database
- relational databases
- natural language
- query evaluation
- semantic information
- information management
- semistructured data
- natural language processing
- text classification
- semi structured data
- artificial intelligence
- text mining
- query logs
- raw data
- data warehouse
- data mining
- data management
- knowledge representation
- feature selection