Unstructured and structured data: Can we have the best of both worlds with large language models?
Wang-Chiew TanPublished in: IEEE Data Eng. Bull. (2023)
Keyphrases
- language model
- structured data
- semi structured
- unstructured data
- language modeling
- probabilistic model
- n gram
- document retrieval
- structured databases
- information extraction
- structured information
- unstructured text
- information retrieval
- query expansion
- free text
- data sources
- textual data
- retrieval model
- test collection
- xml documents
- linked data
- query terms
- semi structured data
- smoothing methods
- language models for information retrieval
- databases
- vector space model
- retrieval effectiveness
- metadata
- keyword queries
- pseudo relevance feedback
- relevance model
- text data
- co occurrence
- text mining
- data sets