TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning.
Yuan SuiJiaru ZouMengyu ZhouXinyi HeLun DuShi HanDongmei ZhangPublished in: CoRR (2023)
Keyphrases
- language model
- semi structured data
- structured data
- semi structured
- language modeling
- web mining
- probabilistic model
- information retrieval
- xml documents
- n gram
- document retrieval
- xml data
- html documents
- speech recognition
- query expansion
- context sensitive
- retrieval model
- mixture model
- web data
- database
- ad hoc information retrieval
- test collection
- knowledge representation
- relational databases
- heterogeneous data
- relevance model
- smoothing methods
- databases
- information retrieval systems
- xml schema
- information extraction
- cross lingual
- search engine
- machine learning
- translation model