Data-Centric AI in the Age of Large Language Models.
Xinyi XuZhaoxuan WuRui QiaoArun VermaYao ShuJingtan WangXinyuan NiuZhenfeng HeJiangwei ChenZijian ZhouGregory Kang Ruey LauHieu DaoLucas AgussurjaRachael Hwee Ling SimXiaoqiang LinWenyang HuZhongxiang DaiPang Wei KohBryan Kian Hsiang LowPublished in: CoRR (2024)
Keyphrases
- language model
- data centric
- language modeling
- artificial intelligence
- document retrieval
- business processes
- speech recognition
- data management
- n gram
- information management
- language modelling
- probabilistic model
- expert systems
- data driven
- language models for information retrieval
- statistical language models
- test collection
- information retrieval
- retrieval model
- data representation
- query expansion
- machine learning
- xml schema
- smoothing methods
- application development
- routing protocol
- distributed systems
- wireless sensor networks
- information integration
- data mining
- data storage
- pseudo relevance feedback
- databases
- database systems
- object oriented databases
- relevance model