SynthBio: A Case Study in Faster Curation of Text Datasets.
Ann YuanDaphne IppolitoVitaly NikolaevChris Callison-BurchAndy CoenenSebastian GehrmannPublished in: NeurIPS Datasets and Benchmarks (2021)
Keyphrases
- text collections
- text mining
- database
- text retrieval
- keywords
- information retrieval
- case study
- text data
- free text
- natural language generation
- test bed
- benchmark datasets
- textual data
- automatically extracted
- string matching
- memory efficient
- object detection
- highly efficient
- key concepts
- knowledge discovery
- text processing
- artificial intelligence
- sentence level