TextMachina: Seamless Generation of Machine-Generated Text Datasets.
Areg Mikael Sarvazyan, José Ángel González, Marc Franco-Salvador
Published in: CoRR (2024)
Keyphrases
- generation process
- text generation
- generation method
- database
- human generated
- text retrieval
- text collections
- text data
- context awareness
- batch processing
- string matching
- randomly selected
- information retrieval
- free text
- natural language generation
- manually constructed
- automatically generating
- uci machine learning repository
- text information
- benchmark datasets
- text processing
- automatically generated
- image classification
- web documents
- decision trees
- semantic information
- metadata
- artificial intelligence
- key concepts