Dodging the Data Bottleneck: Automatic Subtitling with Automatically Segmented ST Corpora.
Sara PapiAlina KarakantaMatteo NegriMarco TurchiPublished in: CoRR (2022)
Keyphrases
- data sources
- data sets
- data quality
- raw data
- synthetic data
- data collection
- database
- training data
- knowledge discovery
- data distribution
- statistical analysis
- image data
- image analysis
- information retrieval
- data analysis
- data structure
- labor intensive
- databases
- data objects
- original data
- computer vision
- experimental data
- metadata
- computer systems
- data processing
- input data
- association rules
- high dimensional
- end users