Towards A Better Metric for Text-to-Video Generation.
Jay Zhangjie WuGuian FangHaoning WuXintao WangYixiao GeXiaodong CunDavid Junhao ZhangJia-Wei LiuYuchao GuRui ZhaoWeisi LinWynne HsuYing ShanMike Zheng ShouPublished in: CoRR (2024)
Keyphrases
- text generation
- natural language descriptions
- video data
- news video
- video sequences
- video segments
- multimedia documents
- video content
- real time
- text detection
- video frames
- video streams
- information retrieval
- multimedia
- video search
- database
- quality metrics
- text retrieval
- multimedia search
- video analysis
- space time
- video shots
- metric space
- web documents
- video surveillance
- natural language
- multimedia data
- digital video
- video clips
- event detection
- temporal information
- spatio temporal
- generation process
- information retrieval systems
- evaluation metrics
- keywords
- video scene
- human activities
- spatial and temporal
- text documents
- audio content
- closed captions