Login / Signup
ECCV (35)
2022
2022
2022
Keyphrases
Publications
2022
Chuan Guo
,
Xinxin Zuo
,
Sen Wang
,
Li Cheng
TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts.
ECCV (35)
(2022)
Guanxiong Sun
,
Yang Hua
,
Guosheng Hu
,
Neil Robertson
TDViT: Temporal Dilated Video Transformer for Dense Video Tasks.
ECCV (35)
(2022)
Jingcheng Ni
,
Nan Zhou
,
Jie Qin
,
Qian Wu
,
Junqi Liu
,
Boxun Li
,
Di Huang
Motion Sensitive Contrastive Learning for Self-supervised Video Representation.
ECCV (35)
(2022)
Yiheng Li
,
Connelly Barnes
,
Kun Huang
,
Fang-Lue Zhang
Deep 360$^\circ $ Optical Flow Estimation Based on Multi-projection Fusion.
ECCV (35)
(2022)
Ziyi Lin
,
Shijie Geng
,
Renrui Zhang
,
Peng Gao
,
Gerard de Melo
,
Xiaogang Wang
,
Jifeng Dai
,
Yu Qiao
,
Hongsheng Li
Frozen CLIP Models are Efficient Video Learners.
ECCV (35)
(2022)
Wei Suo
,
Mengyang Sun
,
Kai Niu
,
Yiqi Gao
,
Peng Wang
,
Yanning Zhang
,
Qi Wu
A Simple and Robust Correlation Filtering Method for Text-Based Person Search.
ECCV (35)
(2022)
David Junhao Zhang
,
Kunchang Li
,
Yali Wang
,
Yunpeng Chen
,
Shashwat Chandra
,
Yu Qiao
,
Luoqi Liu
,
Mike Zheng Shou
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning.
ECCV (35)
(2022)
Laura Hanu
,
James Thewlis
,
Yuki M. Asano
,
Christian Rupprecht
VTC: Improving Video-Text Retrieval with User Comments.
ECCV (35)
(2022)
Zizhang Li
,
Mengmeng Wang
,
Huaijin Pi
,
Kechun Xu
,
Jianbiao Mei
,
Yong Liu
E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context.
ECCV (35)
(2022)
Guodong Ding
,
Angela Yao
Leveraging Action Affinity and Continuity for Semi-supervised Temporal Action Segmentation.
ECCV (35)
(2022)
Mengxue Qu
,
Yu Wu
,
Wu Liu
,
Qiqi Gong
,
Xiaodan Liang
,
Olga Russakovsky
,
Yao Zhao
,
Yunchao Wei
SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding.
ECCV (35)
(2022)
Guy Erez
,
Ron Shapira Weber
,
Oren Freifeld
A Deep Moving-Camera Background Model.
ECCV (35)
(2022)
Fanyi Xiao
,
Joseph Tighe
,
Davide Modolo
MaCLR: Motion-Aware Contrastive Learning of Representations for Videos.
ECCV (35)
(2022)
Huan Li
,
Ping Wei
,
Jiapeng Li
,
Zeyu Ma
,
Jiahui Shang
,
Nanning Zheng
Asymmetric Relation Consistency Reasoning for Video Relation Grounding.
ECCV (35)
(2022)
Chen Ju
,
Tengda Han
,
Kunhao Zheng
,
Ya Zhang
,
Weidi Xie
Prompting Visual-Language Models for Efficient Video Understanding.
ECCV (35)
(2022)
Yuxuan Wang
,
Difei Gao
,
Licheng Yu
,
Weixian Lei
,
Matt Feiszli
,
Mike Zheng Shou
GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval.
ECCV (35)
(2022)
Woobin Im
,
Sebin Lee
,
Sung-Eui Yoon
Semi-supervised Learning of Optical Flow by Flow Supervisor.
ECCV (35)
(2022)
Seong Hyeon Park
,
Jihoon Tack
,
Byeongho Heo
,
Jung-Woo Ha
,
Jinwoo Shin
K-centered Patch Sampling for Efficient Video Recognition.
ECCV (35)
(2022)
Junke Wang
,
Xitong Yang
,
Hengduo Li
,
Li Liu
,
Zuxuan Wu
,
Yu-Gang Jiang
Efficient Video Transformers with Spatial-Temporal Token Selection.
ECCV (35)
(2022)
Xiao Han
,
Licheng Yu
,
Xiatian Zhu
,
Li Zhang
,
Yi-Zhe Song
,
Tao Xiang
FashionViL: Fashion-Focused Vision-and-Language Representation Learning.
ECCV (35)
(2022)
Guanxiong Sun
,
Yang Hua
,
Guosheng Hu
,
Neil Robertson
Efficient One-Stage Video Object Detection by Exploiting Temporal Consistency.
ECCV (35)
(2022)
Amirhossein Habibian
,
Haitam Ben Yahia
,
Davide Abati
,
Efstratios Gavves
,
Fatih Porikli
Delta Distillation for Efficient Video Processing.
ECCV (35)
(2022)
Jiacheng Li
,
Ruize Han
,
Haomin Yan
,
Zekun Qian
,
Wei Feng
,
Song Wang
Self-supervised Social Relation Representation for Human Group Detection.
ECCV (35)
(2022)
Aisha Urooj Khan
,
Hilde Kuehne
,
Chuang Gan
,
Niels da Vitoria Lobo
,
Mubarak Shah
Weakly Supervised Grounding for VQA in Vision-Language Transformers.
ECCV (35)
(2022)
Jun Wang
,
Abhir Bhalerao
,
Yulan He
Cross-Modal Prototype Driven Network for Radiology Report Generation.
ECCV (35)
(2022)
Fuchen Long
,
Zhaofan Qiu
,
Yingwei Pan
,
Ting Yao
,
Chong-Wah Ngo
,
Tao Mei
Dynamic Temporal Filtering in Video Models.
ECCV (35)
(2022)
Yang Jiao
,
Shaoxiang Chen
,
Zequn Jie
,
Jingjing Chen
,
Lin Ma
,
Yu-Gang Jiang
MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes.
ECCV (35)
(2022)
Md Mohaiminul Islam
,
Gedas Bertasius
Long Movie Clip Classification with State-Space Video Models.
ECCV (35)
(2022)
Lianyu Hu
,
Liqing Gao
,
Zekang Liu
,
Wei Feng
Temporal Lift Pooling for Continuous Sign Language Recognition.
ECCV (35)
(2022)
Jiafei Duan
,
Samson Yu
,
Soujanya Poria
,
Bihan Wen
,
Cheston Tan
PIP: Physical Interaction Prediction via Mental Simulation with Span Selection.
ECCV (35)
(2022)
Chaoyang Zhu
,
Yiyi Zhou
,
Yunhang Shen
,
Gen Luo
,
Xingjia Pan
,
Mingbao Lin
,
Chao Chen
,
Liujuan Cao
,
Xiaoshuai Sun
,
Rongrong Ji
SeqTR: A Simple Yet Universal Network for Visual Grounding.
ECCV (35)
(2022)
Heeseung Yun
,
Sehun Lee
,
Gunhee Kim
Panoramic Vision Transformer for Saliency Detection in 360$^\circ $ Videos.
ECCV (35)
(2022)
Honglu Zhou
,
Asim Kadav
,
Aviv Shamsian
,
Shijie Geng
,
Farley Lai
,
Long Zhao
,
Ting Liu
,
Mubbasir Kapadia
,
Hans Peter Graf
COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality.
ECCV (35)
(2022)
Aditi Basu Bal
,
Ramy Mounir
,
Sathyanarayanan N. Aakur
,
Sudeep Sarkar
,
Anuj Srivastava
Bayesian Tracking of Video Graphs Using Joint Kalman Smoothing and Registration.
ECCV (35)
(2022)
Nikita Dvornik
,
Isma Hadji
,
Hai Pham
,
Dhaivat Bhatt
,
Brais Martinez
,
Afsaneh Fazly
,
Allan D. Jepson
Flow Graph to Video Grounding for Weakly-Supervised Multi-step Localization.
ECCV (35)
(2022)
Nadine Behrmann
,
S. Alireza Golestaneh
,
Zico Kolter
,
Jürgen Gall
,
Mehdi Noroozi
Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation.
ECCV (35)
(2022)
Liliane Momeni
,
Hannah Bull
,
K. R. Prajwal
,
Samuel Albanie
,
Gül Varol
,
Andrew Zisserman
Automatic Dense Annotation of Large-Vocabulary Sign Language Videos.
ECCV (35)
(2022)
Kyle Min
,
Sourya Roy
,
Subarna Tripathi
,
Tanaya Guha
,
Somdeb Majumdar
Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection.
ECCV (35)
(2022)
Renrui Zhang
,
Wei Zhang
,
Rongyao Fang
,
Peng Gao
,
Kunchang Li
,
Jifeng Dai
,
Yu Qiao
,
Hongsheng Li
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification.
ECCV (35)
(2022)
James Hong
,
Haotian Zhang
,
Michaël Gharbi
,
Matthew Fisher
,
Kayvon Fatahalian
Spotting Temporally Precise, Fine-Grained Events in Video.
ECCV (35)
(2022)
Yuying Ge
,
Yixiao Ge
,
Xihui Liu
,
Jinpeng Wang
,
Jianping Wu
,
Ying Shan
,
Xiaohu Qie
,
Ping Luo
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval.
ECCV (35)
(2022)
Eitan Kosman
,
Dotan Di Castro
GraphVid: It only Takes a Few Nodes to Understand a Video.
ECCV (35)
(2022)
volume 13695, 2022
Computer Vision - ECCV 2022 - 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XXXV
ECCV (35)
13695 (2022)