Liangzhe Yuan
Publication Activity (10 Years)
Years Active: 2018-2024
Publications (10 Years): 35
Top Topics
Optical Flow
Human Pose
Motion Compensation
Video Data
Top Venues
CoRR
CVPR
ICLR
ICCV
Publications
Long Zhao, Nitesh Bharadwaj Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang, David A. Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu, Boqing Gong
VideoPrism: A Foundational Visual Encoder for Video Understanding.
CoRR
(2024)
Xuan Yang, Liangzhe Yuan, Kimberly Wilber, Astuti Sharma, Xiuye Gu, Siyuan Qiao, Stephanie Debats, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Liang-Chieh Chen
PolyMaX: General Dense Prediction with Mask Transformer.
WACV
(2024)
Yue Zhao, Long Zhao, Xingyi Zhou, Jialin Wu, Chun-Te Chu, Hui Miao, Florian Schroff, Hartwig Adam, Ting Liu, Boqing Gong, Philipp Krähenbühl, Liangzhe Yuan
Distilling Vision-Language Models on Millions of Videos.
CoRR
(2024)
Yuanhao Xiong, Long Zhao, Boqing Gong, Ming-Hsuan Yang, Florian Schroff, Ting Liu, Cho-Jui Hsieh, Liangzhe Yuan
Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding.
ICLR
(2024)
Long Zhao, Liangzhe Yuan, Boqing Gong, Yin Cui, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu
Unified Visual Relationship Detection with Vision and Language Models.
CoRR
(2023)
Qitong Wang, Long Zhao, Liangzhe Yuan, Ting Liu, Xi Peng
Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition.
CoRR
(2023)
Long Zhao, Liangzhe Yuan, Boqing Gong, Yin Cui, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu
Unified Visual Relationship Detection with Vision and Language Models.
ICCV
(2023)
Xuan Yang, Liangzhe Yuan, Kimberly Wilber, Astuti Sharma, Xiuye Gu, Siyuan Qiao, Stephanie Debats, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Liang-Chieh Chen
PolyMaX: General Dense Prediction with Mask Transformer.
CoRR
(2023)
Liangzhe Yuan, Nitesh Bharadwaj Gundavarapu, Long Zhao, Hao Zhou, Yin Cui, Lu Jiang, Xuan Yang, Menglin Jia, Tobias Weyand, Luke Friedman, Mikhail Sirotenko, Huisheng Wang, Florian Schroff, Hartwig Adam, Ming-Hsuan Yang, Ting Liu, Boqing Gong
VideoGLUE: Video General Understanding Evaluation of Foundation Models.
CoRR
(2023)
Yuanhao Xiong, Long Zhao, Boqing Gong, Ming-Hsuan Yang, Florian Schroff, Ting Liu, Cho-Jui Hsieh, Liangzhe Yuan
Spatiotemporally Discriminative Video-Language Pre-Training with Text Grounding.
CoRR
(2023)
Qitong Wang, Long Zhao, Liangzhe Yuan, Ting Liu, Xi Peng
Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition.
ICCV
(2023)
Juntang Zhuang, Boqing Gong, Liangzhe Yuan, Yin Cui, Hartwig Adam, Nicha C. Dvornek, Sekhar Tatikonda, James S. Duncan, Ting Liu
Surrogate Gap Minimization Improves Sharpness-Aware Training.
ICLR
(2022)
Liangzhe Yuan, Rui Qian, Yin Cui, Boqing Gong, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu
Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision.
CVPR
(2022)
Juntang Zhuang, Boqing Gong, Liangzhe Yuan, Yin Cui, Hartwig Adam, Nicha C. Dvornek, Sekhar Tatikonda, James S. Duncan, Ting Liu
Surrogate Gap Minimization Improves Sharpness-Aware Training.
CoRR
(2022)
Ting Liu, Jennifer J. Sun, Long Zhao, Jiaping Zhao, Liangzhe Yuan, Yuxiao Wang, Liang-Chieh Chen, Florian Schroff, Hartwig Adam
View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose.
Int. J. Comput. Vis.
130 (1) (2022)
Rui Qian, Yeqing Li, Liangzhe Yuan, Boqing Gong, Ting Liu, Matthew Brown, Serge J. Belongie, Ming-Hsuan Yang, Hartwig Adam, Yin Cui
On Temporal Granularity in Self-Supervised Video Representation Learning.
BMVC
(2022)
Rui Qian, Yeqing Li, Liangzhe Yuan, Boqing Gong, Ting Liu, Matthew Brown, Serge J. Belongie, Ming-Hsuan Yang, Hartwig Adam, Yin Cui
Exploring Temporal Granularity in Self-Supervised Video Representation Learning.
CoRR
(2021)
Mark Weber, Huiyu Wang, Siyuan Qiao, Jun Xie, Maxwell D. Collins, Yukun Zhu, Liangzhe Yuan, Dahun Kim, Qihang Yu, Daniel Cremers, Laura Leal-Taixé, Alan L. Yuille, Florian Schroff, Hartwig Adam, Liang-Chieh Chen
DeepLab2: A TensorFlow Library for Deep Labeling.
CoRR
(2021)
Hassan Akbari, Liangzhe Yuan, Rui Qian, Wei-Hong Chuang, Shih-Fu Chang, Yin Cui, Boqing Gong
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text.
CoRR
(2021)
Liangzhe Yuan, Rui Qian, Yin Cui, Boqing Gong, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu
Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision.
CoRR
(2021)
Hassan Akbari, Liangzhe Yuan, Rui Qian, Wei-Hong Chuang, Shih-Fu Chang, Yin Cui, Boqing Gong
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text.
NeurIPS
(2021)
Dan Kondratyuk, Liangzhe Yuan, Yandong Li, Li Zhang, Mingxing Tan, Matthew Brown, Boqing Gong
MoViNets: Mobile Video Networks for Efficient Video Recognition.
CVPR
(2021)
Dan Kondratyuk, Liangzhe Yuan, Yandong Li, Li Zhang, Mingxing Tan, Matthew Brown, Boqing Gong
MoViNets: Mobile Video Networks for Efficient Video Recognition.
CoRR
(2021)
Long Zhao, Yuxiao Wang, Jiaping Zhao, Liangzhe Yuan, Jennifer J. Sun, Florian Schroff, Hartwig Adam, Xi Peng, Dimitris N. Metaxas, Ting Liu
Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization.
CVPR
(2021)
Long Zhao, Yuxiao Wang, Jiaping Zhao, Liangzhe Yuan, Jennifer J. Sun, Florian Schroff, Hartwig Adam, Xi Peng, Dimitris N. Metaxas, Ting Liu
Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization.
CoRR
(2020)
Ting Liu, Jennifer J. Sun, Long Zhao, Jiaping Zhao, Liangzhe Yuan, Yuxiao Wang, Liang-Chieh Chen, Florian Schroff, Hartwig Adam
View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose.
CoRR
(2020)
Alex Zihao Zhu, Liangzhe Yuan, Kenneth Chaney, Kostas Daniilidis
Unsupervised Event-Based Learning of Optical Flow, Depth, and Egomotion.
CVPR
(2019)
Liangzhe Yuan, Christopher M. Reardon, Garrett Warnell, Giuseppe Loianno
Human Gaze-Driven Spatial Tasking of an Autonomous MAV.
IEEE Robotics Autom. Lett.
4 (2) (2019)
Alex Zihao Zhu, Liangzhe Yuan, Kenneth Chaney, Kostas Daniilidis
Live Demonstration: Unsupervised Event-Based Learning of Optical Flow, Depth and Egomotion.
CVPR Workshops
(2019)
Liangzhe Yuan, Yibo Chen, Hantian Liu, Tao Kong, Jianbo Shi
Zoom-In-To-Check: Boosting Video Interpolation via Instance-Level Discrimination.
CVPR
(2019)
Liangzhe Yuan, Yibo Chen, Hantian Liu, Tao Kong, Jianbo Shi
Zoom-In-to-Check: Boosting Video Interpolation via Instance-level Discrimination.
CoRR
(2018)
Alex Zihao Zhu, Liangzhe Yuan, Kenneth Chaney, Kostas Daniilidis
EV-FlowNet: Self-Supervised Optical Flow Estimation for Event-based Cameras.
CoRR
(2018)
Alex Zihao Zhu, Liangzhe Yuan, Kenneth Chaney, Kostas Daniilidis
Unsupervised Event-based Learning of Optical Flow, Depth, and Egomotion.
CoRR
(2018)
Alex Zihao Zhu, Liangzhe Yuan, Kenneth Chaney, Kostas Daniilidis
Unsupervised Event-Based Optical Flow Using Motion Compensation.
ECCV Workshops (6)
(2018)
Alex Zihao Zhu, Liangzhe Yuan, Kenneth Chaney, Kostas Daniilidis
EV-FlowNet: Self-Supervised Optical Flow Estimation for Event-based Cameras.
Robotics: Science and Systems
(2018)