Khoa Vo

Publication Activity (10 Years)

Years Active: 2021-2024
Publications (10 Years): 23

Top Topics

Spatio Temporal

Action Detection

Scene Representation

Top Venues

Publications

Khoa Vo, Thinh Phan, Kashu Yamazaki, Minh Tran, Ngan Le
HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model. CoRR (2024)
Kashu Yamazaki, Taisei Hanyu, Khoa Vo, Thang Pham, Minh Tran, Gianfranco Doretto, Anh Nguyen, Ngan Le
Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation. ICRA (2024)
Minh Tran, Winston Bounsavy, Khoa Vo, Anh Nguyen, Tri Nguyen, Ngan Le
ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation. CoRR (2024)
Thinh Phan, Khoa Vo, Duy Le, Gianfranco Doretto, Donald A. Adjeroh, Ngan Le
ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection. WACV (2024)
Thinh Phan, Khoa Vo, Duy Le, Gianfranco Doretto, Donald A. Adjeroh, Ngan Le
ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection. CoRR (2023)
Khoa Vo, Trong-Thang Pham, Kashu Yamazaki, Minh Q. Tran, Ngan Le
DNA: Deformable Neural Articulations Network for Template-free Dynamic 3D Human Reconstruction from Monocular RGB-D Video. CVPR Workshops (2023)
Hyekang Kevin Joo, Khoa Vo, Kashu Yamazaki, Ngan Le
CLIP-TSA: Clip-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection. ICIP (2023)
Kashu Yamazaki, Taisei Hanyu, Khoa Vo, Thang Pham, Minh Tran, Gianfranco Doretto, Anh Nguyen, Ngan Le
Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation. CoRR (2023)
Kashu Yamazaki, Khoa Vo, Quang Sang Truong, Bhiksha Raj, Ngan Le
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning. AAAI (2023)
Minh Tran, Khoa Vo, Kashu Yamazaki, Arthur Fernandes, Michael Kidd, Ngan Le
AISFormer: Amodal Instance Segmentation with Transformer. CoRR (2022)
Kashu Yamazaki, Khoa Vo, Sang Truong, Bhiksha Raj, Ngan Le
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning. CoRR (2022)
Kashu Yamazaki, Sang Truong, Khoa Vo, Michael Kidd, Chase Rainwater, Khoa Luu, Ngan Le
VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning. CoRR (2022)
Minh Q. Tran, Khoa Vo, Kashu Yamazaki, Arthur Fernandes, Michael Kidd, Ngan Le
AISFormer: Amodal Instance Segmentation with Transformer. BMVC (2022)
Hyekang Kevin Joo, Khoa Vo, Kashu Yamazaki, Ngan Le
CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection. CoRR (2022)
Khoa Vo, Sang Truong, Kashu Yamazaki, Bhiksha Raj, Minh-Triet Tran, Ngan Le
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation. CoRR (2022)
Khoa Vo, Kashu Yamazaki, Sang Truong, Minh-Triet Tran, Akihiro Sugimoto, Ngan Le
ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation. CoRR (2022)
Khoa Vo, Kashu Yamazaki, Phong X. Nguyen, Phat Nguyen, Khoa Luu, Ngan Le
Contextual Explainable Video Representation: Human Perception-based Understanding. IEEECONF (2022)
Khoa Vo, Kashu Yamazaki, Phong X. Nguyen, Phat Nguyen, Khoa Luu, Ngan Le
Contextual Explainable Video Representation: Human Perception-based Understanding. CoRR (2022)
Khoa Vo, Kashu Yamazaki, Sang Truong, Minh-Triet Tran, Akihiro Sugimoto, Ngan Le
ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation. IEEE Access 9 (2021)
Charles R. Qi, Yin Zhou, Mahyar Najibi, Pei Sun, Khoa Vo, Boyang Deng, Dragomir Anguelov
Offboard 3D Object Detection from Point Cloud Sequences. CoRR (2021)
Khoa Vo, Hyekang Joo, Kashu Yamazaki, Sang Truong, Kris Kitani, Minh-Triet Tran, Ngan Le
AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation. BMVC (2021)
Khoa Vo, Hyekang Joo, Kashu Yamazaki, Sang Truong, Kris Kitani, Minh-Triet Tran, Ngan Le
AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation. CoRR (2021)
Charles R. Qi, Yin Zhou, Mahyar Najibi, Pei Sun, Khoa Vo, Boyang Deng, Dragomir Anguelov
Offboard 3D Object Detection From Point Cloud Sequences. CVPR (2021)