​
Login / Signup
Khoa Vo
ORCID
Publication Activity (10 Years)
Years Active: 2021-2024
Publications (10 Years): 23
Top Topics
Spatio Temporal
Shape Prior
Action Detection
Scene Representation
Top Venues
CoRR
BMVC
ICRA
ICIP
</>
Publications
</>
Khoa Vo
,
Thinh Phan
,
Kashu Yamazaki
,
Minh Tran
,
Ngan Le
HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model.
CoRR
(2024)
Kashu Yamazaki
,
Taisei Hanyu
,
Khoa Vo
,
Thang Pham
,
Minh Tran
,
Gianfranco Doretto
,
Anh Nguyen
,
Ngan Le
Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation.
ICRA
(2024)
Minh Tran
,
Winston Bounsavy
,
Khoa Vo
,
Anh Nguyen
,
Tri Nguyen
,
Ngan Le
ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation.
CoRR
(2024)
Thinh Phan
,
Khoa Vo
,
Duy Le
,
Gianfranco Doretto
,
Donald A. Adjeroh
,
Ngan Le
ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection.
WACV
(2024)
Thinh Phan
,
Khoa Vo
,
Duy Le
,
Gianfranco Doretto
,
Donald A. Adjeroh
,
Ngan Le
ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection.
CoRR
(2023)
Khoa Vo
,
Trong-Thang Pham
,
Kashu Yamazaki
,
Minh Q. Tran
,
Ngan Le
DNA: Deformable Neural Articulations Network for Template-free Dynamic 3D Human Reconstruction from Monocular RGB-D Video.
CVPR Workshops
(2023)
Hyekang Kevin Joo
,
Khoa Vo
,
Kashu Yamazaki
,
Ngan Le
CLIP-TSA: Clip-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection.
ICIP
(2023)
Kashu Yamazaki
,
Taisei Hanyu
,
Khoa Vo
,
Thang Pham
,
Minh Tran
,
Gianfranco Doretto
,
Anh Nguyen
,
Ngan Le
Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation.
CoRR
(2023)
Kashu Yamazaki
,
Khoa Vo
,
Quang Sang Truong
,
Bhiksha Raj
,
Ngan Le
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning.
AAAI
(2023)
Minh Tran
,
Khoa Vo
,
Kashu Yamazaki
,
Arthur Fernandes
,
Michael Kidd
,
Ngan Le
AISFormer: Amodal Instance Segmentation with Transformer.
CoRR
(2022)
Kashu Yamazaki
,
Khoa Vo
,
Sang Truong
,
Bhiksha Raj
,
Ngan Le
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning.
CoRR
(2022)
Kashu Yamazaki
,
Sang Truong
,
Khoa Vo
,
Michael Kidd
,
Chase Rainwater
,
Khoa Luu
,
Ngan Le
VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning.
CoRR
(2022)
Minh Q. Tran
,
Khoa Vo
,
Kashu Yamazaki
,
Arthur Fernandes
,
Michael Kidd
,
Ngan Le
AISFormer: Amodal Instance Segmentation with Transformer.
BMVC
(2022)
Hyekang Kevin Joo
,
Khoa Vo
,
Kashu Yamazaki
,
Ngan Le
CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection.
CoRR
(2022)
Khoa Vo
,
Sang Truong
,
Kashu Yamazaki
,
Bhiksha Raj
,
Minh-Triet Tran
,
Ngan Le
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation.
CoRR
(2022)
Khoa Vo
,
Kashu Yamazaki
,
Sang Truong
,
Minh-Triet Tran
,
Akihiro Sugimoto
,
Ngan Le
ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation.
CoRR
(2022)
Khoa Vo
,
Kashu Yamazaki
,
Phong X. Nguyen
,
Phat Nguyen
,
Khoa Luu
,
Ngan Le
Contextual Explainable Video Representation: Human Perception-based Understanding.
IEEECONF
(2022)
Khoa Vo
,
Kashu Yamazaki
,
Phong X. Nguyen
,
Phat Nguyen
,
Khoa Luu
,
Ngan Le
Contextual Explainable Video Representation: Human Perception-based Understanding.
CoRR
(2022)
Khoa Vo
,
Kashu Yamazaki
,
Sang Truong
,
Minh-Triet Tran
,
Akihiro Sugimoto
,
Ngan Le
ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation.
IEEE Access
9 (2021)
Charles R. Qi
,
Yin Zhou
,
Mahyar Najibi
,
Pei Sun
,
Khoa Vo
,
Boyang Deng
,
Dragomir Anguelov
Offboard 3D Object Detection from Point Cloud Sequences.
CoRR
(2021)
Khoa Vo
,
Hyekang Joo
,
Kashu Yamazaki
,
Sang Truong
,
Kris Kitani
,
Minh-Triet Tran
,
Ngan Le
AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation.
BMVC
(2021)
Khoa Vo
,
Hyekang Joo
,
Kashu Yamazaki
,
Sang Truong
,
Kris Kitani
,
Minh-Triet Tran
,
Ngan Le
AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation.
CoRR
(2021)
Charles R. Qi
,
Yin Zhou
,
Mahyar Najibi
,
Pei Sun
,
Khoa Vo
,
Boyang Deng
,
Dragomir Anguelov
Offboard 3D Object Detection From Point Cloud Sequences.
CVPR
(2021)