First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment.
Tom Tongjia ChenHongshan YuZhengeng YangMing LiZechuan LiJingwen WangWei MiaoWei SunChen ChenPublished in: CoRR (2023)