​
Login / Signup
Layne Berry
Publication Activity (10 Years)
Years Active: 2022-2024
Publications (10 Years): 11
Top Topics
Control Signals
Pre Trained
Visual Representation
Image Retrieval
Top Venues
CoRR
ICASSP
EMNLP
MICRO
</>
Publications
</>
Yuan Tseng
,
Layne Berry
,
Yiting Chen
,
I-Hsiang Chiu
,
Hsuan-Hao Lin
,
Max Liu
,
Puyuan Peng
,
Yi-Jen Shih
,
Hung-Yu Wang
,
Haibin Wu
,
Poyao Huang
,
Chun-Mao Lai
,
Shang-Wen Li
,
David Harwath
,
Yu Tsao
,
Abdelrahman Mohamed
,
Chi-Luen Feng
,
Hung-Yi Lee
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models.
ICASSP
(2024)
Hsuan-Fu Wang
,
Yi-Jen Shih
,
Heng-Jui Chang
,
Layne Berry
,
Puyuan Peng
,
Hung-yi Lee
,
Hsin-Min Wang
,
David Harwath
SpeechCLIP+: Self-supervised multi-task representation learning for speech via CLIP and speech-image data.
CoRR
(2024)
Hung-Chieh Fang
,
Nai-Xuan Ye
,
Yi-Jen Shih
,
Puyuan Peng
,
Hsuan-Fu Wang
,
Layne Berry
,
Hung-yi Lee
,
David Harwath
Integrating Self-supervised Speech Model with Pseudo Word-level Targets from Visually-grounded Speech Model.
CoRR
(2024)
Layne Berry
,
Yi-Jen Shih
,
Hsuan-Fu Wang
,
Heng-Jui Chang
,
Hung-Yi Lee
,
David Harwath
M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval.
ICASSP
(2023)
Yuan Tseng
,
Layne Berry
,
Yi-Ting Chen
,
I-Hsiang Chiu
,
Hsuan-Hao Lin
,
Max Liu
,
Puyuan Peng
,
Yi-Jen Shih
,
Hung-Yu Wang
,
Haibin Wu
,
Po-Yao Huang
,
Chun-Mao Lai
,
Shang-Wen Li
,
David Harwath
,
Yu Tsao
,
Shinji Watanabe
,
Abdelrahman Mohamed
,
Chi-Luen Feng
,
Hung-yi Lee
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models.
CoRR
(2023)
Anuj Diwan
,
Layne Berry
,
Eunsol Choi
,
David Harwath
,
Kyle Mahowald
Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality.
EMNLP
(2022)
Yi-Jen Shih
,
Hsuan-Fu Wang
,
Heng-Jui Chang
,
Layne Berry
,
Hung-yi Lee
,
David Harwath
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model.
SLT
(2022)
Logan Moody
,
Wei Qi
,
Abdolrasoul Sharifi
,
Layne Berry
,
Joey Rudek
,
Jayesh Gaur
,
Jeff Parkhurst
,
Sreenivas Subramoney
,
Kevin Skadron
,
Ashish Venkat
Speculative Code Compaction: Eliminating Dead Code via Speculative Microcode Transformations.
MICRO
(2022)
Yi-Jen Shih
,
Hsuan-Fu Wang
,
Heng-Jui Chang
,
Layne Berry
,
Hung-yi Lee
,
David Harwath
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model.
CoRR
(2022)
Layne Berry
,
Yi-Jen Shih
,
Hsuan-Fu Wang
,
Heng-Jui Chang
,
Hung-yi Lee
,
David Harwath
M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval.
CoRR
(2022)
Anuj Diwan
,
Layne Berry
,
Eunsol Choi
,
David Harwath
,
Kyle Mahowald
Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality.
CoRR
(2022)