Zeyu Xie
Publication Activity (10 Years)
Years Active: 2020-2024
Publications (10 Years): 21
Top Topics
Text Data
Global Information
Medical Image Fusion
Status Quo
Top Venues
CoRR
ICASSP
IET Image Process.
Sensors
Publications
Zeyu Xie, Baihan Li, Xuenan Xu, Mengyue Wu, Kai Yu
Enhancing Audio Generation Diversity with Visual Information.
ICASSP
(2024)
Zeyu Xie, Xuenan Xu, Zhizheng Wu, Mengyue Wu
PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation.
CoRR
(2024)
Zeyu Xie, Baihan Li, Xuenan Xu, Zheng Liang, Kai Yu, Mengyue Wu
FakeSound: Deepfake General Audio Detection.
CoRR
(2024)
Zeyu Xie, Xuenan Xu, Zhizheng Wu, Mengyue Wu
AudioTime: A Temporally-aligned Audio-text Benchmark Dataset.
CoRR
(2024)
Xingyuan Li, Sinong Wang, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu
Phonetic and Lexical Discovery of a Canine Language using HuBERT.
CoRR
(2024)
Xuenan Xu, Zeyu Xie, Mengyue Wu, Kai Yu
Beyond the Status Quo: A Contemporary Survey of Advances and Challenges in Audio Captioning.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Xuenan Xu, Xiaohang Xu, Zeyu Xie, Pingyue Zhang, Mengyue Wu, Kai Yu
A Detailed Audio-Text Data Simulation Pipeline Using Single-Event Sounds.
ICASSP
(2024)
Zeyu Xie, Baihan Li, Xuenan Xu, Mengyue Wu, Kai Yu
Enhancing Audio Generation Diversity with Visual Information.
CoRR
(2024)
Baihan Li, Zeyu Xie, Xuenan Xu, Yiwei Guo, Ming Yan, Ji Zhang, Kai Yu, Mengyue Wu
DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation.
CoRR
(2024)
Xuenan Xu, Xiaohang Xu, Zeyu Xie, Pingyue Zhang, Mengyue Wu, Kai Yu
A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds.
CoRR
(2024)
Xuenan Xu, Zhiling Zhang, Zelin Zhou, Pingyue Zhang, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu
BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data.
ACM Multimedia
(2023)
Hanxue Zhang, Zeyu Xie, Xuenan Xu, Mengyue Wu, Kai Yu
Improving Audio Caption Fluency with Automatic Error Correction.
CoRR
(2023)
Zeyu Xie, Xuenan Xu, Mengyue Wu, Kai Yu
Enhance Temporal Relations in Audio Captioning with Sound Event Detection.
INTERSPEECH
(2023)
Xuenan Xu, Zhiling Zhang, Zelin Zhou, Pingyue Zhang, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu
BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data.
CoRR
(2023)
Zeyu Xie, Xuenan Xu, Mengyue Wu, Kai Yu
Enhance Temporal Relations in Audio Captioning with Sound Event Detection.
CoRR
(2023)
Zelin Zhou, Zhiling Zhang, Xuenan Xu, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu
Can Audio Captions Be Evaluated With Image Caption Metrics?
ICASSP
(2022)
Zelin Zhou, Zhiling Zhang, Xuenan Xu, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu
Can Audio Captions Be Evaluated with Image Caption Metrics?
CoRR
(2021)
Haipeng Chen, Zeyu Xie, Yongping Huang, Di Gai
Intuitionistic Fuzzy C-Means Algorithm Based on Membership Information Transfer-Ring and Similarity Measurement.
Sensors
21 (3) (2021)
Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Zeyu Xie, Kai Yu
Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning.
ICASSP
(2021)
Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Zeyu Xie, Kai Yu
Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning.
CoRR
(2021)
Di Gai, Xuanjing Shen, Haipeng Chen, Zeyu Xie, Pengxiang Su
Medical image fusion using the PCNN based on IQPSO in NSST domain.
IET Image Process.
14 (9) (2020)