Davide Berghi
ORCID
Publication Activity (10 Years)
Years Active: 2020-2024
Publications (10 Years): 15
Top Topics
Multi Modal Fusion
Activity Detection
Audio Visual
Temporal Context
Top Venues
CoRR
VR Workshops
WASPAA
ICASSP
Publications
Davide Berghi, Philip J. B. Jackson
Leveraging Visual Supervision for Array-Based Active Speaker Detection and Localization.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Davide Berghi, Peipei Wu, Jinzheng Zhao, Wenwu Wang, Philip J. B. Jackson
Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection.
ICASSP
(2024)
Davide Berghi, Philip J. B. Jackson
Audio-Visual Talker Localization in Video for Spatial Sound Reproduction.
CoRR
(2024)
Davide Berghi, Philip J. B. Jackson
Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization.
CoRR
(2023)
Davide Berghi, Philip J. B. Jackson
Audio Inputs for Active Speaker Detection and Localization Via Microphone Array.
WASPAA
(2023)
Jinzheng Zhao, Yong Xu, Xinyuan Qian, Davide Berghi, Peipei Wu, Meng Cui, Jianyuan Sun, Philip J. B. Jackson, Wenwu Wang
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions.
CoRR
(2023)
Davide Berghi, Peipei Wu, Jinzheng Zhao, Wenwu Wang, Philip J. B. Jackson
Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection.
CoRR
(2023)
Davide Berghi, Marco Volino, Philip J. B. Jackson
Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research.
CoRR
(2022)
Davide Berghi, Marco Volino, Philip J. B. Jackson
Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research.
CVMP
(2022)
Davide Berghi, Adrian Hilton, Philip J. B. Jackson
Visually Supervised Speaker Detection and Localization via Microphone Array.
CoRR
(2022)
Hanne Stenzel, Davide Berghi, Marco Volino, Philip J. B. Jackson
Naturalistic audio-visual volumetric sequences dataset of sounding actions for six degree-of-freedom interaction.
VR Workshops
(2021)
Davide Berghi, Adrian Hilton, Philip J. B. Jackson
Visually Supervised Speaker Detection and Localization via Microphone Array.
MMSP
(2021)
Hanne Stenzel, Davide Berghi, Marco Volino, Philip J. B. Jackson
Naturalistic audio-visual volumetric sequences dataset of sounding actions for six degree-of-freedom interaction.
CoRR
(2021)
Davide Berghi, Hanne Stenzel, Marco Volino, Adrian Hilton, Philip J. B. Jackson
Audio-Visual Spatial Alignment Requirements of Central and Peripheral Object Events.
VR Workshops
(2020)
Davide Berghi, Hanne Stenzel, Marco Volino, Adrian Hilton, Philip J. B. Jackson
Audio-Visual Spatial Alignment Requirements of Central and Peripheral Object Events.
CoRR
(2020)