​
Login / Signup
Yuya Fujita
ORCID
Publication Activity (10 Years)
Years Active: 2012-2024
Publications (10 Years): 39
Top Topics
Speech Recognition
Fully Unsupervised
Bayesian Information Criterion
Heart Rate
Top Venues
CoRR
ICASSP
INTERSPEECH
Interspeech
</>
Publications
</>
Takashi Maekaku
,
Jiatong Shi
,
Xuankai Chang
,
Yuya Fujita
,
Shinji Watanabe
Hubertopic: Enhancing Semantic Representation of Hubert Through Self-Supervision Utilizing Topic Model.
ICASSP
(2024)
Brian Yan
,
Xuankai Chang
,
Antonios Anastasopoulos
,
Yuya Fujita
,
Shinji Watanabe
Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing.
ICASSP
(2024)
Xuankai Chang
,
Brian Yan
,
Kwanghee Choi
,
Jee-Weon Jung
,
Yichen Lu
,
Soumi Maiti
,
Roshan S. Sharma
,
Jiatong Shi
,
Jinchuan Tian
,
Shinji Watanabe
,
Yuya Fujita
,
Takashi Maekaku
,
Pengcheng Guo
,
Yao-Fei Cheng
,
Pavel Denisov
,
Kohei Saijo
,
Hsiu-Hsuan Wang
Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study.
ICASSP
(2024)
Takashi Maekaku
,
Yuya Fujita
,
Xuankai Chang
,
Shinji Watanabe
Fully Unsupervised Topic Clustering of Unlabelled Spoken Audio Using Self-Supervised Representation Learning and Topic Model.
ICASSP
(2023)
Xuankai Chang
,
Brian Yan
,
Yuya Fujita
,
Takashi Maekaku
,
Shinji Watanabe
Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning.
INTERSPEECH
(2023)
Xuankai Chang
,
Brian Yan
,
Kwanghee Choi
,
Jee-Weon Jung
,
Yichen Lu
,
Soumi Maiti
,
Roshan S. Sharma
,
Jiatong Shi
,
Jinchuan Tian
,
Shinji Watanabe
,
Yuya Fujita
,
Takashi Maekaku
,
Pengcheng Guo
,
Yao-Fei Cheng
,
Pavel Denisov
,
Kohei Saijo
,
Hsiu-Hsuan Wang
Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study.
CoRR
(2023)
Yuya Fujita
,
Shinji Watanabe
,
Xuankai Chang
,
Takashi Maekaku
LV-CTC: Non-Autoregressive ASR With CTC and Latent Variable Models.
ASRU
(2023)
Xuankai Chang
,
Brian Yan
,
Yuya Fujita
,
Takashi Maekaku
,
Shinji Watanabe
Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning.
CoRR
(2023)
Takashi Maekaku
,
Jiatong Shi
,
Xuankai Chang
,
Yuya Fujita
,
Shinji Watanabe
HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model.
CoRR
(2023)
Motoi Omachi
,
Brian Yan
,
Siddharth Dalmia
,
Yuya Fujita
,
Shinji Watanabe
Align, Write, Re-Order: Explainable End-to-End Speech Translation via Operation Sequence Generation.
ICASSP
(2023)
Brian Yan
,
Xuankai Chang
,
Antonios Anastasopoulos
,
Yuya Fujita
,
Shinji Watanabe
Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing.
CoRR
(2023)
Motoi Omachi
,
Yuya Fujita
,
Shinji Watanabe
,
Tianzi Wang
Non-Autoregressive End-To-End Automatic Speech Recognition Incorporating Downstream Natural Language Processing.
ICASSP
(2022)
Takashi Maekaku
,
Yuya Fujita
,
Yifan Peng
,
Shinji Watanabe
Attention Weight Smoothing Using Prior Distributions for Transformer-Based End-to-End ASR.
INTERSPEECH
(2022)
Motoi Omachi
,
Brian Yan
,
Siddharth Dalmia
,
Yuya Fujita
,
Shinji Watanabe
Align, Write, Re-order: Explainable End-to-End Speech Translation via Operation Sequence Generation.
CoRR
(2022)
Xuankai Chang
,
Takashi Maekaku
,
Yuya Fujita
,
Shinji Watanabe
End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation.
CoRR
(2022)
Xuankai Chang
,
Takashi Maekaku
,
Yuya Fujita
,
Shinji Watanabe
End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation.
INTERSPEECH
(2022)
Takashi Maekaku
,
Xuankai Chang
,
Yuya Fujita
,
Shinji Watanabe
An Exploration of Hubert with Large Number of Cluster Units and Model Assessment Using Bayesian Information Criterion.
ICASSP
(2022)
Yuya Fujita
,
Kazuhiro Izui
,
Shinji Nishiwaki
,
Zhe Zhang
,
Yong Yin
production systems under demand uncertainty.
Comput. Ind. Eng.
163 (2022)
Yosuke Higuchi
,
Nanxin Chen
,
Yuya Fujita
,
Hirofumi Inaguma
,
Tatsuya Komatsu
,
Jaesong Lee
,
Jumon Nozaki
,
Tianzi Wang
,
Shinji Watanabe
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation.
ASRU
(2021)
Yosuke Higuchi
,
Nanxin Chen
,
Yuya Fujita
,
Hirofumi Inaguma
,
Tatsuya Komatsu
,
Jaesong Lee
,
Jumon Nozaki
,
Tianzi Wang
,
Shinji Watanabe
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation.
CoRR
(2021)
Tianzi Wang
,
Yuya Fujita
,
Xuankai Chang
,
Shinji Watanabe
Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models.
CoRR
(2021)
Yuya Fujita
,
Tianzi Wang
,
Shinji Watanabe
,
Motoi Omachi
Toward Streaming ASR with Non-Autoregressive Insertion-Based Model.
Interspeech
(2021)
Takashi Maekaku
,
Xuankai Chang
,
Yuya Fujita
,
Li-Wei Chen
,
Shinji Watanabe
,
Alexander I. Rudnicky
Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021.
Interspeech
(2021)
Motoi Omachi
,
Yuya Fujita
,
Shinji Watanabe
,
Matthew Wiesner
End-to-end ASR to jointly predict transcriptions and linguistic annotations.
NAACL-HLT
(2021)
Tianzi Wang
,
Yuya Fujita
,
Xuankai Chang
,
Shinji Watanabe
Streaming End-to-End ASR Based on Blockwise Non-Autoregressive Models.
Interspeech
(2021)
Takashi Maekaku
,
Xuankai Chang
,
Yuya Fujita
,
Li-Wei Chen
,
Shinji Watanabe
,
Alexander I. Rudnicky
Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021.
CoRR
(2021)
Yuya Fujita
,
Shinji Watanabe
,
Motoi Omachi
,
Xuankai Chang
Insertion-Based Modeling for End-to-End Automatic Speech Recognition.
INTERSPEECH
(2020)
Yuya Fujita
,
Shinji Watanabe
,
Motoi Omachi
,
Xuankai Chang
Insertion-Based Modeling for End-to-End Automatic Speech Recognition.
CoRR
(2020)
Xuankai Chang
,
Aswin Shanmugam Subramanian
,
Pengcheng Guo
,
Shinji Watanabe
,
Yuya Fujita
,
Motoi Omachi
End-to-End ASR with Adaptive Span Self-Attention.
INTERSPEECH
(2020)
Yuya Fujita
,
Aswin Shanmugam Subramanian
,
Motoi Omachi
,
Shinji Watanabe
Attention-Based ASR with Lightweight and Dynamic Convolutions.
ICASSP
(2020)
Aswin Shanmugam Subramanian
,
Xiaofei Wang
,
Murali Karthick Baskar
,
Shinji Watanabe
,
Toru Taniguchi
,
Dung T. Tran
,
Yuya Fujita
Speech Enhancement Using End-to-End Speech Recognition Objectives.
WASPAA
(2019)
Toru Taniguchi
,
Aswin Shanmugam Subramanian
,
Xiaofei Wang
,
Dung T. Tran
,
Yuya Fujita
,
Shinji Watanabe
Generalized Weighted-Prediction-Error Dereverberation with Varying Source Priors For Reverberant Speech Recognition.
WASPAA
(2019)
Aswin Shanmugam Subramanian
,
Xiaofei Wang
,
Shinji Watanabe
,
Toru Taniguchi
,
Dung T. Tran
,
Yuya Fujita
Dry, Focus, and Transcribe: End-to-End Integration of Dereverberation, Beamforming, and ASR.
CoRR
(2019)
Yuya Fujita
,
Masayuki Hiromoto
,
Takashi Sato
Fast And Robust Heart Rate Estimation From Videos Through Dynamic Region Selection.
EMBC
(2018)
Dung T. Tran
,
Ken-ichi Iso
,
Motoi Omachi
,
Yuya Fujita
Multi Scale Feedback Connection for Noise Robust Acoustic Modeling.
ICASSP
(2018)
Yusuke Kida
,
Dung T. Tran
,
Motoi Omachi
,
Toru Taniguchi
,
Yuya Fujita
Speaker Selective Beamformer with Keyword Mask Estimation.
SLT
(2018)
Yuya Fujita
,
Masayuki Hiromoto
,
Takashi Sato
PARHELIA: Particle Filter-Based Heart Rate Estimation From Photoplethysmographic Signals During Physical Exercise.
IEEE Trans. Biomed. Eng.
65 (1) (2018)
Yusuke Kida
,
Dung T. Tran
,
Motoi Omachi
,
Toru Taniguchi
,
Yuya Fujita
Speaker Selective Beamformer with Keyword Mask Estimation.
CoRR
(2018)
Yuya Fujita
,
Ken-ichi Iso
Robust DNN-Based VAD Augmented with Phone Entropy Based Rejection of Background Speech.
INTERSPEECH
(2016)
Akio Kobayashi
,
Takahiro Oku
,
Yuya Fujita
,
Shoei Sato
Lightly supervised training for risk-based discriminative language models.
INTERSPEECH
(2013)
Takahiro Oku
,
Yuya Fujita
,
Akio Kobayashi
,
Shoei Sato
Progressive language model adaptation for disaster broadcasting with closed-captions.
APSIPA
(2013)
Takahiro Oku
,
Yuya Fujita
,
Akio Kobayashi
,
Toru Imai
Speaker adaptation intensively weighted on mis-recognized speech segments.
APSIPA
(2012)