Login / Signup
Kei Sawada
Publication Activity (10 Years)
Years Active: 2012-2024
Publications (10 Years): 26
Top Topics
Markov Model
Japanese Language
Neural Network
Image Recognition
Top Venues
CoRR
INTERSPEECH
APSIPA
ICASSP
</>
Publications
</>
Kentaro Mitsui
,
Koh Mitsuda
,
Toshiaki Wakatsuki
,
Yukiya Hono
,
Kei Sawada
PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems.
CoRR
(2024)
Kei Sawada
,
Tianyu Zhao
,
Makoto Shing
,
Kentaro Mitsui
,
Akio Kaga
,
Yukiya Hono
,
Toshiaki Wakatsuki
,
Koh Mitsuda
Release of Pre-Trained Models for the Japanese Language.
LREC/COLING
(2024)
Kei Sawada
,
Tianyu Zhao
,
Makoto Shing
,
Kentaro Mitsui
,
Akio Kaga
,
Yukiya Hono
,
Toshiaki Wakatsuki
,
Koh Mitsuda
Release of Pre-Trained Models for the Japanese Language.
CoRR
(2024)
Yukiya Hono
,
Koh Mitsuda
,
Tianyu Zhao
,
Kentaro Mitsui
,
Toshiaki Wakatsuki
,
Kei Sawada
Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition.
ACL (Findings)
(2024)
Kentaro Mitsui
,
Yukiya Hono
,
Kei Sawada
Towards human-like spoken dialogue generation between AI agents from written dialogue.
CoRR
(2023)
Congda Ma
,
Tianyu Zhao
,
Makoto Shing
,
Kei Sawada
,
Manabu Okumura
Focused Prefix Tuning for Controllable Text Generation.
CoRR
(2023)
Kentaro Mitsui
,
Yukiya Hono
,
Kei Sawada
UniFLG: Unified Facial Landmark Generator from Text or Speech.
INTERSPEECH
(2023)
AprilPyone MaungMaung
,
Makoto Shing
,
Kentaro Mitsui
,
Kei Sawada
,
Fumio Okura
Text-Guided Scene Sketch-to-Photo Synthesis.
CoRR
(2023)
Kentaro Mitsui
,
Yukiya Hono
,
Kei Sawada
UniFLG: Unified Facial Landmark Generator from Text or Speech.
CoRR
(2023)
Congda Ma
,
Tianyu Zhao
,
Makoto Shing
,
Kei Sawada
,
Manabu Okumura
Focused Prefix Tuning for Controllable Text Generation.
ACL (2)
(2023)
Yukiya Hono
,
Koh Mitsuda
,
Tianyu Zhao
,
Kentaro Mitsui
,
Toshiaki Wakatsuki
,
Kei Sawada
An Integration of Pre-Trained Speech and Language Models for End-to-End Speech Recognition.
CoRR
(2023)
Kentaro Mitsui
,
Kei Sawada
MSR-NV: Neural Vocoder Using Multiple Sampling Rates.
INTERSPEECH
(2022)
Kentaro Mitsui
,
Tianyu Zhao
,
Kei Sawada
,
Yukiya Hono
,
Yoshihiko Nankaku
,
Keiichi Tokuda
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue.
INTERSPEECH
(2022)
Divesh Lala
,
Koji Inoue
,
Tatsuya Kawahara
,
Kei Sawada
Backchannel Generation Model for a Third Party Listener Agent.
HAI
(2022)
Kentaro Mitsui
,
Tianyu Zhao
,
Kei Sawada
,
Yukiya Hono
,
Yoshihiko Nankaku
,
Keiichi Tokuda
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue.
CoRR
(2022)
Ruozi Huang
,
Huang Hu
,
Wei Wu
,
Kei Sawada
,
Mi Zhang
,
Daxin Jiang
Dance Revolution: Long-Term Dance Generation with Music via Curriculum Learning.
ICLR
(2021)
Kentaro Mitsui
,
Kei Sawada
MSR-NV: Neural Vocoder Using Multiple Sampling Rates.
CoRR
(2021)
Yukiya Hono
,
Kazuna Tsuboi
,
Kei Sawada
,
Kei Hashimoto
,
Keiichiro Oura
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis.
INTERSPEECH
(2020)
Yukiya Hono
,
Kazuna Tsuboi
,
Kei Sawada
,
Kei Hashimoto
,
Keiichiro Oura
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis.
CoRR
(2020)
Ruozi Huang
,
Huang Hu
,
Wei Wu
,
Kei Sawada
,
Mi Zhang
Dance Revolution: Long Sequence Dance Generation with Music via Curriculum Learning.
CoRR
(2020)
Takayuki Kasugai
,
Yoshinari Tsuzuki
,
Kei Sawada
,
Kei Hashimoto
,
Keiichiro Oura
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Image Recognition Based on Convolutional Neural Networks Using Features Generated from Separable Lattice Hidden Markov Models.
APSIPA
(2018)
Eiji Ichikawa
,
Kei Sawada
,
Kei Hashimoto
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Image Recognition Based on Separable Lattice Hmms Using a Deep Neural Network for Output Probability Distributions.
ICASSP
(2018)
Koki Senda
,
Yukiya Hono
,
Kei Sawada
,
Kei Hashimoto
,
Keiichiro Oura
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Singing Voice Conversion Using Posted Waveform Data on Music Social Media.
APSIPA
(2018)
Yoshinari Tsuzuki
,
Kei Sawada
,
Kei Hashimoto
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Image recognition based on discriminative models using features generated from separable lattice HMMS.
ICASSP
(2017)
Kei Sawada
,
Keiichi Tokuda
,
Simon King
,
Alan W. Black
The blizzard machine learning challenge 2017.
ASRU
(2017)
Kei Sawada
,
Akira Tamamori
,
Kei Hashimoto
,
Yoshihiko Nankaku
,
Keiichi Tokuda
A Bayesian Approach to Image Recognition Based on Separable Lattice Hidden Markov Models.
IEICE Trans. Inf. Syst.
(12) (2016)
Kei Sawada
,
Kei Hashimoto
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Image recognition based on hidden Markov eigen-image models using variational Bayesian method.
APSIPA
(2013)
Kei Sawada
,
Akira Tamamori
,
Kei Hashimoto
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Face recognition based on separable lattice 2-D HMMS using variational bayesian method.
ICASSP
(2012)