Shogo Seki
ORCID
Publication Activity (10 Years)
Years Active: 2016-2024
Publications (10 Years): 36
Top Topics
Source Separation
Lightweight
Optimization Methods
Speech Enhancement
Top Venues
CoRR
ICASSP
INTERSPEECH
EUSIPCO
Publications
Li Li, Shogo Seki
Improved Remixing Process for Domain Adaptation-Based Speech Enhancement by Mitigating Data Imbalance in Signal-to-Noise Ratio.
CoRR
(2024)
Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Shogo Seki
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion With Annealed Langevin Dynamics.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Shoma Ayano, Li Li, Shogo Seki, Daichi Kitamura
Audio Spotforming Using Nonnegative Tensor Factorization with Attractor-Based Regularization.
CoRR
(2024)
Li Li, Shogo Seki
Remixed2remixed: Domain Adaptation for Speech Enhancement by Noise2noise Learning with Remixing.
ICASSP
(2024)
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki
iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN.
CoRR
(2023)
Li Li, Shogo Seki
Remixed2Remixed: Domain adaptation for speech enhancement by Noise2Noise learning with Remixing.
CoRR
(2023)
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki
Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis.
CoRR
(2023)
Shogo Seki, Kanami Imamura, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Noboru Harada
W2N-AVSC: Audiovisual Extension For Whisper-To-Normal Speech Conversion.
EUSIPCO
(2023)
Shogo Seki, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko
JSV-VC: Jointly Trained Speaker Verification and Voice Conversion Models.
ICASSP
(2023)
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki
iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN.
INTERSPEECH
(2023)
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki
Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis.
ICASSP
(2023)
Shogo Seki, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka
Non-Parallel Whisper-to-Normal Speaking Style Conversion Using Auxiliary Classifier Variational Autoencoder.
IEEE Access
11 (2023)
Kou Tanaka, Takuhiro Kaneko, Hirokazu Kameoka, Shogo Seki
CFVC: Conditional Filtering for Controllable Voice Conversion.
INTERSPEECH
(2023)
Li Li, Hirokazu Kameoka, Shogo Seki
HBP: An Efficient Block Permutation Solver Using Hungarian Algorithm and Spectrogram Inpainting for Multichannel Audio Source Separation.
ICASSP
(2022)
Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Shogo Seki
Distilling Sequence-to-Sequence Voice Conversion Models for Streaming Conversion Applications.
SLT
(2022)
Takuhiro Kaneko, Kou Tanaka, Hirokazu Kameoka, Shogo Seki
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform.
CoRR
(2022)
Hirokazu Kameoka, Takuhiro Kaneko, Shogo Seki, Kou Tanaka
CAUSE: Crossmodal Action Unit Sequence Estimation from Speech.
INTERSPEECH
(2022)
Takuhiro Kaneko, Kou Tanaka, Hirokazu Kameoka, Shogo Seki
ISTFTNET: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform.
ICASSP
(2022)
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki
MISRNet: Lightweight Neural Vocoder Using Multi-Input Single Shared Residual Blocks.
INTERSPEECH
(2022)
Shogo Seki, Hirokazu Kameoka, Li Li
Investigation And Comparison of Optimization Methods for Variational Autoencoder-Based Underdetermined Multichannel Source Separation.
ICASSP
(2022)
Hirokazu Kameoka, Shogo Seki, Li Li, Chihiro Watanabe
Attentionpit: Soft Permutation Invariant Training for Audio Source Separation with Attention Mechanism.
ICASSP
(2022)
Shogo Seki, Haruka Taga, Tomoki Toda
Singing Fundamental Frequency Contour Generation Using Generalized Command-Response Model and Score-Conditional Variational Autoencoder.
MLSP
(2021)
Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Shogo Seki
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics.
CoRR
(2020)
Shogo Seki, Moe Takada, Tomoki Toda
Semi-Supervised Self-Produced Speech Enhancement and Suppression Based on Joint Source Modeling of Air- and Body-Conducted Signals Using Variational Autoencoder.
INTERSPEECH
(2020)
Moe Takada, Shogo Seki, Patrick Lumban Tobing, Tomoki Toda
Semi-Supervised Enhancement and Suppression of Self-Produced Speech Using Correspondence between Air- and Body-Conducted Signals.
EUSIPCO
(2020)
Shu Hikosaka, Shogo Seki, Tomoki Hayashi, Kazuhiro Kobayashi, Kazuya Takeda, Hideki Banno, Tomoki Toda
Intelligibility Enhancement Based on Speech Waveform Modification Using Hearing Impairment.
INTERSPEECH
(2020)
Shogo Seki, Hirokazu Kameoka, Li Li, Tomoki Toda, Kazuya Takeda
Generalized Multichannel Variational Autoencoder for Underdetermined Source Separation.
EUSIPCO
(2019)
Shogo Seki, Hirokazu Kameoka, Li Li, Tomoki Toda, Kazuya Takeda
Underdetermined Source Separation Based on Generalized Multichannel Variational Autoencoder.
IEEE Access
7 (2019)
Shota Inoue, Hirokazu Kameoka, Li Li, Shogo Seki, Shoji Makino
Joint Separation and Dereverberation of Reverberant Mixtures with Multichannel Variational Autoencoder.
ICASSP
(2019)
Shogo Seki, Tomoki Toda, Kazuya Takeda
Stereophonic Music Separation Based on Non-Negative Tensor Factorization with Cepstral Distance Regularization.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci.
(7) (2018)
Moe Takada, Shogo Seki, Tomoki Toda
Self-Produced Speech Enhancement and Suppression Method using Air- and Body-Conductive Microphones.
APSIPA
(2018)
Shogo Seki, Hirokazu Kameoka, Li Li, Tomoki Toda, Kazuya Takeda
Generalized Multichannel Variational Autoencoder for Underdetermined Source Separation.
CoRR
(2018)
Shogo Seki, Takeo Igarashi
Sketch-based 3D hair posing by contour drawings.
Symposium on Computer Animation
(2017)
Shogo Seki, Hirokazu Kameoka, Tomoki Toda, Kazuya Takeda
Missing component restoration for masked speech signals based on time-domain spectrogram factorization.
MLSP
(2017)
Shogo Seki, Tomoki Toda, Kazuya Takeda
Stereophonic music separation based on non-negative tensor factorization with cepstrum regularization.
EUSIPCO
(2017)
Atsunori Ogawa, Shogo Seki, Keisuke Kinoshita, Marc Delcroix, Takuya Yoshioka, Tomohiro Nakatani, Kazuya Takeda
Robust Example Search Using Bottleneck Features for Example-Based Speech Enhancement.
INTERSPEECH
(2016)