Shota Orihashi
Publication Activity (10 Years)
Years Active: 2015-2023
Publications (10 Years): 38
Top Topics
Speech Recognition
Cross Modal
Subjective Quality
Supervised Learning
Top Venues
CoRR
Interspeech
ICIP
Publications
Shota Orihashi, Yoshihiro Yamazaki, Mihiro Uchida, Akihiko Takashima, Ryo Masumura. Distilling Knowledge of Bidirectional Language Model for Scene Text Recognition. ICIP (2023)
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi. Text-to-Text Pre-Training with Paraphrasing for Improving Transformer-Based Image Captioning. EUSIPCO (2023)
Mihiro Uchida, Shota Orihashi, Akihiko Takashima, Yoshihiro Yamazaki, Ryo Masumura. Open-Set Recognition for Facial-Expression Recognition. ICIP (2023)
Shota Orihashi, Yoshihiro Yamazaki, Mihiro Uchida, Akihiko Takashima, Ryo Masumura. Fully Shareable Scene Text Recognition Modeling for Horizontal and Vertical Writing. ICIP (2022)
Yoshihiro Yamazaki, Shota Orihashi, Ryo Masumura, Mihiro Uchida, Akihiko Takashima. Audio Visual Scene-Aware Dialog Generation with Transformer-based Video Representations. CoRR (2022)
Akihiko Takashima, Ryo Masumura, Atsushi Ando, Yoshihiro Yamazaki, Mihiro Uchida, Shota Orihashi. Interactive Co-Learning with Cross-Modal Transformer for Audio-Visual Emotion Recognition. INTERSPEECH (2022)
Ryo Masumura, Yoshihiro Yamazaki, Saki Mizuno, Naoki Makishima, Mana Ihori, Mihiro Uchida, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Shota Orihashi, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando. End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training. INTERSPEECH (2022)
Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura. Enrollment-Less Training for Personalized Voice Activity Detection. Interspeech (2021)
Shota Orihashi, Yoshihiro Yamazaki, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo Masumura. Hierarchical Knowledge Distillation for Dialogue Sequence Labeling. ASRU (2021)
Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura. MAPGN: MAsked Pointer-Generator Network for sequence-to-sequence pre-training. CoRR (2021)
Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi, Ryo Masumura. Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss. ICASSP (2021)
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Akihiko Takashima, Takafumi Moriya, Takanori Ashihara, Shota Orihashi, Naoki Makishima. Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition. Interspeech (2021)
Ryo Masumura, Daiki Okamura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi. Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation. CoRR (2021)
Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura. Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks Using Switching Tokens. Interspeech (2021)
Shota Orihashi, Yoshihiro Yamazaki, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo Masumura. Hierarchical Knowledge Distillation for Dialogue Sequence Labeling. CoRR (2021)
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi. Large-Context Conversational Representation Learning: Self-Supervised Learning For Conversational Documents. SLT (2021)
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi. Large-Context Conversational Representation Learning: Self-Supervised Learning for Conversational Documents. CoRR (2021)
Shota Orihashi, Yoshihiro Yamazaki, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo Masumura. Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages. MMAsia (2021)
Shota Orihashi, Yoshihiro Yamazaki, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo Masumura. Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages. CoRR (2021)
Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura. MAPGN: Masked Pointer-Generator Network for Sequence-to-Sequence Pre-Training. ICASSP (2021)
Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi, Ryo Masumura. Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss. CoRR (2021)
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi. Hierarchical Transformer-Based Large-Context End-To-End ASR with Large-Context Knowledge Distillation. ICASSP (2021)
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Akihiko Takashima, Takafumi Moriya, Takanori Ashihara, Shota Orihashi, Naoki Makishima. Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition. CoRR (2021)
Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura. Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks using Switching Tokens. CoRR (2021)
Shinobu Kudo, Shota Orihashi, Ryuichi Tanida, Seishi Takamura, Hideaki Kimata. GAN-Based Image Compression Using Mutual Information for Optimizing Subjective Image Similarity. IEICE Trans. Inf. Syst. (3) (2021)
Ryo Masumura, Daiki Okamura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi. Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation. Interspeech (2021)
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Akihiko Takashima, Shota Orihashi, Naoki Makishima. End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning. Interspeech (2021)
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi. Hierarchical Transformer-based Large-Context End-to-end ASR with Large-Context Knowledge Distillation. CoRR (2021)
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Akihiko Takashima, Shota Orihashi, Naoki Makishima. End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning. CoRR (2021)
Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura. Enrollment-less training for personalized voice activity detection. CoRR (2021)
Akihiko Takashima, Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Shota Orihashi, Ryo Masumura. Unsupervised Domain Adversarial Training in Angular Space for Facial Expression Recognition. APSIPA (2020)
Mana Ihori, Ryo Masumura, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi. Memory Attentive Fusion: External Language Model Integration for Transformer-based Sequence-to-Sequence Model. CoRR (2020)
Shota Orihashi, Shinobu Kudo, Ryuichi Tanida, Hideaki Kimata. Subjective Quality Driven Image Encoding Method Using Image Completion. APSIPA (2020)
Shota Orihashi, Mana Ihori, Tomohiro Tanaka, Ryo Masumura. Unsupervised Domain Adaptation for Dialogue Sequence Labeling Based on Hierarchical Adversarial Training. INTERSPEECH (2020)
Mana Ihori, Ryo Masumura, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi. Memory Attentive Fusion: External Language Model Integration for Transformer-based Sequence-to-Sequence Model. INLG (2020)
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi. Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition. INTERSPEECH (2020)
Shinobu Kudo, Shota Orihashi, Ryuichi Tanida, Atsushi Shimizu. GAN-based Image Compression Using Mutual Information Maximizing Regularization. PCS (2019)
Shota Orihashi, Rintaro Harada, Yasutaka Matsuo, Jiro Katto. Improvement of H.265/HEVC encoding for 8K UHDTV by detecting motion complexity. ICCE (2016)
Ryoki Takada, Shota Orihashi, Yasutaka Matsuo, Jiro Katto. Improvement of 8K UHDTV picture quality for H.265/HEVC by global zoom estimation. ICCE (2015)
Shota Orihashi, Rintaro Harada, Yasutaka Matsuo, Jiro Katto. An Adaptive H.265/HEVC Encoding Control for 8K UHDTV Movies Based on Motion Complexity Estimation. ISM (2015)