Conference Papers
2025

- 2025
QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions
S. Wang, W. Yu, X. Chen, X. Tian, J. Zhang, L. Lu, Y. Tsao, J. Yamagishi, Y. Wang, C. Zhang
ACL, 2025

- 2025
A Study on Speech Assessment with Visual Cues
S. Ahmed, R. E. Zezario, N. Saleem, A. Hussain, H.-M. Wang, and Y. Tsao
Interspeech, 2025

- 2025
Cross-modal Knowledge Transfer Learning as Graph Matching Based on Optimal Transport for ASR
X. Lu, P. Shen, Y. Tsao, H. Kawai
Interspeech, 2025

- 2025
VoiceNoNG: Robust High-Quality Speech Editing Model without Hallucinations
S.-F. Huang et. al.
Interspeech, 2025

- 2025
ZSDEVC: Zero-Shot Diffusion-based Emotional Voice Conversion with Disentangled Mechanism
H.-H. Chou, Y.-S. Lin, C.-C. Sung, Y. Tsao, and C.-C. Lee
Interspeech, 2025

- 2025
Universal Speech Enhancement with Regression and Generative Mamba
R. Chao, R. Nasretdinov, Y.-C. F. Wang, A. Jukić, S.-W. Fu, and Y. Tsao
Interspeech, 2025

- 2025
A Comparative Study on Proactive and Passive Detection of Deepfake Speech
C.-H. Wu, W. Ge, X. Wang, J. Yamagishi, Y. Tsao, and H.-M. Wang
Interspeech, 2025

- 2025
Feature Importance across Domains for Improving Non-Intrusive Speech Intelligibility Prediction in Hearing Aids
R. E. Zezario, S. M. Siniscalchi, F. Chen, H.-M. Wang, and Y. Tsao
Interspeech, 2025

- 2025
Speech Enhancement with MAP-based Training for Robust ASR
Y.-J. Li, R. Chao, B. Su, and Y. Tsao
IEEE ICASSP 2025, 2025

- 2025
MSEMG: Surface Electromyography Denoising with a Mamba-based Efficient Network
Y.-T. Liu, K.-C. Wang, R. Chao, S. M. Siniscalchi, P.-C. Yeh, and Y. Tsao
IEEE ICASSP 2025, 2025

- 2025
Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement
W. Ren, H. Wu, Y.-C. Lin, X. Chen, R. Chao, K.-H. Hung, Y.-J. Li, W.-Y. Ting, H.-M. Wang, and Y. Tsao
IEEE ICASSP 2025, 2025

- 2025
Neural Variational Mode Decomposition and Its Application for ECG Denoising
D.-Y. Lu, J.-J. Ding, and Y. Tsao
IEEE ICASSP 2025, 2025

- 2025
A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models
R. E. Zezario, S. M. Siniscalchi, H.-M. Wang, and Y. Tsao
IEEE ICASSP 2025, 2025

- 2025
MSECG: Incorporating Mamba for Robust and Efficient ECG Super-Resolution
J. Lin, I Chiu, K.-C. Wang, K.-C. Liu, H.-M. Wang, P.-C. Yeh, and Y. Tsao
IEEE ICASSP 2025, 2025
2024

- 2024
DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset
J. Du, I-M. Lin, I-H. Chiu, X. Chen, H. Wu, W. Ren, Y. Tsao, H.-y. Lee, and J.-S. R. Jang
IEEE SLT 2024, 2024

- 2024
Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
C.-H. H. Yang et al.
IEEE SLT 2024, 2024

- 2024
FLANEC: Exploring Flan-T5 for Post-ASR Error Correction
M. L. Quatra, V. M. Salerno, Y. Tsao, S. M. Siniscalchi
IEEE SLT 2024, 2024

- 2024
Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR
X. Lu, P. Shen, Y. Tsao, and H. Kawai
IEEE SLT 2024, 2024

- 2024
An Investigation of Incorporating Mamba for Speech Enhancement
R. Chao, W.-H. Cheng, M. L. Quatra, S. M. Siniscalchi, C.-H. H. Yang, S.-W. Fu, and Y. Tsao
IEEE SLT 2024, 2024

- 2024
The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction
W.-C. Huang, S.-W. Fu, E. Cooper, R. E. Zezario, T. Toda, H.-M. Wang, J. Yamagishi, and Y. Tsao
IEEE SLT 2024, 2024

- 2024
RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier
P.-Y. Huang, S.-W. Fu, and Y. Tsao
NeurIPS 2024, 2024

- 2024
Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
S.-F. Huang, H.-C. Kuo, Z. Chen, X. Yang, C.-H. H. Yang, Y. Tsao, Y.-C. F. Wang, H.-y. Lee, and S.-W. Fu
IEEE SLT 2024, 2024

- 2024
MECG-E: Mamba-based ECG Enhancer for Baseline Wander Removal
K.-H. Hung, K.-C. Wang, K.-C. Liu, W.-L. Chen, X. Lu, Y. Tsao, and C.-W. Lin
IEEE BigData 2024, 2024

- 2024
SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models
C. Yin, T.-S. Chi, Y. Tsao, and H.-M. Wang
Interspeech, 2024

- 2024
Learnable Layer Selection and Model Fusion for Speech Self-Supervised Learning Models
S.-C. Chiu, C.-H. Wu, J.-K. Hsieh, Y. Tsao, and H.-M. Wang
Interspeech, 2024

- 2024
Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata
R. E. Zezario, F. Chen, C.-S.Fuh, H.-M. Wang, and Y. Tsao
Interspeech, 2024

- 2024
Bridging the Gap: Integrating Pre-trained Speech Enhancement and Recognition Models for Robust Speech Recognition
K.-C. Wang, Y.-J. Li, W.-L. Chen, Y.-W. Chen, Y.-C. Wang, P.-C. Yeh, C. Zhang, and Y. Tsao
IEEE EUSIPCO 2024, 2024

- 2024
A Study on Incorporating Whisper for Robust Speech Assessment
R. E. Zezario, Y.-W. Chen, S.-W. Fu, Y. Tsao, H.-M. Wang, C.-S. Fuh
IEEE ICME 2024 , 2024 (Top Performance on the Track 3 - VoiceMOS Challenge 2023)

- 2024
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech
S.-W. Fu, K.-H. Hung, Y. Tsao, and Y.-C. F. Wang
ICLR, 2024

- 2024
SDEMG: Score-based Diffusion Model For Surface Electromyographic Signal Denoising
Y.-T. Liu, K.-C. Wang, K.-C. Liu, S.-Y. Peng, and Y. Tsao
IEEE ICASSP, 2024

- 2024
Hierarchical Cross-modality Knowledge Transfer With Sinkhorn Attention For Ctc-based ASR
X. Lu, P. Shen, Y. Tsao, and H. Kawai
IEEE ICASSP, 2024

- 2024
Scalable Ensemble-based Detection Method Against Adversarial Attacks For Speaker Verification
H. Wu, H.-C. Kuo, Y. Tsao, H.-y. Lee
IEEE ICASSP, 2024

- 2024
A Multi-task Evaluation Benchmark For Audio-visual Representation Models
Y. Tseng, L. Berry, and Y.-T. Chen et al.,
IEEE ICASSP, 2024

- 2024
Multi-task Pseudo-label Learning For Non-intrusive Speech Quality Assessment Model
R. E. Zezario, B.-R. B. Bai, C.-S. Fuh, H.-M. Wang, and Y. Tsao
IEEE ICASSP, 2024
2023

- 2023
Cross-modal alignment with optimal transport for CTC-based ASR
X. Lu, P. Shen, Y. Tsao, and H. Kawa
IEEE ASRU, 2023

- 2023
LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models
C.-C. Lee, H.-W. Chen, C.-S. Chen, H.-M. Wang, T.-T. Liu, and Y. Tsao
IEEE ASRU, 2023

- 2023
Study on the Correlation between Objective Evaluations and Subjective Speech Quality and Intelligibility
H.-T. Chiang, K.-H. Hung, S.-W. Fu, H.-C. Kuo, M.-H. Tsai, and Y. Tsao
IEEE ASRU, 2023

- 2023
The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
E. Cooper, W.-C. Huang, Y.Tsao, H.-M. Wang, T. Toda, and J. Yamagishi
IEEE ASRU, 2023

- 2023
Inference and Denoise: Causal Inference-based Neural Speech Enhancement
T.-A. Hsieh, C.-H. Huck Y., P.-Y. Chen, S. M. Siniscalchi, Y. Tsao
IEEE MLSP, 2023

- 2023
IANS: Intelligibility-aware Null-steering Beamforming for Dual-Microphone Arrays
W.-Y. Ting, S.-S. Wang, Y. Tsao, and B. Su
IEEE MLSP, 2023

- 2023
Voice Direction-of-Arrival Conversion
I-C. Chern, S. Chern, H.-C. Kuo, H.-H. Tseng, K.-H. Hung, and Y. Tsao
IEEE MLSP, 2023

- 2023
Neural Model Reprogramming with Similarity Based Mapping for Low-Resource Spoken Command Recognition
H. Yen, P.-J. Ku, C.-H. H. Yang, H. Hu, S. M. Siniscalchi, P.-Y. Chen, and Y. Tsao
Interspeech, 2023

- 2023
Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion
Y.-L. Chien, H.-H. Chen, M.-C. Yen, S.-W. Tsai, H.-M. Wang, Y. Tsao, T.-S. Chi
Interspeech, 2023

- 2023
A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech
L.-W. Chen, Y.-F. Cheng, H.-S. Lee, Y. Tsao, and H.-M. Wang
Interspeech, 2023

- 2023
Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features
H.-H. Chen, Y.-L. Chien, M.-C. Yen, S.-W. Tsai, T.-S. Chi, Y. Tsao, and H.-M. Wang
Interspeech, 2023

- 2023
Multi-Task Learning U-Net for Functional Shoulder Sub-Task Segmentation
E.-P. Chu, K.-C. Liu, C.-Y. Hsieh, C.-Y. Chang, Y. Tsao, and C.-T. Chan
IEEE EMBC, 2023

- 2023
Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks
C.-P. Liu, J.-H. Li, E.-P. Chu, C.-Y. Hsieh, K.-C. Liu, C.-T. Chan, and Y. Tsao
IEEE MeMeA, 2023

- 2023
Wearable-based Pain Assessment in Patients with Adhesive Capsulitis Using Machine Learning
C.-H. Chen, K.-C. Liu, T.-Y. Lu, C.-Y. Chang, C.-T. Chan, and Y. Tsao
IEEE NER, 2023

- 2023
Towards Individualised Speech Enhancement: An SNR Preference learning System For Multi-modal Hearing Aids
J. Kirton-Wingate, S. Ahmed, M. Gogate, Y. Tsao, A. Hussain
IEEE ICASSP 2023 (AMHAT 2023 Workshop), 2023

- 2023
D4AM: A General Denoising Framework for Downstream Acoustic Models
C.-C. Lee, Y. Tsao, H.-M. Wang and C.-S. Chen
ICLR, 2023

- 2023
Pre-Impact Fall Detection via CNN-ViT Knowledge Distillation
T.-H. Chi, K.-C. Liu, C.-Y. Hsieh, Y. Tsao, and C.-T. Chan
IEEE ICASSP , 2023

- 2023
ECG Artifact Removal from Single-Channel Surface EMG Using Fully Convolutional Networks
K.-C. Wang, K.-C. Liu, S.-Y. Peng, Y. Tsao
IEEE ICASSP, 2023

- 2023
On the Robustness of Non-intrusive Speech Quality Model by Adversarial Examples
H.-Y. Lin, H.-H. Tseng, and Y. Tsao
IEEE ICASSP, 2023

- 2023
Audio-visual Speech Enhancement And Separation By Utilizing Multi-modal Self-supervised Embeddings
I-C. Chern, K.-H. Hung, Y.-T. Chen, T. Hussain, M. Gogate, A. Hussain, Y. Tsao, and J.-C. Hou
IEEE ICSSP 2023 (AMHAT 2023 Workshop), 2023

- 2023
Towards Individualised Speech Enhancement: An SNR Preference learning System For Multi-modal Hearing Aids
J. Kirton-Wingate, S. Ahmed, M. Gogate, Y. Tsao, A. Hussain
IEEE ICSSP 2023 (AMHAT 2023 Workshop), 2023

- 2023
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
C.-J. Hsu, H.-L. Chung, H.-y. Lee, amd Y. Tsao
IEEE ICSSP, 2023

- 2023
Interpretations of Domain Adaptations via Layer Variational Analysis
H.-H. Tseng, H.-Y. Lin, H.-K. Hsuan and Y. Tsao
ICLR, 2023
2022

- 2022
Dysarthric Speech Enhancement Based on Convolution Neural Network
S.-S. Wang, Y. Tsao, W.-Z. Zheng, H.-W. Yeh, P.-C. Li, S.-H. Fang, Y.-H. Lai
IEEE EMBC, 2022

- 2022
A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning
T. Hussain, M. Diyan, M. Gogate, K. Dashtipour, A. Adeel, Y. Tsao, A. Hussain
IEEE EMBC, 2022

- 2022
NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling
C.-C. Lee, C.-H. Hu, Y.-C. Lin, C.-S. Chen, H.-M. Wang and Y. Tsao
Interspeech, 2022

- 2022
Boosting Self-Supervised Embeddings for Speech Enhancement
K.-H. Hung, S.-W. Fu, H.-H. Tseng, H.-T. Chiang, Y. Tsao, C.-W. Lin
Interspeech, 2022

- 2022
Perceptual Characteristics Based Multi-objective Model for Speech Enhancement
C.-J. Peng, Y.-J. Chan, Y.-L.Shen, C. Yu, Y. Tsao and T.-S. Chi
Interspeech, 2022

- 2022
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
Y.-J. Lu et al
Interspeech, 2022

- 2022
🏆
MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility
Prediction Model for Hearing Aids
R. E. Zezario, F. Chen, C.-S. Fuh, H.-M. Wang and Y. Tsao
Interspeech, 2022

- 2022
MTI-Net: A Multi-Target Speech Intelligibility Prediction Model
R. E. Zezario, S.-W. Fu, F. Chen, C.-S. Fuh, H.-M. Wang and Y. Tsao
Interspeech, 2022

- 2022
Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks Authors
F.-L. Wang, H.-S. Lee, Y. Tsao and H.-M. Wang
Interspeech, 2022

- 2022
Perceptual Contrast Stretching on Target Feature for Speech Enhancement
R. Chao, C. Yu, S.-W. Fu, X. Lu and Y. Tsao
Interspeech, 2022

- 2022
The VoiceMOS Challenge 2022
W.-C. Huang, E. C., Y. Tsao, H.-M. Wang, T. Toda and J. Yamagishi
Interspeech, 2022

- 2022
InQSS: a speech intelligibility and quality assessment model using a multi-task learning network
Y.-W. Chen and Y. Tsao
Interspeech, 2022

- 2022
OSSEM: one-shot speaker adaptive speech enhancement using meta learning
C. Yu, S.-W. Fu, T.-An Hsieh, Y. Tsao and M. Ravanelli
Interspeech, 2022

- 2022
When Bert Meets Quantum Temporal Convolution Learning for Text Classification In Heterogeneous Computing
C.-H. H. Yang, J. Qi, S. Y.-C. Chen, Y. Tsao, and P.-Y. Chen
ICASSP, 2022

- 2022
XDBERT: Distilling Visual Information to BERT via Cross-Modal Encoders to Improve Language Understanding
C.-J. Hsu, H.-Y. Lee, Y. Tsao
ACL , 2022

- 2022
Partially Fake Audio Detection by Self-attention-based Fake Span Discovery
H. Wu, H.-C. Kuo, N. Zheng, K.-H. Hung, H.-Y. Lee, Y. Tsao, H.-M. Wang, and H. Meng
ICASSP, 2022

- 2022
Speech Recovery For Real-world Self-powered Intermittent Devices
Y.-C. Lin,T.-A. Hsieh, K.-H. Hung, C. Yu, H. Garudadri, Y. Tsao, and T.-W. Kuo
ICASSP, 2022

- 2022
EMGSE: Acoustic/emg Fusion For Multimodal Speech Enhancement
K.-C. Wang, K.-C. Liu, H.-M. Wang, and Y. Tsao
ICASSP, 2022

- 2022
Conditional Diffusion Probabilistic Model For Speech Enhancement
Y.-J. Lu, Z.-Q. Wang, S. Watanabe, A. Richard, C. Yu, and Y. Tsao
ICASSP, 2022

- 2022
MetricGAN-U: Unsupervised Speech Enhancement/ Dereverberation based Only On Noisy/ Reverberated Speech
S.-W. Fu, C. Yu, K.-H. Hung, M. Ravanelli, and Y. Tsao
ICASSP, 2022

- 2022
Analyzing The Robustness Of Unsupervised Speech Recognition
G.-T. Lin, C.-J. Hsu, D.-R. Liu, H.-Y. Lee, and Y. Tsao
ICASSP, 2022
2021

- 2021
Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport
H.-Y. Lin, H.-H. Tseng, X. Lu, and Y. Tsao
NeurIPS, 2021

- 2021
HASA-NET: A Non-Intrusive Hearing-Aid Speech Assessment Network
H.-T. Chiang, Y.-C. Wu, C. Yu, T. Toda, H.-M. Wang, Y.-C. Hu, and Y. Tsao
ASRU, 2021

- 2021
Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Model-ing
M.-C. Yen, W.-C. Huang, K. Kobayashi, Y.-H. Peng, S.-W. Tsai, Y.
Tsao,
T. Toda, J.-S. Jang, and H.-M. Wang
ASRU, 2021

- 2021
An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
X. Chang, T. Maekaku, P. Guo, J. Shi,Y.-J. Lu, A. S. Subramanian, T.
Wang,
S.-w. Yang, Y. Tsao, H.-y. Lee, and S. Watanabe
ASRU, 2021

- 2021
Instrumented Shoulder Functional Assessment using Inertial Measurement Units for Frozen Shoulder
T.-Y. Lu, K.-C. Liu, C.-Y. Hsieh, C.-Y. Chang, Y. Tsao, C.-T. Chan
IEEE BHI, 2021

- 2021
Investigation of A Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-To-End Bengali Automatic Speech Recogni-tion Under Unseen Noisy Conditions
M. E Noor, Y.-J. Lu, S.-Si. Wang, S. Ghose, C.-Y. Chang, R. E. Zezario,
S. Ahmed,
W.-H. Chung, Y. Tsao and H.-M. Wang
Oriental COCOSDA, 2021

- 2021
MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder
Y.-J. Li, S.-S. Wang, Y. Tsao, and B. Su
APSIPA ASC, 2021

- 2021
A Study on Speech Enhancement Based on Diffusion Probabilistic Model
Y.-J. Lu, Y. Tsao, and S. Watanabe
APSIPA ASC, 2021

- 2021
Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice Conversion
Y.-S. Liou, W.-C. Huang, M.-C. Yen, S.-W. Tsai, Y.-H. Peng, T. Toda, Y. Tsao, and H.-M. Wang
APSIPA ASC, 2021

- 2021
Estimation and Correction of Relative Transfer Function for Binaural Speech Separation Networks to Preserve Spatial Cues
Z. Feng, Y. Tsao, and F. Chen
APSIPA ASC, 2021

- 2021
Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification
X. Lu, P. Shen, Y. Tsao, and H. Kawai
APSIPA ASC, 2021

- 2021
Unsupervised neural adaptation model based on optimal transport for spoken language identification
X. Lu, P. Shen, Y. Tsao, and H. Kawai
ICASSP, 2021

- 2021
A Preliminary Study of a Two-Stage Paradigm for Preserving SpeakerIdentity in Dysarthric Voice Conversion
W.-C. Huang, K. Kobayashi, Y.-H. Peng, C.-F. Liu, Y. Tsao, H.-M. Wang, and T. Toda
Interspeech, 2021

- 2021
A Study of Incorporating Articulatory Movement Information in Speech Enhancement
Y.-W. Chen, K.-H. Hung, S.-Y. Chuang, J. Sherman, X. Lu, and Y. Tsao
EUSIPCO, 2021

- 2021
Speech Enhancement with Zero-Shot Model Selection
R. E Zezario, C.-S. Fuh, H.-M. Wang, Y. Tsao
EUSIPCO, 2021

- 2021
One shot learning for speech separation
Y.-K. Wu, K.-P. Huang, Y. Tsao, and H.-Y. Lee
ICASSP, 2021

- 2021
QISTA-Net-Audio: Audio Super-resolution via Non-Convex Lq-normMinimization
G.-X. Lin, S.-W. Hu, Y.-J. Lu, Y. Tsao, and C.-S. Lu
Interspeech, 2021

- 2021
MetricGAN +: An Improved Version of MetricGAN for Speech Enhancement
S.-W. Fu, C. Yu, T.-A. Hsieh, P. Plantinga, M. Ravanelli, X. Lu, and Y. Tsao
Interspeech, 2021

- 2021
Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement
T.-A. Hsieh, C. Yu, S.-W. Fu, X. Lu, and Y. Tsao
Interspeech, 2021

- 2021
Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder
Y,-C. Wu, C.-H. Hu, H.-S. Lee, Y.-H. Peng, W.-C. Huang, Y. Tsao, H.-M. Wang, and T. Toda
Interspeech, 2021

- 2021
EMA2S: An End-to-End Multimodal Articulatory-to-Speech System
Y.-W. Chen, K.-H. Hung, S.-Y. Chuang, J. Sherman, W.-C. Huang, X. Lu, and Y. Tsao
ISCAS, 2021

- 2021
Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario
C.-J. Peng, Y.-J. Chan, C. Yu, S.-S. Wang, Y. Tsao, and T.-S. Chi
ISCAS, 2021

- 2021
MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration
Y.-T. Chang, Y.-H. Yang, Y.-H. Peng, S.-S. Wang, T.-S. Chi, Y. Tsao and H.-M. Wang
ISCSLP, 2021
2020

- 2020
Enhancing Intelligibility of Dysarthric Speech Using Gated Convolutional-based Voice Conversion System
C.-Y. Chen, W.-Z. Zheng, S.-S. Wang, Y. Tsao, P.-C. Li, and Y.-H. Lai
Interspeech, 2020

- 2020
Lite Audio-Visual Speech Enhancement
S.-Y. Chuang, Y. Tsao, C.-C. Lo, and H.-M. Wang
Interspeech, 2020

- 2020
STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model
R. E. Zezario, S.-W. Fu, C.-S. Fuh, Y. Tsao, and H.-M. Wang
APSIPA, 2020

- 2020
Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing
S.-W. Fu et al.,
APSIPA, 2020

- 2020
SERIL: Noise Adaptive Speech Enhancement using Regularization-based Incremental Learning
C.-C. Lee, Y.-C. Lin, H.-T. Lin, H.-M. Wang, and Y. Tsao
Interspeech, 2020

- 2020
iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning
H. Li, S.-W. Fu, Y. Tsao, and J. Yamagishi
Interspeech, 2020

- 2020
Incorporating Broad Phonetic Information for Speech Enhancement
Y.-J. Lu, C.-F. Liao, X. Lu, J.-w. Hung, and Y. Tsao
Interspeech, 2020

- 2020
Self-supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement
R. E. Zezario, T. Hussain, X. Lu, H.-M. Wang, and Y. Tsao
ICASSP, 2020
2019

- 2019
Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature Enhancement
W.-C. Lin, Y. Tsao, F. Chen, and H.-M. Wang
APSIPA, 2019

- 2019
Compressed Multimodal Hierarchical Extreme Learning Machine for Speech Enhancement
T. Hussaink, Y. Tsao, H.-M. Wang, J.-C. Wang, S. M. Siniscalchi, and W.-H. Liao
ICASSP, 2019

- 2019
Subjective Feedback-based Neural Network Pruning for Speech Enhancement
W.-C. Lin, Y. Tsao, F. Chen, and H.-M. Wang
APSIPA, 2019

- 2019
Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking
K.-Y. Liu, S.-S. Wang, Y. Tsao, and J.-w. Hung
ROCLING, 2019

- 2019
Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion
W.-C. Huang, Y.-C. Wu, K. Kobayashi, Y.-H. Peng, H.-T. Hwang, P. L. Tobing, Y. Tsao,
H.-M. Wang, and T. Toda
ISCA SSW 10, 2019

- 2019
Speaker-aware Deep Denoising Autoencoder with Embedded Speaker Identity for Speech Enhancement
F.-K. Chuang, S.-S. Wang, J.-w. Hung, Y. Tsao, and S.-H. Fang
Interspeech, 2019

- 2019
IA-NET: Acceleration and Compression of Speech Enhancement using Integer-adder Deep Neural Network
Y.-C. Lin, Y.-T. Hsu, S.-W. Fu, Y. Tsao, and T.-W. Kuo
Interspeech, 2019

- 2019
MOSNet: Deep Learning-based Objective Assessment for Voice Conversion
C.-C. Lo, S.-w. Fu, W. C. Huang, X. Wang, J. Yamagishi, Y. Tsao, and H.-M. Wang
Interspeech, 2019

- 2019
Class-wise Centroid Distance Metric Learning for Acoustic Event Detection
X. Lu, P. Shen, S. Li, Y. Tsao, and H. Kawai
Interspeech, 2019

- 2019
Incorporating Symbolic Sequential Modeling for Speech Enhancement
C.-F. Liao, Y. Tsao, X. Lu, and H. Kawai
Interspeech, 2019
with ISCA Travel Grant

- 2019
Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric
R. E. Zezario, S.-W. Fu, X. Lu, H.-M. Wang, and Y. Tsao
Interspeech, 2019

- 2019
Noise Adaptive Speech Enhancement using Domain Adversarial Training
C.-F. Liao, Y. Tsao, H.-y. Lee, and H.-M. Wang
Interspeech, 2019
with ISCA Travel Grant

- 2019
Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech
L.-W. Chen, H.-Y. Lee, and Y. Tsao
Interspeech, 2019

- 2019
Exploring the Encoder Layers of Discriminative Autoencoders for LVCSR
P.-T. Huang, H.-S. Lee, S.-S. Wang, K.-Y. Chen, Y. Tsao, and H.-M. Wang
Interspeech, 2019
with ISCA Travel Grant

- 2019
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion
W.-C. Huang, Y.-C. Wu, H.-T. Hwang, P. L. Tobing, T. Hayashiy, K. Kobayashi, T. Toda,
Y. Tsao, and H.-M. Wang
European Signal Processing Conference (EUSIPCO), 2019

- 2019
Audio-Visual Speech Enhancement Using Hierarchical Extreme Learning Machine
T. Hussain, Y. Tsao, H.-M. Wang, J.-C. Wang, S. M. Siniscalchi, and W.-H. Liao
European Signal Processing Conference (EUSIPCO), 2019

- 2019
Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion
W.-C. Huang, Y.-C. Wu, C.-C. Lo, P. L. Tobing, T. Hayashi, K. Kobayashi, T. Toda,
Y. Tsao, and H.-M. Wang
Interspeech, 2019

- 2019
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement
S.-W. Fu, C.-F. Liao, Y. Tsao, and S.-D. Lin
ICML, 2019

- 2019
Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition
Y.-L. Shen, C.-Y. Huang, S.-S. Wang, Y. Tsao, H.-M. Wang, and T.-S. Chi
ICASSP, 2019

- 2019
Reducing noise and reverberation in speech signals via the integration of denoising autoencoder and temporal lowpass filtering
K.-Y. Liu, S.-k. Lee, S.-S. Wang, Y. Tsao, and J.-W. Hung
IEEE International Conference on Applied System Innovation (ICASI), 2019

- 2019
Bone-conducted Speech Enhancement using Hierarchical Extreme Learning Machine
T. Hussain, Y. Tsao, S. M. Sinicalchi, J.-C. Wang, H.-M. Wang, and W.-H. Liao
International Workshop on Spoken Dialogue Systems Technology (IWSDS), 2019
2018

- 2018
An Abnormal Detection Strategy of Rotating Electric Machine based on Frequency Distribution
S.-C. Lin, Y. Tsao, S.-F. Su, Yennun Huang, and Z.-Q. Zhong
The 39th Symposium on Electrical Power Engineering, 2018

- 2018
Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement
R. E. Zezario, J.-W. Huang, X. Lu, Y. Tsao, H.-T. Hwang, and H.-M. Wang
APSIPA, 2018

- 2018
A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN)
Y.-T. Hsu, Y.-C. Lin, S.-W. Fu, Y. Tsao, and T.-W. Kuo
IEEE Spoken Language Technology (SLT), 2018

- 2018
Robustness against the channel effect in pathological voice detection
Y.-T. Hsu, Z. Zhu, C.-T. Wang, S.-H. Fang, F. Rudzicz, and Y. Tsao
NeurIPS 2018, Machine Learning for Health (ML4H) Workshop

- 2018
An Industrial IoT Analysis System Based on Machining Data of Metal Materials
S.-C. Lin, Y. Tsao, S.-F. Su, and Yennun Huang
International Conference on Fuzzy Theory and Its Applications, 2018

- 2018
Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders
W.-C. Huang, H.-T. Hwang, Y.-H. Peng, Y. Tsao, and H.-M. Wang
ISCSLP, 2018
Best Student Paper Award

- 2018
Speech Enhancement based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via Discrete Wavelet Transform
S.-k. Lee, S.-S. Wang, Y. Tsao, and J.-w. Hung
ISCSLP, 2018

- 2018
FIS-based Domestic Milling Machine PHM System Considering Multi-speed Frequency Variation
S.-C. Lin, C.-H. Su, Y. Tsao, S.-F. Su, H.-Y. Mark Liao, and Yennun Huang
IEEE International Conference on Advanced Manufacturing, 2018
Best Paper Award (獲推薦轉投SCI期刊, 擴充研究�?改中)

- 2018
A Supervised Learning Algorithm Considering Light Conditions for Visual Inspection of Metal Objects
H.-C. Li, S.-C. Lin, Y. Tsao, S.-F. Su, P.-L. Sun, and Yennun Huang
The 54th Annual Conference of Chinese Society for Quality 2018 International
Symposium of Quality Management
Makalot Industry-Academic Collaboration Award (獲推薦轉投EI期刊, 擴充研究�?改中)

- 2018
Exemplar-Based Spectral Detail Compensation for Voice Conversion
Y.-H. Peng, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang
Interspeech ,2018

- 2018
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM
S.-W. Fu, Y. Tsao, H.-T. Hwang, and H.-M. Wang
Interspeech ,2018

- 2018
Temporal Attentive Pooling for Acoustic Event Detection
X. Lu, P. Shen, S. Li, Y. Tsao, and H. Kawai
Interspeech, 2018

- 2018
Automatic Detection of Speech Under Cold Using Discriminative Autoencoders and Strength Modeling with Multiple Sub-Dictionary Generation
Y.-Y. Kao, H.-P. Hsu, C.-F. Liao, Y. Tsao, H.-C. Yang, J.-L. Li, C.-C. Lee,
H.-S. Lee, and H.-M. Wang
IEEE IWAENC ,2018

- 2018
Architecture Design of Convolutional Neural Networks for Face Detection on an FPGA Platform
S.-K. Lee, S.-S. Wang, Y. Tsao, and J.-W. Hung
IEEE SiPS ,2018

- 2018
Improving the Performance of Hearing Aids in Noisy Environments based on Deep Learning Technology
Y.-H. Lai, W.-Z. Zheng, S.-T. Tang, S.-H. Fang, W.-H. Liao, and Y. Tsao
IEEE Engineering in Medicine and Biology Society (EMBC), 2018

- 2018
A Novel LSTM-based Speech Preprocessor For Speaker Diarization in Realistic Mismatch Conditions
L. Sun, J. Du, T. Gao, Y.-D. Lu, Y. Tsao, C.-H. Lee, and N. Ryant
ICASSP, 2018


- 2018
Speech Dereverberation Based on Integrated Deep and Ensemble Learning Algorithm
W.-J. Lee, S.-S. Wang, F. Chen, X. Lu, S.-Y. Chien, and Y. Tsao
ICASSP, 2018
2017

- 2017
Computing Biodiversity Change via a Soundscape Monitoring Network
T.-H. Lin, Y.-H. Wang, S.-S. Lu, H.-W. Yen, and Y. Tsao
PNC, 2017

- 2017
Raw Waveform-based Speech Enhancement by Fully Convolutional Networks
S.-W. Fu, Y. Tsao, X. Lu, and H. Kawai
APSIPA, 2017

- 2017
A Deep Learning based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients in the Presence of Competing Speech Noise
S.-S. Wang, Y. Tsao, H.-L. S. Wang, Y.-H. Lai, and L. P.-H. Li
APSIPA, 2017

- 2017
Fast Locally Linear Embedding Algorithm for Exemplar-based Voice Conversion
Y.-H. Peng, C.-C. Hsu, Y.-C. Wu, H.-T. Hwang, Y.-W. Liu, Y. Tsao, and H.-M. Wang
APSIPA, 2017
Poster Presentation Award

- 2017
Complex Spectrogram Enhancement by Convolutional Neural Network with Multi-metrics Learning
S.-W. Fu, T.-y. Hu, Y. Tsao, and X. Lu
IEEE International Workshop on Machine Learning for Signal Processing (MLSP), 2017

- 2017
Deblending of Simultaneous-source Seismic Data via Periodicity-coded Nonnegative Matrix Factorization
T.-H. Lin and Y. Tsao
IEEE Dataport, 2017

- 2017
A Post-filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement
Y.-C. Wu, H.-T. Hwang, S.-S. Wang, C.-C. Hsu, Y. Tsao, and H.-M. Wang
Interspeech, 2017

- 2017
Wavelet Speech Enhancement Based on Robust Principal Component Analysis
C.-L. Wu, H.-P. Hsu, S.-S. Wang, J.-W. Hung, Y.-H. Lai, H.-M. Wang, and Y.Tsao
Interspeech, 2017

- 2017
Discriminative Autoencoders for Acoustic Modeling
M.-H. Yang, H.-S. Lee, Y.-D. Lu, K.-Y. Chen, Y. Tsao, B. Chen, and H.-M. Wang
Interspeech, 2017

- 2017
Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks
C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang
Interspeech, 2017

- 2017
Object-based on-line video summarization for internet of video things
S.-T. Lin, Y.-H. Liao, Y. Tsao, and S.-Y. Chien
IEEE International Symposium on Circuits & Systems (ISCAS), 2017

- 2017
Discriminative Autoencoders for Speaker Verification
H.-S. Lee, Y.-D. Lu, C.-C. Hsu, Y. Tsao, H.-M. Wang, and S.-K. Jeng
ICASSP, 2017

- 2017
A Locally Linear Embbeding Based Postfiltering Approach for Speech Enhancement
Y.-C. Wu, H.-T. Hwang, S.-S. Wang, C.-C. Hsu, Y.-H. Lai, Y. Tsao, and H.-M. Wang
ICASSP, 2017
2016

- 2016
Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder
C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao and H.-M. Wang
APSIPA, 2016

- 2016
Audio-Visual Speech Enhancement using Deep Neural Networks
J.-C. Hou, S.-S. Wang, Y.-H. Lai, J.-C. Lin, Y. Tsao, H.-W. Chang, and H.-M. Wang
APSIPA, 2016

- 2016
Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network
C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang
ISCSLP, 2016

- 2016
A Linear Regression Model with Dynamic Pulse Transit Time Features for Noninvasive Blood Pressure Prediction
Y.-Y. Hsieh, C.-D. Wu, Y. Tsao, and S.-S. Lu
Biomedical Circuits and Systems Conference (BioCAS), 2016

- 2016
Incorporating Local Environment Information with Ensemble Neural Networks to Robust Automatic Speech Recognition
C.-Y. Hsu, R. E. Zezario, J.-C. Wang, X. Lu, and Y. Tsao
ISCSLP, 2016

- 2016
Improving the Performance of Speech Perception in Noisy Environment based on a FAME Strategy
Y.-H. Lai, S.-S. Wang, Y.-T. Su, H.-C. Cheng, F. K. Fu, and Y. Tsao
ISCSLP, 2016

- 2016
Pair-wise Distance Metric Learning of Neural Network Model for Spoken Language Identification
X. Lu, P. Shen, Y. Tsao, and H. Kawai
Interspeech, 2016

- 2016
DCASE Report for Task 3: Sound Event Detection in Real Life Audio
Y.-H. Lai, C.-H. Wang, S.-Y. Hou, B.-Y. Chen, Y. Tsao, and Y.-W. Liu
Detection and Classification of Acoustic Scenes and Events (DCASE) workshop, 2016

- 2016
Locally Linear Embedding for Exemplar-Based Spectral Conversion
Y.-C. Wu, H.-T. Hwang, C.-C. Hsu, Y. Tsao, and H.-M. Wang
Interspeech, 2016

- 2016
Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation
H.-S. Lee, Y. Tsao, C.-C. Lee, H.-M. Wang, W.-C. Lin, W.-C. Chen, S.-W. Hsiao, S.-K. Jeng
Interspeech, 2016

- 2016
SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement
S.-W. Fu, Y. Tsao, and X. Lu
Interspeech, 2016

- 2016
Track-clustering Error Evaluation for Track-based Multi-camera Tracking System Employing Human Re-identification
C.-W. Wu, M.-T. Zhong, Y. Tsao, S.-W. Yang, Y.-K. Chen, and S.-Y. Chien
Computer Vision and Pattern Recognition (CVPR), 2016

- 2016
Nonnegative Matrix Factorization-based Frequency Lowering Technology for Mandarin-speaking Hearing Aid Users
Y.-T. Liu, Y. Tsao, and R.-Y. Chang
ICASSP, 2016

- 2016
Speech Enhancement via Ensemble Modeling NMF Adaptation
Jeremy Chiaming Yang, S.-S. Wang, Y. Tsao, and J.-W. Hung
IEEE International Conference on Consumer Electronics (ICCE), 2016

- 2016
Leveraging Nonnegative Matrix Factorization in Processing the Temporal Modulation Spectrum for Speech Enhancement
S.-S. Wang, Jeremy Chiaming Yang, Y. Tsao, and J.-W. Hung
IEEE International Conference on Consumer Electronics (ICCE), 2016

- 2016
Temporal Modulation Spectral Restoration for Robust Speech Recognition
S.-S. Wang and Y. Tsao
IEEE International Conference on Multimedia Big Data, 2016

- 2016
Improving the Performance of Noise Reduction in Hearing Aids Based on the Genetic Algorithm
Y.-H. Lai, C.-H. Chen, S.-T. Tang, Z.-M. Yeh, and Y. Tsao
IFMBE Proceedings 57, 2016
2015

- 2015
A Probabilistic Interpretation for Artificial Neural Network-based Voice Conversion
H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen
APSIPA, 2015

- 2015
A New Frequency Lowering Technique for Mandarin-Speaking Hearing Aid Users
Y.-T. Liu, R. Y. Chang, Y. Tsao, and Y.-P. Chang
IEEE Global Conference on Signal and Information Processing (GlobalSIP), 2015

- 2015
Improving Denoising Auto-encoder Based Speech Enhancement With the Speech Parameter Generation Algorithm
S.-S. Wang, H.-T. Hwang, Y.-H. Lai, Y. Tsao, X. Lu, H.-M. Wang, and B. Su
APSIPA, 2015

- 2015
Temporal Alignment for Deep Neural Networks
P. Lin, D.-C. Lyu, Y.-F. Chang, and Y. Tsao
IEEE Global Conference on Signal and Information Processing (GlobalSIP), 2015

- 2015
Speech Recognition with Temporal Neural Networks
P. Lin, D.-C. Lyu, Y.-F. Chang, and Y. Tsao
Interspeech, 2015

- 2015
Sparse Representation with Temporal Max-Smoothing for Acoustic Event Detection
X. Lu, P. Shen, Y. Tsao, C. Hori, and H. Kawai
Interspeech, 2015

- 2015
Temporal Information in Tone Recognition
P. Lin, S.-S. Wang, and Y. Tsao
IEEE International Conference on Consumer Electronics (ICCE), 2015

- 2015
Multimodal Arousal Rating using Unsupervised Fusion Technique
C. Ma, Y. Tsao, and C.-H. Lee
ICASSP, 2015

- 2015
A Discriminative Post-filter for Speech Enhancement in Hearing Aids
Y.-H. Lai, S.-S. Wang, P.-C. Li, and Y. Tsao
ICASSP, 2015
2014

- 2014
Robust Anchorperson Detection Based on Audio Streams using a Hybrid I-vector and DNN System
Y.-F. Chang, P. Lin, S.-H. Cheng, K.-H. Chan, Y.-C. Zeng, C.-W. Liao, W.-T. Chang,
Y.-C. Wang, and Y. Tsao
APSIPA, 2014

- 2014
Effect of Adaptive Envelope Compression in Simulated Electric Hearing in Reverberation
Y.-H. Lai, F. Chen, and Y. Tsao
ISIC, 2014

- 2014
A Transfer Probabilistic Collective Factorization Model to Handle Sparse Data in Collaborative Filtering
H. Jing, A.-C. Liang, S.-D. Lin, and Y. Tsao
ICDM, 2014

- 2014
Clustering-Based I-Vector Formulation for Speaker Recognition
H.-S. Lee, Y. Tsao, H.-M. Wang, and S.-K. Jen
Interspeech, 2014

- 2014
Spectral Patch Based Sparse Coding for Acoustic Event Detection
X. Lu, Y. Tsao, P. Shen, and C. Hori
ISCSLP, 2014

- 2014
Ensemble Modeling of Denoising Autoencoder for Speech Spectrum Restoration
X. Lu, Y. Tsao, S. Matsuda, and C. Hori
Interspeech, 2014

- 2014
Ensemble of Machine Learning Algorithms for Cognitive and Physical Speaker Load Detection
C. Ma, Y. Tsao, and C.-H. Lee
Interspeech, 2014

- 2014
Automatic Speech Recognition with Primarily Temporal Envelope Information
P. Lin, F. Chen, S.-S. Wang, Y. Tsao and Y.-H. Lai
Interspeech, 2014

- 2014
An Adaptive Envelope Compression Strategy for Speech Processing in Cochlear Implants
Y.- H. Lai, F. Chen, and Y. Tsao
Interspeech, 2014

- 2014
Acoustic Feature Conversion using a Polynomial based Feature Transferring Algorithm
S.-S. Wang, P. Lin, D.-C. Lyu, Y. Tsao, H.-T. Hwang, B. Su, and H.-M. Wang
ISCSLP, 2014

- 2014
Speaker Verification Using Kernel-Based Binary Classifiers with Binary Operation Derived Features
H.-S. Lee, Y. Tsao, Y.-F. Chang, H.-M. Wang, and S.-K. Jeng
ICASSP, 2014

- 2014
Sparse Representation Based on a Bag of Spectral Exemplars for Acoustic Event Detection
X. Lu, Y. Tsao, S. Matsuda, and C. Hori
ICASSP, 2014

- 2014
Speech Enhancement using Segmental Nonnegative Matrix Factorization
H.-T. Fan, J.-W. Hung, X. Lu, S.-S. Wang, and Y. Tsao
ICASSP, 2014
2013

- 2013
Semantic Naïve Bayes Classifier for Document Classification
H.-S. Lee, Y. Tsao, Y.-F. Chang, H.-M. Wang, and S.-K. Jeng
International Joint Conference on Natural Language Processing (IJCNLP), 2013

- 2013
Incorporating Global Variance in the Training Phase of GMM-based Voice Conversion
H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen
APSIPA, 2013

- 2013
Robust Wi-Fi Location Fingerprinting Against Device Diversity based on Spatial Mean Normalization
C.-H. Wang, T.-W. Kao, S.-H. Fang, Y. Tsao, L.-C. Kuo, S.-W. Kao, and N.-C. Lin
APSIPA, 2013

- 2013
Ensemble of Machine Learning and Acoustic Segment Model Techniques for Speech Emotion and Autism Spectrum Disorders Recognition
H.-Y. Lee, T.-Y. Hu, How Jing, Y.-F. Chang, Y. Tsao, Y.-C. Kao, and T.-L. Pao
Interspeech, 2013

- 2013
Recurrent Neural Network Based Language Model Personalization by Social Network Crowdsourcing
T.-H. Wen, Aaron Heidel, H.-y. Lee, Y. Tsao, and L.-S. Lee
Interspeech, 2013
Best Student Paper Award Nomination

- 2013
Alleviating the Over-Smoothing Problem in GMM-Based Voice Conversion with Discriminative Training
H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen
Interspeech, 2013

- 2013
An Investigation of Spectral Restoration Algorithms for Deep Neural Networks based Noise Robust Speech Recognition
B. Li, Y. Tsao, and Khe Chai Sim
Interspeech, 2013

- 2013
Speech enhancement based on deep denoising autoencoder
X. Lu, Y. Tsao, Shigeki Matsuda and Chiori Hori
Interspeech, 2013

- 2013
Evaluation of Generalized Maximum a Posteriori Spectral Amplitude (GMAPA) Speech Enhancement Algorithm in Hearing Aids
Y.-H. Lai, Y.-C. Su, Y. Tsao, S.-T. Young
ISCE, 2013

- 2013
Filtering on the Temporal Probability Sequence in Histogram Equalization for Robust Speech Recognition
S.-S. Wang, Y. Tsao, and J.-W. Hung
ICASSP, 2013

- 2013
Speech Enhancement using Generalized Maximum a Posteriori Spectral Amplitude Estimator
Y.-C. Su, Y. Tsao, J.-E. Wu, and F.-R. Jean
ICASSP, 2013

2012

- 2012
Exploring Mutual Information for GMM-Based Spectral Conversion
H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen
ISCSLP, 2012

- 2012
A Study on Cepstral Subband Normalization for Robust ASR
S.-S. Wang, J.-W. Hung, and Y. Tsao
ISCSLP, 2012

- 2012
Acoustic Space Partition based on Broad Phonetic Class for Ensemble Acoustic Modeling
X. Lu, Y. Tsao, S. Matsuda, C. Hori, and H. Kashioka
ISCSLP, 2012

- 2012
Discriminative Fuzzy Clustering Maximum a Posterior Linear Regression for Speaker Adaptation
T.-Y. Hu, Y. Tsao, and L.-S. Lee
Interspeech, 2012

- 2012
A Study of Mutual Information for GMM-Based Spectral Conversion
H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen
Interspeech, 2012

- 2012
A Linear Projection Approach to Environment Modeling for Robust Speech Recognition
Y. Tsao, C.-L. Huang, S. Matsuda, C. Hori, and H. Kashioka
ICASSP, 2012
2011

- 2011
Feature Normalization and Selection for Robust Speaker State Recognition
C.-L. Huang, Y. Tsao, and C. Hori
International Committee for Co-ordination and Standardisation of Speech Databases
(COCOSDA), 2011

- 2011
Incorporating Regional Information to Enhance MAP-based Stochastic Feature Compensation for Robust Speech Recognition
Y. Tsao, P. R. Dixon, C. Hori, and H. Kawai
Interspeech, 2011

- 2011
A Sampling-based Environment Population Projection Approach for Rapid Acoustic Model Adaptation
Y. Tsao, R. Isotani, H. Kawai, and S. Nakamura
ICASSP, 2011

- 2011
Increasing Discriminative Capability on Map-based Mapping Function Estimation for Acoustic Model Adaptation
Y. Tsao, R. Isotani, H. Kawai, and S. Nakamura
ICASSP, 2011
2010

- 2010
Shrinkage Model Adaptation in Automatic Speech Recognition
J. Li, Y. Tsao, and C.-H. Lee
Interspeech, 2010

- 2010
A Particle Filter Feature Compensation Approach to Robust Speech Recognition
A. Mushtaq, Y. Tsao, and C.-H. Lee
Interspeech, 2010

- 2010
An Acoustic Segment Model Approach to Incorporating Temporal Information into Speaker Modeling for Text-Independent Speaker Recognition
Y. Tsao, H. Sun, H. Li, and C.-H. Lee
ICASSP, 2010
2009

- 2009
MAP Estimation of Online Mapping Parameters in Ensemble Speaker and Speaking Environment Modeling
Y. Tsao, S. Matsuda, S. Nakamura, and C.-H. Lee
IEEE Automatic Speech Recognition and Understanding (ASRU), 2009

- 2009
Soft Margin Estimation on Improving Environment Structures for Ensemble Speaker and Speaking Environment Modeling
Y. Tsao, J. Li, C.-H. Lee, and S. Nakamura
International Universal Communication Symposium (IUCS), 2009

- 2009
A Study on Soft Margin Estimation of Linear Regression Parameters for Speaker Adaptation
S. Matsuda, Y. Tsao, J. Li, S. Nakamura, and C.-H. Lee
Interspeech, 2009

- 2009
Ensemble Speaker and Speaking Environment Modeling Approach with Advanced Online Estimation Process
Y. Tsao, J. Li, and C.-H. Lee
ICASSP, 2009
2008

- 2008
A Programmable Analog Radial-Basis-Function Based Classifier
S.-Y. Peng, Y. Tsao, P. E. Hasler, and D. V. Anderson
ICASSP, 2008

- 2008
Improving the Ensemble Speaker and Speaking Environment Modeling Approach by Enhancing the Precision of the Online Estimation Process
Y. Tsao and C.-H. Lee
Interspeech, 2008
2007

- 2007
Two Extensions to Ensemble Speaker and Speaking Environment Modeling for Robust Automatic Speech Recognition
Y. Tsao and C.-H. Lee
IEEE Automatic Speech Recognition and Understanding (ASRU), 2007

- 2007
Detection-based ASR In the Automatic Speech Attribute Transcription Project
I. Bromberg, Q. Fu, J. Hou, J. Li, C. Ma, B. Mattews, A. Moreno-Daniel, J. Morris,
S. M. Siniscalchi, Y. Tsao, and Y. Wang
Interspeech, 2007

- 2007
An Ensemble Modeling Approach to Joint Characterization of Speaker and Speaking Environments
Y. Tsao and C.-H. Lee
Interspeech, 2007
2006

- 2006
A Study on Detection Based Automatic Speech Recognition
C. Ma, Y. Tsao, and C.-H. Lee
Interspeech, 2006

- 2006
A Vector Space Approach to Environment Modeling for Robust Speech Recognition
Y. Tsao and C.-H. Lee
Interspeech, 2006
2005

- 2005
A Study on Separation between Acoustic Models and Its Applications
Y. Tsao, J. Li, and C.-H. Lee
Eurospeech, 2005

- 2005
A study on knowledge source integration for candidate rescoring in automatic speech recognition
J. Li, Yu. Tsao, and C.-H. Lee
ICASSP, 2005
2001

- 2001
Segmental Eigenvoice for Rapid Speaker Adaptation
Y. Tsao, S.-M. Lee, and L.-S. Lee
Eurospeech, 2001