Bio-ASP Lab

Conference Papers

2025

2025

QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions

S. Wang, W. Yu, X. Chen, X. Tian, J. Zhang, L. Lu, Y. Tsao, J. Yamagishi, Y. Wang, C. Zhang

ACL, 2025

2025

A Study on Speech Assessment with Visual Cues

S. Ahmed, R. E. Zezario, N. Saleem, A. Hussain, H.-M. Wang, and Y. Tsao

Interspeech, 2025

2025

Cross-modal Knowledge Transfer Learning as Graph Matching Based on Optimal Transport for ASR

X. Lu, P. Shen, Y. Tsao, H. Kawai

Interspeech, 2025

2025

VoiceNoNG: Robust High-Quality Speech Editing Model without Hallucinations

S.-F. Huang et al.

Interspeech, 2025

2025

ZSDEVC: Zero-Shot Diffusion-based Emotional Voice Conversion with Disentangled Mechanism

H.-H. Chou, Y.-S. Lin, C.-C. Sung, Y. Tsao, and C.-C. Lee

Interspeech, 2025

2025

Universal Speech Enhancement with Regression and Generative Mamba

R. Chao, R. Nasretdinov, Y.-C. F. Wang, A. Jukić, S.-W. Fu, and Y. Tsao

Interspeech, 2025

2025

A Comparative Study on Proactive and Passive Detection of Deepfake Speech

C.-H. Wu, W. Ge, X. Wang, J. Yamagishi, Y. Tsao, and H.-M. Wang

Interspeech, 2025

2025

Feature Importance across Domains for Improving Non-Intrusive Speech Intelligibility Prediction in Hearing Aids

R. E. Zezario, S. M. Siniscalchi, F. Chen, H.-M. Wang, and Y. Tsao

Interspeech, 2025

2025

Speech Enhancement with MAP-based Training for Robust ASR

Y.-J. Li, R. Chao, B. Su, and Y. Tsao

IEEE ICASSP 2025, 2025

2025

MSEMG: Surface Electromyography Denoising with a Mamba-based Efficient Network

Y.-T. Liu, K.-C. Wang, R. Chao, S. M. Siniscalchi, P.-C. Yeh, and Y. Tsao

IEEE ICASSP 2025, 2025

2025

Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement

W. Ren, H. Wu, Y.-C. Lin, X. Chen, R. Chao, K.-H. Hung, Y.-J. Li, W.-Y. Ting, H.-M. Wang, and Y. Tsao

IEEE ICASSP 2025, 2025

2025

Neural Variational Mode Decomposition and Its Application for ECG Denoising

D.-Y. Lu, J.-J. Ding, and Y. Tsao

IEEE ICASSP 2025, 2025

2025

A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models

R. E. Zezario, S. M. Siniscalchi, H.-M. Wang, and Y. Tsao

IEEE ICASSP 2025, 2025

2025

MSECG: Incorporating Mamba for Robust and Efficient ECG Super-Resolution

J. Lin, I Chiu, K.-C. Wang, K.-C. Liu, H.-M. Wang, P.-C. Yeh, and Y. Tsao

IEEE ICASSP 2025, 2025

2024

2024

DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset

J. Du, I-M. Lin, I-H. Chiu, X. Chen, H. Wu, W. Ren, Y. Tsao, H.-y. Lee, and J.-S. R. Jang

IEEE SLT 2024, 2024

2024

Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition

C.-H. H. Yang et al.

IEEE SLT 2024, 2024

2024

FLANEC: Exploring Flan-T5 for Post-ASR Error Correction

M. L. Quatra, V. M. Salerno, Y. Tsao, S. M. Siniscalchi

IEEE SLT 2024, 2024

2024

Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR

X. Lu, P. Shen, Y. Tsao, and H. Kawai

IEEE SLT 2024, 2024

2024

An Investigation of Incorporating Mamba for Speech Enhancement

R. Chao, W.-H. Cheng, M. L. Quatra, S. M. Siniscalchi, C.-H. H. Yang, S.-W. Fu, and Y. Tsao

IEEE SLT 2024, 2024

2024

The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction

W.-C. Huang, S.-W. Fu, E. Cooper, R. E. Zezario, T. Toda, H.-M. Wang, J. Yamagishi, and Y. Tsao

IEEE SLT 2024, 2024

2024

RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier

P.-Y. Huang, S.-W. Fu, and Y. Tsao

NeurIPS 2024, 2024

2024

Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits

S.-F. Huang, H.-C. Kuo, Z. Chen, X. Yang, C.-H. H. Yang, Y. Tsao, Y.-C. F. Wang, H.-y. Lee, and S.-W. Fu

IEEE SLT 2024, 2024

2024

MECG-E: Mamba-based ECG Enhancer for Baseline Wander Removal

K.-H. Hung, K.-C. Wang, K.-C. Liu, W.-L. Chen, X. Lu, Y. Tsao, and C.-W. Lin

IEEE BigData 2024, 2024

2024

SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models

C. Yin, T.-S. Chi, Y. Tsao, and H.-M. Wang

Interspeech, 2024

2024

Learnable Layer Selection and Model Fusion for Speech Self-Supervised Learning Models

S.-C. Chiu, C.-H. Wu, J.-K. Hsieh, Y. Tsao, and H.-M. Wang

Interspeech, 2024

2024

Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata

R. E. Zezario, F. Chen, C.-S. Fuh, H.-M. Wang, and Y. Tsao

Interspeech, 2024

2024

Bridging the Gap: Integrating Pre-trained Speech Enhancement and Recognition Models for Robust Speech Recognition

K.-C. Wang, Y.-J. Li, W.-L. Chen, Y.-W. Chen, Y.-C. Wang, P.-C. Yeh, C. Zhang, and Y. Tsao

IEEE EUSIPCO 2024, 2024

2024

A Study on Incorporating Whisper for Robust Speech Assessment

R. E. Zezario, Y.-W. Chen, S.-W. Fu, Y. Tsao, H.-M. Wang, C.-S. Fuh

IEEE ICME 2024, 2024 (Top Performance on Track 3 of the VoiceMOS Challenge 2023)

2024

Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech

S.-W. Fu, K.-H. Hung, Y. Tsao, and Y.-C. F. Wang

ICLR, 2024

2024

SDEMG: Score-based Diffusion Model For Surface Electromyographic Signal Denoising

Y.-T. Liu, K.-C. Wang, K.-C. Liu, S.-Y. Peng, and Y. Tsao

IEEE ICASSP, 2024

2024

Hierarchical Cross-modality Knowledge Transfer With Sinkhorn Attention For CTC-based ASR

X. Lu, P. Shen, Y. Tsao, and H. Kawai

IEEE ICASSP, 2024

2024

Scalable Ensemble-based Detection Method Against Adversarial Attacks For Speaker Verification

H. Wu, H.-C. Kuo, Y. Tsao, H.-y. Lee

IEEE ICASSP, 2024

2024

A Multi-task Evaluation Benchmark For Audio-visual Representation Models

Y. Tseng, L. Berry, Y.-T. Chen et al.

IEEE ICASSP, 2024

2024

Multi-task Pseudo-label Learning For Non-intrusive Speech Quality Assessment Model

R. E. Zezario, B.-R. B. Bai, C.-S. Fuh, H.-M. Wang, and Y. Tsao

IEEE ICASSP, 2024

2023

2023

Cross-modal alignment with optimal transport for CTC-based ASR

X. Lu, P. Shen, Y. Tsao, and H. Kawai

IEEE ASRU, 2023

2023

LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models

C.-C. Lee, H.-W. Chen, C.-S. Chen, H.-M. Wang, T.-T. Liu, and Y. Tsao

IEEE ASRU, 2023

2023

Study on the Correlation between Objective Evaluations and Subjective Speech Quality and Intelligibility

H.-T. Chiang, K.-H. Hung, S.-W. Fu, H.-C. Kuo, M.-H. Tsai, and Y. Tsao

IEEE ASRU, 2023

2023

The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains

E. Cooper, W.-C. Huang, Y. Tsao, H.-M. Wang, T. Toda, and J. Yamagishi

IEEE ASRU, 2023

2023

Inference and Denoise: Causal Inference-based Neural Speech Enhancement

T.-A. Hsieh, C.-H. H. Yang, P.-Y. Chen, S. M. Siniscalchi, Y. Tsao

IEEE MLSP, 2023

2023

IANS: Intelligibility-aware Null-steering Beamforming for Dual-Microphone Arrays

W.-Y. Ting, S.-S. Wang, Y. Tsao, and B. Su

IEEE MLSP, 2023

2023

Voice Direction-of-Arrival Conversion

I-C. Chern, S. Chern, H.-C. Kuo, H.-H. Tseng, K.-H. Hung, and Y. Tsao

IEEE MLSP, 2023

2023

Neural Model Reprogramming with Similarity Based Mapping for Low-Resource Spoken Command Recognition

H. Yen, P.-J. Ku, C.-H. H. Yang, H. Hu, S. M. Siniscalchi, P.-Y. Chen, and Y. Tsao

Interspeech, 2023

2023

Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion

Y.-L. Chien, H.-H. Chen, M.-C. Yen, S.-W. Tsai, H.-M. Wang, Y. Tsao, T.-S. Chi

Interspeech, 2023

2023

A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech

L.-W. Chen, Y.-F. Cheng, H.-S. Lee, Y. Tsao, and H.-M. Wang

Interspeech, 2023

2023

Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features

H.-H. Chen, Y.-L. Chien, M.-C. Yen, S.-W. Tsai, T.-S. Chi, Y. Tsao, and H.-M. Wang

Interspeech, 2023

2023

Multi-Task Learning U-Net for Functional Shoulder Sub-Task Segmentation

E.-P. Chu, K.-C. Liu, C.-Y. Hsieh, C.-Y. Chang, Y. Tsao, and C.-T. Chan

IEEE EMBC, 2023

2023

Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks

C.-P. Liu, J.-H. Li, E.-P. Chu, C.-Y. Hsieh, K.-C. Liu, C.-T. Chan, and Y. Tsao

IEEE MeMeA, 2023

2023

Wearable-based Pain Assessment in Patients with Adhesive Capsulitis Using Machine Learning

C.-H. Chen, K.-C. Liu, T.-Y. Lu, C.-Y. Chang, C.-T. Chan, and Y. Tsao

IEEE NER, 2023

2023

Towards Individualised Speech Enhancement: An SNR Preference learning System For Multi-modal Hearing Aids

J. Kirton-Wingate, S. Ahmed, M. Gogate, Y. Tsao, A. Hussain

IEEE ICASSP 2023 (AMHAT 2023 Workshop), 2023

2023

D4AM: A General Denoising Framework for Downstream Acoustic Models

C.-C. Lee, Y. Tsao, H.-M. Wang and C.-S. Chen

ICLR, 2023

2023

Pre-Impact Fall Detection via CNN-ViT Knowledge Distillation

T.-H. Chi, K.-C. Liu, C.-Y. Hsieh, Y. Tsao, and C.-T. Chan

IEEE ICASSP, 2023

2023

ECG Artifact Removal from Single-Channel Surface EMG Using Fully Convolutional Networks

K.-C. Wang, K.-C. Liu, S.-Y. Peng, Y. Tsao

IEEE ICASSP, 2023

2023

On the Robustness of Non-intrusive Speech Quality Model by Adversarial Examples

H.-Y. Lin, H.-H. Tseng, and Y. Tsao

IEEE ICASSP, 2023

2023

Audio-visual Speech Enhancement And Separation By Utilizing Multi-modal Self-supervised Embeddings

I-C. Chern, K.-H. Hung, Y.-T. Chen, T. Hussain, M. Gogate, A. Hussain, Y. Tsao, and J.-C. Hou

IEEE ICASSP 2023 (AMHAT 2023 Workshop), 2023

2023

T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5

C.-J. Hsu, H.-L. Chung, H.-y. Lee, and Y. Tsao

IEEE ICASSP, 2023

2023

Interpretations of Domain Adaptations via Layer Variational Analysis

H.-H. Tseng, H.-Y. Lin, H.-K. Hsuan and Y. Tsao

ICLR, 2023

2022

2022

Dysarthric Speech Enhancement Based on Convolution Neural Network

S.-S. Wang, Y. Tsao, W.-Z. Zheng, H.-W. Yeh, P.-C. Li, S.-H. Fang, Y.-H. Lai

IEEE EMBC, 2022

2022

A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning

T. Hussain, M. Diyan, M. Gogate, K. Dashtipour, A. Adeel, Y. Tsao, A. Hussain

IEEE EMBC, 2022

2022

NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling

C.-C. Lee, C.-H. Hu, Y.-C. Lin, C.-S. Chen, H.-M. Wang and Y. Tsao

Interspeech, 2022

2022

Boosting Self-Supervised Embeddings for Speech Enhancement

K.-H. Hung, S.-W. Fu, H.-H. Tseng, H.-T. Chiang, Y. Tsao, C.-W. Lin

Interspeech, 2022

2022

Perceptual Characteristics Based Multi-objective Model for Speech Enhancement

C.-J. Peng, Y.-J. Chan, Y.-L. Shen, C. Yu, Y. Tsao and T.-S. Chi

Interspeech, 2022

2022

ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding

Y.-J. Lu et al.

Interspeech, 2022

2022

🏆
MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids

R. E. Zezario, F. Chen, C.-S. Fuh, H.-M. Wang and Y. Tsao

Interspeech, 2022

2022

MTI-Net: A Multi-Target Speech Intelligibility Prediction Model

R. E. Zezario, S.-W. Fu, F. Chen, C.-S. Fuh, H.-M. Wang and Y. Tsao

Interspeech, 2022

2022

Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks

F.-L. Wang, H.-S. Lee, Y. Tsao and H.-M. Wang

Interspeech, 2022

2022

Perceptual Contrast Stretching on Target Feature for Speech Enhancement

R. Chao, C. Yu, S.-W. Fu, X. Lu and Y. Tsao

Interspeech, 2022

2022

The VoiceMOS Challenge 2022

W.-C. Huang, E. Cooper, Y. Tsao, H.-M. Wang, T. Toda and J. Yamagishi

Interspeech, 2022

2022

InQSS: a speech intelligibility and quality assessment model using a multi-task learning network

Y.-W. Chen and Y. Tsao

Interspeech, 2022

2022

OSSEM: one-shot speaker adaptive speech enhancement using meta learning

C. Yu, S.-W. Fu, T.-A. Hsieh, Y. Tsao and M. Ravanelli

Interspeech, 2022

2022

When BERT Meets Quantum Temporal Convolution Learning for Text Classification in Heterogeneous Computing

C.-H. H. Yang, J. Qi, S. Y.-C. Chen, Y. Tsao, and P.-Y. Chen

ICASSP, 2022

2022

XDBERT: Distilling Visual Information to BERT via Cross-Modal Encoders to Improve Language Understanding

C.-J. Hsu, H.-Y. Lee, Y. Tsao

ACL, 2022

2022

Partially Fake Audio Detection by Self-attention-based Fake Span Discovery

H. Wu, H.-C. Kuo, N. Zheng, K.-H. Hung, H.-Y. Lee, Y. Tsao, H.-M. Wang, and H. Meng

ICASSP, 2022

2022

Speech Recovery For Real-world Self-powered Intermittent Devices

Y.-C. Lin, T.-A. Hsieh, K.-H. Hung, C. Yu, H. Garudadri, Y. Tsao, and T.-W. Kuo

ICASSP, 2022

2022

EMGSE: Acoustic/EMG Fusion For Multimodal Speech Enhancement

K.-C. Wang, K.-C. Liu, H.-M. Wang, and Y. Tsao

ICASSP, 2022

2022

Conditional Diffusion Probabilistic Model For Speech Enhancement

Y.-J. Lu, Z.-Q. Wang, S. Watanabe, A. Richard, C. Yu, and Y. Tsao

ICASSP, 2022

2022

MetricGAN-U: Unsupervised Speech Enhancement/Dereverberation Based Only on Noisy/Reverberated Speech

S.-W. Fu, C. Yu, K.-H. Hung, M. Ravanelli, and Y. Tsao

ICASSP, 2022

2022

Analyzing The Robustness Of Unsupervised Speech Recognition

G.-T. Lin, C.-J. Hsu, D.-R. Liu, H.-Y. Lee, and Y. Tsao

ICASSP, 2022

2021

2021

Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport

H.-Y. Lin, H.-H. Tseng, X. Lu, and Y. Tsao

NeurIPS, 2021

2021

HASA-NET: A Non-Intrusive Hearing-Aid Speech Assessment Network

H.-T. Chiang, Y.-C. Wu, C. Yu, T. Toda, H.-M. Wang, Y.-C. Hu, and Y. Tsao

ASRU, 2021

2021

Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling

M.-C. Yen, W.-C. Huang, K. Kobayashi, Y.-H. Peng, S.-W. Tsai, Y. Tsao, T. Toda, J.-S. Jang, and H.-M. Wang

ASRU, 2021

2021

An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition

X. Chang, T. Maekaku, P. Guo, J. Shi, Y.-J. Lu, A. S. Subramanian, T. Wang, S.-w. Yang, Y. Tsao, H.-y. Lee, and S. Watanabe

ASRU, 2021

2021

Instrumented Shoulder Functional Assessment using Inertial Measurement Units for Frozen Shoulder

T.-Y. Lu, K.-C. Liu, C.-Y. Hsieh, C.-Y. Chang, Y. Tsao, C.-T. Chan

IEEE BHI, 2021

2021

Investigation of A Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-To-End Bengali Automatic Speech Recognition Under Unseen Noisy Conditions

M. E Noor, Y.-J. Lu, S.-S. Wang, S. Ghose, C.-Y. Chang, R. E. Zezario, S. Ahmed, W.-H. Chung, Y. Tsao and H.-M. Wang

Oriental COCOSDA, 2021

2021

MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder

Y.-J. Li, S.-S. Wang, Y. Tsao, and B. Su

APSIPA ASC, 2021

2021

A Study on Speech Enhancement Based on Diffusion Probabilistic Model

Y.-J. Lu, Y. Tsao, and S. Watanabe

APSIPA ASC, 2021

2021

Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice Conversion

Y.-S. Liou, W.-C. Huang, M.-C. Yen, S.-W. Tsai, Y.-H. Peng, T. Toda, Y. Tsao, and H.-M. Wang

APSIPA ASC, 2021

2021

Estimation and Correction of Relative Transfer Function for Binaural Speech Separation Networks to Preserve Spatial Cues

Z. Feng, Y. Tsao, and F. Chen

APSIPA ASC, 2021

2021

Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

X. Lu, P. Shen, Y. Tsao, and H. Kawai

APSIPA ASC, 2021

2021

Unsupervised neural adaptation model based on optimal transport for spoken language identification

X. Lu, P. Shen, Y. Tsao, and H. Kawai

ICASSP, 2021

2021

A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion

W.-C. Huang, K. Kobayashi, Y.-H. Peng, C.-F. Liu, Y. Tsao, H.-M. Wang, and T. Toda

Interspeech, 2021

2021

A Study of Incorporating Articulatory Movement Information in Speech Enhancement

Y.-W. Chen, K.-H. Hung, S.-Y. Chuang, J. Sherman, X. Lu, and Y. Tsao

EUSIPCO, 2021

2021

Speech Enhancement with Zero-Shot Model Selection

R. E. Zezario, C.-S. Fuh, H.-M. Wang, Y. Tsao

EUSIPCO, 2021

2021

One shot learning for speech separation

Y.-K. Wu, K.-P. Huang, Y. Tsao, and H.-Y. Lee

ICASSP, 2021

2021

QISTA-Net-Audio: Audio Super-resolution via Non-Convex Lq-norm Minimization

G.-X. Lin, S.-W. Hu, Y.-J. Lu, Y. Tsao, and C.-S. Lu

Interspeech, 2021

2021

MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement

S.-W. Fu, C. Yu, T.-A. Hsieh, P. Plantinga, M. Ravanelli, X. Lu, and Y. Tsao

Interspeech, 2021

2021

Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement

T.-A. Hsieh, C. Yu, S.-W. Fu, X. Lu, and Y. Tsao

Interspeech, 2021

2021

Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder

Y.-C. Wu, C.-H. Hu, H.-S. Lee, Y.-H. Peng, W.-C. Huang, Y. Tsao, H.-M. Wang, and T. Toda

Interspeech, 2021

2021

EMA2S: An End-to-End Multimodal Articulatory-to-Speech System

Y.-W. Chen, K.-H. Hung, S.-Y. Chuang, J. Sherman, W.-C. Huang, X. Lu, and Y. Tsao

ISCAS, 2021

2021

Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario

C.-J. Peng, Y.-J. Chan, C. Yu, S.-S. Wang, Y. Tsao, and T.-S. Chi

ISCAS, 2021

2021

MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration

Y.-T. Chang, Y.-H. Yang, Y.-H. Peng, S.-S. Wang, T.-S. Chi, Y. Tsao and H.-M. Wang

ISCSLP, 2021

2020

2020

Enhancing Intelligibility of Dysarthric Speech Using Gated Convolutional-based Voice Conversion System

C.-Y. Chen, W.-Z. Zheng, S.-S. Wang, Y. Tsao, P.-C. Li, and Y.-H. Lai

Interspeech, 2020

2020

Lite Audio-Visual Speech Enhancement

S.-Y. Chuang, Y. Tsao, C.-C. Lo, and H.-M. Wang

Interspeech, 2020

2020

STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model

R. E. Zezario, S.-W. Fu, C.-S. Fuh, Y. Tsao, and H.-M. Wang

APSIPA, 2020

2020

Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing

S.-W. Fu et al.

APSIPA, 2020

2020

SERIL: Noise Adaptive Speech Enhancement using Regularization-based Incremental Learning

C.-C. Lee, Y.-C. Lin, H.-T. Lin, H.-M. Wang, and Y. Tsao

Interspeech, 2020

2020

iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning

H. Li, S.-W. Fu, Y. Tsao, and J. Yamagishi

Interspeech, 2020

2020

Incorporating Broad Phonetic Information for Speech Enhancement

Y.-J. Lu, C.-F. Liao, X. Lu, J.-w. Hung, and Y. Tsao

Interspeech, 2020

2020

Self-supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement

R. E. Zezario, T. Hussain, X. Lu, H.-M. Wang, and Y. Tsao

ICASSP, 2020

2019

2019

Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature Enhancement

W.-C. Lin, Y. Tsao, F. Chen, and H.-M. Wang

APSIPA, 2019

2019

Compressed Multimodal Hierarchical Extreme Learning Machine for Speech Enhancement

T. Hussain, Y. Tsao, H.-M. Wang, J.-C. Wang, S. M. Siniscalchi, and W.-H. Liao

ICASSP, 2019

2019

Subjective Feedback-based Neural Network Pruning for Speech Enhancement

W.-C. Lin, Y. Tsao, F. Chen, and H.-M. Wang

APSIPA, 2019

2019

Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking

K.-Y. Liu, S.-S. Wang, Y. Tsao, and J.-w. Hung

ROCLING, 2019

2019

Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion

W.-C. Huang, Y.-C. Wu, K. Kobayashi, Y.-H. Peng, H.-T. Hwang, P. L. Tobing, Y. Tsao, H.-M. Wang, and T. Toda

ISCA SSW 10, 2019

2019

Speaker-aware Deep Denoising Autoencoder with Embedded Speaker Identity for Speech Enhancement

F.-K. Chuang, S.-S. Wang, J.-w. Hung, Y. Tsao, and S.-H. Fang

Interspeech, 2019

2019

IA-NET: Acceleration and Compression of Speech Enhancement using Integer-adder Deep Neural Network

Y.-C. Lin, Y.-T. Hsu, S.-W. Fu, Y. Tsao, and T.-W. Kuo

Interspeech, 2019

2019

MOSNet: Deep Learning-based Objective Assessment for Voice Conversion

C.-C. Lo, S.-W. Fu, W.-C. Huang, X. Wang, J. Yamagishi, Y. Tsao, and H.-M. Wang

Interspeech, 2019

2019

Class-wise Centroid Distance Metric Learning for Acoustic Event Detection

X. Lu, P. Shen, S. Li, Y. Tsao, and H. Kawai

Interspeech, 2019

2019

Incorporating Symbolic Sequential Modeling for Speech Enhancement

C.-F. Liao, Y. Tsao, X. Lu, and H. Kawai

Interspeech, 2019

with ISCA Travel Grant

2019

Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric

R. E. Zezario, S.-W. Fu, X. Lu, H.-M. Wang, and Y. Tsao

Interspeech, 2019

2019

Noise Adaptive Speech Enhancement using Domain Adversarial Training

C.-F. Liao, Y. Tsao, H.-y. Lee, and H.-M. Wang

Interspeech, 2019

with ISCA Travel Grant

2019

Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech

L.-W. Chen, H.-Y. Lee, and Y. Tsao

Interspeech, 2019

2019

Exploring the Encoder Layers of Discriminative Autoencoders for LVCSR

P.-T. Huang, H.-S. Lee, S.-S. Wang, K.-Y. Chen, Y. Tsao, and H.-M. Wang

Interspeech, 2019

with ISCA Travel Grant

2019

Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion

W.-C. Huang, Y.-C. Wu, H.-T. Hwang, P. L. Tobing, T. Hayashi, K. Kobayashi, T. Toda, Y. Tsao, and H.-M. Wang

European Signal Processing Conference (EUSIPCO), 2019

2019

Audio-Visual Speech Enhancement Using Hierarchical Extreme Learning Machine

T. Hussain, Y. Tsao, H.-M. Wang, J.-C. Wang, S. M. Siniscalchi, and W.-H. Liao

European Signal Processing Conference (EUSIPCO), 2019

2019

Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion

W.-C. Huang, Y.-C. Wu, C.-C. Lo, P. L. Tobing, T. Hayashi, K. Kobayashi, T. Toda, Y. Tsao, and H.-M. Wang

Interspeech, 2019

2019

MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement

S.-W. Fu, C.-F. Liao, Y. Tsao, and S.-D. Lin

ICML, 2019

2019

Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition

Y.-L. Shen, C.-Y. Huang, S.-S. Wang, Y. Tsao, H.-M. Wang, and T.-S. Chi

ICASSP, 2019

2019

Reducing noise and reverberation in speech signals via the integration of denoising autoencoder and temporal lowpass filtering

K.-Y. Liu, S.-k. Lee, S.-S. Wang, Y. Tsao, and J.-W. Hung

IEEE International Conference on Applied System Innovation (ICASI), 2019

2019

Bone-conducted Speech Enhancement using Hierarchical Extreme Learning Machine

T. Hussain, Y. Tsao, S. M. Siniscalchi, J.-C. Wang, H.-M. Wang, and W.-H. Liao

International Workshop on Spoken Dialogue Systems Technology (IWSDS), 2019

2018

2018

An Abnormal Detection Strategy of Rotating Electric Machine based on Frequency Distribution

S.-C. Lin, Y. Tsao, S.-F. Su, Yennun Huang, and Z.-Q. Zhong

The 39th Symposium on Electrical Power Engineering, 2018

2018

Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement

R. E. Zezario, J.-W. Huang, X. Lu, Y. Tsao, H.-T. Hwang, and H.-M. Wang

APSIPA, 2018

2018

A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN)

Y.-T. Hsu, Y.-C. Lin, S.-W. Fu, Y. Tsao, and T.-W. Kuo

IEEE Spoken Language Technology (SLT), 2018

2018

Robustness against the channel effect in pathological voice detection

Y.-T. Hsu, Z. Zhu, C.-T. Wang, S.-H. Fang, F. Rudzicz, and Y. Tsao

NeurIPS 2018, Machine Learning for Health (ML4H) Workshop

2018

An Industrial IoT Analysis System Based on Machining Data of Metal Materials

S.-C. Lin, Y. Tsao, S.-F. Su, and Yennun Huang

International Conference on Fuzzy Theory and Its Applications, 2018

2018

Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders

W.-C. Huang, H.-T. Hwang, Y.-H. Peng, Y. Tsao, and H.-M. Wang

ISCSLP, 2018

Best Student Paper Award

2018

Speech Enhancement based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via Discrete Wavelet Transform

S.-k. Lee, S.-S. Wang, Y. Tsao, and J.-w. Hung

ISCSLP, 2018

2018

FIS-based Domestic Milling Machine PHM System Considering Multi-speed Frequency Variation

S.-C. Lin, C.-H. Su, Y. Tsao, S.-F. Su, H.-Y. Mark Liao, and Yennun Huang

IEEE International Conference on Advanced Manufacturing, 2018

Best Paper Award (recommended for submission to an SCI journal; extended study under revision)

2018

A Supervised Learning Algorithm Considering Light Conditions for Visual Inspection of Metal Objects

H.-C. Li, S.-C. Lin, Y. Tsao, S.-F. Su, P.-L. Sun, and Yennun Huang

The 54th Annual Conference of Chinese Society for Quality 2018 International Symposium of Quality Management

Makalot Industry-Academic Collaboration Award (recommended for submission to an EI journal; extended study under revision)

2018

Exemplar-Based Spectral Detail Compensation for Voice Conversion

Y.-H. Peng, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang

Interspeech, 2018

2018

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM

S.-W. Fu, Y. Tsao, H.-T. Hwang, and H.-M. Wang

Interspeech, 2018

2018

Temporal Attentive Pooling for Acoustic Event Detection

X. Lu, P. Shen, S. Li, Y. Tsao, and H. Kawai

Interspeech, 2018

2018

Automatic Detection of Speech Under Cold Using Discriminative Autoencoders and Strength Modeling with Multiple Sub-Dictionary Generation

Y.-Y. Kao, H.-P. Hsu, C.-F. Liao, Y. Tsao, H.-C. Yang, J.-L. Li, C.-C. Lee, H.-S. Lee, and H.-M. Wang

IEEE IWAENC, 2018

2018

Architecture Design of Convolutional Neural Networks for Face Detection on an FPGA Platform

S.-K. Lee, S.-S. Wang, Y. Tsao, and J.-W. Hung

IEEE SiPS, 2018

2018

Improving the Performance of Hearing Aids in Noisy Environments based on Deep Learning Technology

Y.-H. Lai, W.-Z. Zheng, S.-T. Tang, S.-H. Fang, W.-H. Liao, and Y. Tsao

IEEE Engineering in Medicine and Biology Society (EMBC), 2018

2018

A Novel LSTM-based Speech Preprocessor For Speaker Diarization in Realistic Mismatch Conditions

L. Sun, J. Du, T. Gao, Y.-D. Lu, Y. Tsao, C.-H. Lee, and N. Ryant

ICASSP, 2018

2018

Enhancement and Analysis of Conversational Speech: JSALT 2017

N. Ryant et al.

ICASSP, 2018

2018

Speech Dereverberation Based on Integrated Deep and Ensemble Learning Algorithm

W.-J. Lee, S.-S. Wang, F. Chen, X. Lu, S.-Y. Chien, and Y. Tsao

ICASSP, 2018

2017

2017

Computing Biodiversity Change via a Soundscape Monitoring Network

T.-H. Lin, Y.-H. Wang, S.-S. Lu, H.-W. Yen, and Y. Tsao

PNC, 2017

2017

Raw Waveform-based Speech Enhancement by Fully Convolutional Networks

S.-W. Fu, Y. Tsao, X. Lu, and H. Kawai

APSIPA, 2017

2017

A Deep Learning based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients in the Presence of Competing Speech Noise

S.-S. Wang, Y. Tsao, H.-L. S. Wang, Y.-H. Lai, and L. P.-H. Li

APSIPA, 2017

2017

Fast Locally Linear Embedding Algorithm for Exemplar-based Voice Conversion

Y.-H. Peng, C.-C. Hsu, Y.-C. Wu, H.-T. Hwang, Y.-W. Liu, Y. Tsao, and H.-M. Wang

APSIPA, 2017

Poster Presentation Award

2017

Complex Spectrogram Enhancement by Convolutional Neural Network with Multi-metrics Learning

S.-W. Fu, T.-y. Hu, Y. Tsao, and X. Lu

IEEE International Workshop on Machine Learning for Signal Processing (MLSP), 2017

2017

Deblending of Simultaneous-source Seismic Data via Periodicity-coded Nonnegative Matrix Factorization

T.-H. Lin and Y. Tsao

IEEE Dataport, 2017

2017

A Post-filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement

Y.-C. Wu, H.-T. Hwang, S.-S. Wang, C.-C. Hsu, Y. Tsao, and H.-M. Wang

Interspeech, 2017

2017

Wavelet Speech Enhancement Based on Robust Principal Component Analysis

C.-L. Wu, H.-P. Hsu, S.-S. Wang, J.-W. Hung, Y.-H. Lai, H.-M. Wang, and Y. Tsao

Interspeech, 2017

2017

Discriminative Autoencoders for Acoustic Modeling

M.-H. Yang, H.-S. Lee, Y.-D. Lu, K.-Y. Chen, Y. Tsao, B. Chen, and H.-M. Wang

Interspeech, 2017

2017

Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks

C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang

Interspeech, 2017

2017

Object-based on-line video summarization for internet of video things

S.-T. Lin, Y.-H. Liao, Y. Tsao, and S.-Y. Chien

IEEE International Symposium on Circuits & Systems (ISCAS), 2017

2017

Discriminative Autoencoders for Speaker Verification

H.-S. Lee, Y.-D. Lu, C.-C. Hsu, Y. Tsao, H.-M. Wang, and S.-K. Jeng

ICASSP, 2017

2017

A Locally Linear Embedding Based Postfiltering Approach for Speech Enhancement

Y.-C. Wu, H.-T. Hwang, S.-S. Wang, C.-C. Hsu, Y.-H. Lai, Y. Tsao, and H.-M. Wang

ICASSP, 2017

2016

2016

Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder

C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao and H.-M. Wang

APSIPA, 2016

2016

Audio-Visual Speech Enhancement using Deep Neural Networks

J.-C. Hou, S.-S. Wang, Y.-H. Lai, J.-C. Lin, Y. Tsao, H.-W. Chang, and H.-M. Wang

APSIPA, 2016

2016

Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network

C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang

ISCSLP, 2016

2016

A Linear Regression Model with Dynamic Pulse Transit Time Features for Noninvasive Blood Pressure Prediction

Y.-Y. Hsieh, C.-D. Wu, Y. Tsao, and S.-S. Lu

Biomedical Circuits and Systems Conference (BioCAS), 2016

2016

Incorporating Local Environment Information with Ensemble Neural Networks to Robust Automatic Speech Recognition

C.-Y. Hsu, R. E. Zezario, J.-C. Wang, X. Lu, and Y. Tsao

ISCSLP, 2016

2016

Improving the Performance of Speech Perception in Noisy Environment based on a FAME Strategy

Y.-H. Lai, S.-S. Wang, Y.-T. Su, H.-C. Cheng, F. K. Fu, and Y. Tsao

ISCSLP, 2016

2016

Pair-wise Distance Metric Learning of Neural Network Model for Spoken Language Identification

X. Lu, P. Shen, Y. Tsao, and H. Kawai

Interspeech, 2016

2016

DCASE Report for Task 3: Sound Event Detection in Real Life Audio

Y.-H. Lai, C.-H. Wang, S.-Y. Hou, B.-Y. Chen, Y. Tsao, and Y.-W. Liu

Detection and Classification of Acoustic Scenes and Events (DCASE) workshop, 2016

2016

Locally Linear Embedding for Exemplar-Based Spectral Conversion

Y.-C. Wu, H.-T. Hwang, C.-C. Hsu, Y. Tsao, and H.-M. Wang

Interspeech, 2016

2016

Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation

H.-S. Lee, Y. Tsao, C.-C. Lee, H.-M. Wang, W.-C. Lin, W.-C. Chen, S.-W. Hsiao, S.-K. Jeng

Interspeech, 2016

2016

SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement

S.-W. Fu, Y. Tsao, and X. Lu

Interspeech, 2016

2016

Track-clustering Error Evaluation for Track-based Multi-camera Tracking System Employing Human Re-identification

C.-W. Wu, M.-T. Zhong, Y. Tsao, S.-W. Yang, Y.-K. Chen, and S.-Y. Chien

Computer Vision and Pattern Recognition (CVPR), 2016

2016

Nonnegative Matrix Factorization-based Frequency Lowering Technology for Mandarin-speaking Hearing Aid Users

Y.-T. Liu, Y. Tsao, and R.-Y. Chang

ICASSP, 2016

2016

Speech Enhancement via Ensemble Modeling NMF Adaptation

Jeremy Chiaming Yang, S.-S. Wang, Y. Tsao, and J.-W. Hung

IEEE International Conference on Consumer Electronics (ICCE), 2016

2016

Leveraging Nonnegative Matrix Factorization in Processing the Temporal Modulation Spectrum for Speech Enhancement

S.-S. Wang, Jeremy Chiaming Yang, Y. Tsao, and J.-W. Hung

IEEE International Conference on Consumer Electronics (ICCE), 2016

2016

Temporal Modulation Spectral Restoration for Robust Speech Recognition

S.-S. Wang and Y. Tsao

IEEE International Conference on Multimedia Big Data, 2016

2016

Improving the Performance of Noise Reduction in Hearing Aids Based on the Genetic Algorithm

Y.-H. Lai, C.-H. Chen, S.-T. Tang, Z.-M. Yeh, and Y. Tsao

IFMBE Proceedings 57, 2016

2015

2015

A Probabilistic Interpretation for Artificial Neural Network-based Voice Conversion

H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen

APSIPA, 2015

2015

A New Frequency Lowering Technique for Mandarin-Speaking Hearing Aid Users

Y.-T. Liu, R. Y. Chang, Y. Tsao, and Y.-P. Chang

IEEE Global Conference on Signal and Information Processing (GlobalSIP), 2015

2015

Improving Denoising Auto-encoder Based Speech Enhancement With the Speech Parameter Generation Algorithm

S.-S. Wang, H.-T. Hwang, Y.-H. Lai, Y. Tsao, X. Lu, H.-M. Wang, and B. Su

APSIPA, 2015

2015

Temporal Alignment for Deep Neural Networks

P. Lin, D.-C. Lyu, Y.-F. Chang, and Y. Tsao

IEEE Global Conference on Signal and Information Processing (GlobalSIP), 2015

2015

Speech Recognition with Temporal Neural Networks

P. Lin, D.-C. Lyu, Y.-F. Chang, and Y. Tsao

Interspeech, 2015

2015

Sparse Representation with Temporal Max-Smoothing for Acoustic Event Detection

X. Lu, P. Shen, Y. Tsao, C. Hori, and H. Kawai

Interspeech, 2015

2015

Temporal Information in Tone Recognition

P. Lin, S.-S. Wang, and Y. Tsao

IEEE International Conference on Consumer Electronics (ICCE), 2015

2015

Multimodal Arousal Rating using Unsupervised Fusion Technique

C. Ma, Y. Tsao, and C.-H. Lee

ICASSP, 2015

2015

A Discriminative Post-filter for Speech Enhancement in Hearing Aids

Y.-H. Lai, S.-S. Wang, P.-C. Li, and Y. Tsao

ICASSP, 2015

2014

2014

Robust Anchorperson Detection Based on Audio Streams using a Hybrid I-vector and DNN System

Y.-F. Chang, P. Lin, S.-H. Cheng, K.-H. Chan, Y.-C. Zeng, C.-W. Liao, W.-T. Chang, Y.-C. Wang, and Y. Tsao

APSIPA, 2014

2014

Effect of Adaptive Envelope Compression in Simulated Electric Hearing in Reverberation

Y.-H. Lai, F. Chen, and Y. Tsao

ISIC, 2014

2014

A Transfer Probabilistic Collective Factorization Model to Handle Sparse Data in Collaborative Filtering

H. Jing, A.-C. Liang, S.-D. Lin, and Y. Tsao

ICDM, 2014

2014

Clustering-Based I-Vector Formulation for Speaker Recognition

H.-S. Lee, Y. Tsao, H.-M. Wang, and S.-K. Jeng

Interspeech, 2014

2014

Spectral Patch Based Sparse Coding for Acoustic Event Detection

X. Lu, Y. Tsao, P. Shen, and C. Hori

ISCSLP, 2014

2014

Ensemble Modeling of Denoising Autoencoder for Speech Spectrum Restoration

X. Lu, Y. Tsao, S. Matsuda, and C. Hori

Interspeech, 2014

2014

Ensemble of Machine Learning Algorithms for Cognitive and Physical Speaker Load Detection

C. Ma, Y. Tsao, and C.-H. Lee

Interspeech, 2014

2014

Automatic Speech Recognition with Primarily Temporal Envelope Information

P. Lin, F. Chen, S.-S. Wang, Y. Tsao, and Y.-H. Lai

Interspeech, 2014

2014

An Adaptive Envelope Compression Strategy for Speech Processing in Cochlear Implants

Y.-H. Lai, F. Chen, and Y. Tsao

Interspeech, 2014

2014

Acoustic Feature Conversion using a Polynomial based Feature Transferring Algorithm

S.-S. Wang, P. Lin, D.-C. Lyu, Y. Tsao, H.-T. Hwang, B. Su, and H.-M. Wang

ISCSLP, 2014

2014

Speaker Verification Using Kernel-Based Binary Classifiers with Binary Operation Derived Features

H.-S. Lee, Y. Tsao, Y.-F. Chang, H.-M. Wang, and S.-K. Jeng

ICASSP, 2014

2014

Sparse Representation Based on a Bag of Spectral Exemplars for Acoustic Event Detection

X. Lu, Y. Tsao, S. Matsuda, and C. Hori

ICASSP, 2014

2014

Speech Enhancement using Segmental Nonnegative Matrix Factorization

H.-T. Fan, J.-W. Hung, X. Lu, S.-S. Wang, and Y. Tsao

ICASSP, 2014

2013

2013

Semantic Naïve Bayes Classifier for Document Classification

H.-S. Lee, Y. Tsao, Y.-F. Chang, H.-M. Wang, and S.-K. Jeng

International Joint Conference on Natural Language Processing (IJCNLP), 2013

2013

Incorporating Global Variance in the Training Phase of GMM-based Voice Conversion

H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen

APSIPA, 2013

2013

Robust Wi-Fi Location Fingerprinting Against Device Diversity based on Spatial Mean Normalization

C.-H. Wang, T.-W. Kao, S.-H. Fang, Y. Tsao, L.-C. Kuo, S.-W. Kao, and N.-C. Lin

APSIPA, 2013

2013

Ensemble of Machine Learning and Acoustic Segment Model Techniques for Speech Emotion and Autism Spectrum Disorders Recognition

H.-Y. Lee, T.-Y. Hu, H. Jing, Y.-F. Chang, Y. Tsao, Y.-C. Kao, and T.-L. Pao

Interspeech, 2013

2013

Recurrent Neural Network Based Language Model Personalization by Social Network Crowdsourcing

T.-H. Wen, Aaron Heidel, H.-y. Lee, Y. Tsao, and L.-S. Lee

Interspeech, 2013

Best Student Paper Award Nomination

2013

Alleviating the Over-Smoothing Problem in GMM-Based Voice Conversion with Discriminative Training

H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen

Interspeech, 2013

2013

An Investigation of Spectral Restoration Algorithms for Deep Neural Networks based Noise Robust Speech Recognition

B. Li, Y. Tsao, and K. C. Sim

Interspeech, 2013

2013

Speech Enhancement Based on Deep Denoising Autoencoder

X. Lu, Y. Tsao, S. Matsuda, and C. Hori

Interspeech, 2013

2013

Evaluation of Generalized Maximum a Posteriori Spectral Amplitude (GMAPA) Speech Enhancement Algorithm in Hearing Aids

Y.-H. Lai, Y.-C. Su, Y. Tsao, and S.-T. Young

ISCE, 2013

2013

Filtering on the Temporal Probability Sequence in Histogram Equalization for Robust Speech Recognition

S.-S. Wang, Y. Tsao, and J.-W. Hung

ICASSP, 2013

2013

Speech Enhancement using Generalized Maximum a Posteriori Spectral Amplitude Estimator

Y.-C. Su, Y. Tsao, J.-E. Wu, and F.-R. Jean

ICASSP, 2013

2013

Sparse Maximum Entropy Deep Belief Nets

H. Jing and Y. Tsao

IJCNN, 2013

2012

2012

Exploring Mutual Information for GMM-Based Spectral Conversion

H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen

ISCSLP, 2012

2012

A Study on Cepstral Subband Normalization for Robust ASR

S.-S. Wang, J.-W. Hung, and Y. Tsao

ISCSLP, 2012

2012

Acoustic Space Partition based on Broad Phonetic Class for Ensemble Acoustic Modeling

X. Lu, Y. Tsao, S. Matsuda, C. Hori, and H. Kashioka

ISCSLP, 2012

2012

Discriminative Fuzzy Clustering Maximum a Posterior Linear Regression for Speaker Adaptation

T.-Y. Hu, Y. Tsao, and L.-S. Lee

Interspeech, 2012

2012

A Study of Mutual Information for GMM-Based Spectral Conversion

H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen

Interspeech, 2012

2012

A Linear Projection Approach to Environment Modeling for Robust Speech Recognition

Y. Tsao, C.-L. Huang, S. Matsuda, C. Hori, and H. Kashioka

ICASSP, 2012

2011

2011

Feature Normalization and Selection for Robust Speaker State Recognition

C.-L. Huang, Y. Tsao, and C. Hori

International Committee for Co-ordination and Standardisation of Speech Databases (COCOSDA), 2011

2011

Incorporating Regional Information to Enhance MAP-based Stochastic Feature Compensation for Robust Speech Recognition

Y. Tsao, P. R. Dixon, C. Hori, and H. Kawai

Interspeech, 2011

2011

A Sampling-based Environment Population Projection Approach for Rapid Acoustic Model Adaptation

Y. Tsao, R. Isotani, H. Kawai, and S. Nakamura

ICASSP, 2011

2011

Increasing Discriminative Capability on MAP-based Mapping Function Estimation for Acoustic Model Adaptation

Y. Tsao, R. Isotani, H. Kawai, and S. Nakamura

ICASSP, 2011

2010

2010

Shrinkage Model Adaptation in Automatic Speech Recognition

J. Li, Y. Tsao, and C.-H. Lee

Interspeech, 2010

2010

A Particle Filter Feature Compensation Approach to Robust Speech Recognition

A. Mushtaq, Y. Tsao, and C.-H. Lee

Interspeech, 2010

2010

An Acoustic Segment Model Approach to Incorporating Temporal Information into Speaker Modeling for Text-Independent Speaker Recognition

Y. Tsao, H. Sun, H. Li, and C.-H. Lee

ICASSP, 2010

2009

2009

MAP Estimation of Online Mapping Parameters in Ensemble Speaker and Speaking Environment Modeling

Y. Tsao, S. Matsuda, S. Nakamura, and C.-H. Lee

IEEE Automatic Speech Recognition and Understanding (ASRU), 2009

2009

Soft Margin Estimation on Improving Environment Structures for Ensemble Speaker and Speaking Environment Modeling

Y. Tsao, J. Li, C.-H. Lee, and S. Nakamura

International Universal Communication Symposium (IUCS), 2009

2009

A Study on Soft Margin Estimation of Linear Regression Parameters for Speaker Adaptation

S. Matsuda, Y. Tsao, J. Li, S. Nakamura, and C.-H. Lee

Interspeech, 2009

2009

Ensemble Speaker and Speaking Environment Modeling Approach with Advanced Online Estimation Process

Y. Tsao, J. Li, and C.-H. Lee

ICASSP, 2009

2008

2008

A Programmable Analog Radial-Basis-Function Based Classifier

S.-Y. Peng, Y. Tsao, P. E. Hasler, and D. V. Anderson

ICASSP, 2008

2008

Improving the Ensemble Speaker and Speaking Environment Modeling Approach by Enhancing the Precision of the Online Estimation Process

Y. Tsao and C.-H. Lee

Interspeech, 2008

2007

2007

Two Extensions to Ensemble Speaker and Speaking Environment Modeling for Robust Automatic Speech Recognition

Y. Tsao and C.-H. Lee

IEEE Automatic Speech Recognition and Understanding (ASRU), 2007

2007

Detection-based ASR in the Automatic Speech Attribute Transcription Project

I. Bromberg, Q. Fu, J. Hou, J. Li, C. Ma, B. Matthews, A. Moreno-Daniel, J. Morris, S. M. Siniscalchi, Y. Tsao, and Y. Wang

Interspeech, 2007

2007

An Ensemble Modeling Approach to Joint Characterization of Speaker and Speaking Environments

Y. Tsao and C.-H. Lee

Interspeech, 2007

2006

2006

A Study on Detection Based Automatic Speech Recognition

C. Ma, Y. Tsao, and C.-H. Lee

Interspeech, 2006

2006

A Vector Space Approach to Environment Modeling for Robust Speech Recognition

Y. Tsao and C.-H. Lee

Interspeech, 2006

2005

2005

A Study on Separation between Acoustic Models and Its Applications

Y. Tsao, J. Li, and C.-H. Lee

Eurospeech, 2005

2005

A Study on Knowledge Source Integration for Candidate Rescoring in Automatic Speech Recognition

J. Li, Y. Tsao, and C.-H. Lee

ICASSP, 2005

2001

2001

Segmental Eigenvoice for Rapid Speaker Adaptation

Y. Tsao, S.-M. Lee, and L.-S. Lee

Eurospeech, 2001