Conference Papers
2024
- 2024
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech
S.-W. Fu, K.-H. Hung, Y. Tsao, and Y.-C. F. Wang
ICLR, 2024
- 2024
SDEMG: Score-based Diffusion Model For Surface Electromyographic Signal Denoising
Y.-T. Liu, K.-C. Wang, K.-C. Liu, S.-Y. Peng, and Y. Tsao
IEEE ICASSP, 2024
- 2024
Hierarchical Cross-modality Knowledge Transfer With Sinkhorn Attention For CTC-based ASR
X. Lu, P. Shen, Y. Tsao, and H. Kawai
IEEE ICASSP, 2024
- 2024
Scalable Ensemble-based Detection Method Against Adversarial Attacks For Speaker Verification
H. Wu, H.-C. Kuo, Y. Tsao, H.-y. Lee
IEEE ICASSP, 2024
- 2024
A Multi-task Evaluation Benchmark For Audio-visual Representation Models
Y. Tseng, L. Berry, Y.-T. Chen, et al.
IEEE ICASSP, 2024
- 2024
Multi-task Pseudo-label Learning For Non-intrusive Speech Quality Assessment Model
R. E. Zezario, B.-R. B. Bai, C.-S. Fuh, H.-M. Wang, and Y. Tsao
IEEE ICASSP, 2024
2023
- 2023
Cross-modal alignment with optimal transport for CTC-based ASR
X. Lu, P. Shen, Y. Tsao, and H. Kawai
IEEE ASRU, 2023
- 2023
LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models
C.-C. Lee, H.-W. Chen, C.-S. Chen, H.-M. Wang, T.-T. Liu, and Y. Tsao
IEEE ASRU, 2023
- 2023
Study on the Correlation between Objective Evaluations and Subjective Speech Quality and Intelligibility
H.-T. Chiang, K.-H. Hung, S.-W. Fu, H.-C. Kuo, M.-H. Tsai, and Y. Tsao
IEEE ASRU, 2023
- 2023
The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
E. Cooper, W.-C. Huang, Y. Tsao, H.-M. Wang, T. Toda, and J. Yamagishi
IEEE ASRU, 2023
- 2023
Inference and Denoise: Causal Inference-based Neural Speech Enhancement
T.-A. Hsieh, C.-H. H. Yang, P.-Y. Chen, S. M. Siniscalchi, and Y. Tsao
IEEE MLSP, 2023
- 2023
IANS: Intelligibility-aware Null-steering Beamforming for Dual-Microphone Arrays
W.-Y. Ting, S.-S. Wang, Y. Tsao, and B. Su
IEEE MLSP, 2023
- 2023
Voice Direction-of-Arrival Conversion
I-C. Chern, S. Chern, H.-C. Kuo, H.-H. Tseng, K.-H. Hung, and Y. Tsao
IEEE MLSP, 2023
- 2023
Neural Model Reprogramming with Similarity Based Mapping for Low-Resource Spoken Command Recognition
H. Yen, P.-J. Ku, C.-H. H. Yang, H. Hu, S. M. Siniscalchi, P.-Y. Chen, and Y. Tsao
Interspeech, 2023
- 2023
Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion
Y.-L. Chien, H.-H. Chen, M.-C. Yen, S.-W. Tsai, H.-M. Wang, Y. Tsao, T.-S. Chi
Interspeech, 2023
- 2023
A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech
L.-W. Chen, Y.-F. Cheng, H.-S. Lee, Y. Tsao, and H.-M. Wang
Interspeech, 2023
- 2023
Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features
H.-H. Chen, Y.-L. Chien, M.-C. Yen, S.-W. Tsai, T.-S. Chi, Y. Tsao, and H.-M. Wang
Interspeech, 2023
- 2023
Multi-Task Learning U-Net for Functional Shoulder Sub-Task Segmentation
E.-P. Chu, K.-C. Liu, C.-Y. Hsieh, C.-Y. Chang, Y. Tsao, and C.-T. Chan
IEEE EMBC, 2023
- 2023
Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks
C.-P. Liu, J.-H. Li, E.-P. Chu, C.-Y. Hsieh, K.-C. Liu, C.-T. Chan, and Y. Tsao
IEEE MeMeA, 2023
- 2023
Wearable-based Pain Assessment in Patients with Adhesive Capsulitis Using Machine Learning
C.-H. Chen, K.-C. Liu, T.-Y. Lu, C.-Y. Chang, C.-T. Chan, and Y. Tsao
IEEE NER, 2023
- 2023
Towards Individualised Speech Enhancement: An SNR Preference learning System For Multi-modal Hearing Aids
J. Kirton-Wingate, S. Ahmed, M. Gogate, Y. Tsao, A. Hussain
IEEE ICASSP 2023 (AMHAT 2023 Workshop), 2023
- 2023
D4AM: A General Denoising Framework for Downstream Acoustic Models
C.-C. Lee, Y. Tsao, H.-M. Wang and C.-S. Chen
ICLR, 2023
- 2023
Pre-Impact Fall Detection via CNN-ViT Knowledge Distillation
T.-H. Chi, K.-C. Liu, C.-Y. Hsieh, Y. Tsao, and C.-T. Chan
IEEE ICASSP, 2023
- 2023
ECG Artifact Removal from Single-Channel Surface EMG Using Fully Convolutional Networks
K.-C. Wang, K.-C. Liu, S.-Y. Peng, Y. Tsao
IEEE ICASSP, 2023
- 2023
On the Robustness of Non-intrusive Speech Quality Model by Adversarial Examples
H.-Y. Lin, H.-H. Tseng, and Y. Tsao
IEEE ICASSP, 2023
- 2023
Audio-visual Speech Enhancement And Separation By Utilizing Multi-modal Self-supervised Embeddings
I-C. Chern, K.-H. Hung, Y.-T. Chen, T. Hussain, M. Gogate, A. Hussain, Y. Tsao, and J.-C. Hou
IEEE ICASSP 2023 (AMHAT 2023 Workshop), 2023
- 2023
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
C.-J. Hsu, H.-L. Chung, H.-y. Lee, and Y. Tsao
IEEE ICASSP, 2023
- 2023
Interpretations of Domain Adaptations via Layer Variational Analysis
H.-H. Tseng, H.-Y. Lin, H.-K. Hsuan and Y. Tsao
ICLR, 2023
2022
- 2022
Dysarthric Speech Enhancement Based on Convolution Neural Network
S.-S. Wang, Y. Tsao, W.-Z. Zheng, H.-W. Yeh, P.-C. Li, S.-H. Fang, Y.-H. Lai
IEEE EMBC, 2022
- 2022
A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning
T. Hussain, M. Diyan, M. Gogate, K. Dashtipour, A. Adeel, Y. Tsao, A. Hussain
IEEE EMBC, 2022
- 2022
NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling
C.-C. Lee, C.-H. Hu, Y.-C. Lin, C.-S. Chen, H.-M. Wang and Y. Tsao
Interspeech, 2022
- 2022
Boosting Self-Supervised Embeddings for Speech Enhancement
K.-H. Hung, S.-W. Fu, H.-H. Tseng, H.-T. Chiang, Y. Tsao, C.-W. Lin
Interspeech, 2022
- 2022
Perceptual Characteristics Based Multi-objective Model for Speech Enhancement
C.-J. Peng, Y.-J. Chan, Y.-L. Shen, C. Yu, Y. Tsao and T.-S. Chi
Interspeech, 2022
- 2022
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
Y.-J. Lu et al.
Interspeech, 2022
- 2022
🏆
MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
R. E. Zezario, F. Chen, C.-S. Fuh, H.-M. Wang and Y. Tsao
Interspeech, 2022
- 2022
MTI-Net: A Multi-Target Speech Intelligibility Prediction Model
R. E. Zezario, S.-W. Fu, F. Chen, C.-S. Fuh, H.-M. Wang and Y. Tsao
Interspeech, 2022
- 2022
Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks
F.-L. Wang, H.-S. Lee, Y. Tsao and H.-M. Wang
Interspeech, 2022
- 2022
Perceptual Contrast Stretching on Target Feature for Speech Enhancement
R. Chao, C. Yu, S.-W. Fu, X. Lu and Y. Tsao
Interspeech, 2022
- 2022
The VoiceMOS Challenge 2022
W.-C. Huang, E. Cooper, Y. Tsao, H.-M. Wang, T. Toda and J. Yamagishi
Interspeech, 2022
- 2022
InQSS: a speech intelligibility and quality assessment model using a multi-task learning network
Y.-W. Chen and Y. Tsao
Interspeech, 2022
- 2022
OSSEM: one-shot speaker adaptive speech enhancement using meta learning
C. Yu, S.-W. Fu, T.-A. Hsieh, Y. Tsao and M. Ravanelli
Interspeech, 2022
- 2022
When BERT Meets Quantum Temporal Convolution Learning for Text Classification In Heterogeneous Computing
C.-H. H. Yang, J. Qi, S. Y.-C. Chen, Y. Tsao, and P.-Y. Chen
ICASSP, 2022
- 2022
XDBERT: Distilling Visual Information to BERT via Cross-Modal Encoders to Improve Language Understanding
C.-J. Hsu, H.-Y. Lee, Y. Tsao
ACL, 2022
- 2022
Partially Fake Audio Detection by Self-attention-based Fake Span Discovery
H. Wu, H.-C. Kuo, N. Zheng, K.-H. Hung, H.-Y. Lee, Y. Tsao, H.-M. Wang, and H. Meng
ICASSP, 2022
- 2022
Speech Recovery For Real-world Self-powered Intermittent Devices
Y.-C. Lin, T.-A. Hsieh, K.-H. Hung, C. Yu, H. Garudadri, Y. Tsao, and T.-W. Kuo
ICASSP, 2022
- 2022
EMGSE: Acoustic/EMG Fusion For Multimodal Speech Enhancement
K.-C. Wang, K.-C. Liu, H.-M. Wang, and Y. Tsao
ICASSP, 2022
- 2022
Conditional Diffusion Probabilistic Model For Speech Enhancement
Y.-J. Lu, Z.-Q. Wang, S. Watanabe, A. Richard, C. Yu, and Y. Tsao
ICASSP, 2022
- 2022
MetricGAN-U: Unsupervised Speech Enhancement/Dereverberation based Only On Noisy/Reverberated Speech
S.-W. Fu, C. Yu, K.-H. Hung, M. Ravanelli, and Y. Tsao
ICASSP, 2022
- 2022
Analyzing The Robustness Of Unsupervised Speech Recognition
G.-T. Lin, C.-J. Hsu, D.-R. Liu, H.-Y. Lee, and Y. Tsao
ICASSP, 2022
2021
- 2021
Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport
H.-Y. Lin, H.-H. Tseng, X. Lu, and Y. Tsao
NeurIPS, 2021
- 2021
HASA-NET: A Non-Intrusive Hearing-Aid Speech Assessment Network
H.-T. Chiang, Y.-C. Wu, C. Yu, T. Toda, H.-M. Wang, Y.-C. Hu, and Y. Tsao
ASRU, 2021
- 2021
Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling
M.-C. Yen, W.-C. Huang, K. Kobayashi, Y.-H. Peng, S.-W. Tsai, Y. Tsao, T. Toda, J.-S. Jang, and H.-M. Wang
ASRU, 2021
- 2021
An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
X. Chang, T. Maekaku, P. Guo, J. Shi, Y.-J. Lu, A. S. Subramanian, T. Wang, S.-w. Yang, Y. Tsao, H.-y. Lee, and S. Watanabe
ASRU, 2021
- 2021
Instrumented Shoulder Functional Assessment using Inertial Measurement Units for Frozen Shoulder
T.-Y. Lu, K.-C. Liu, C.-Y. Hsieh, C.-Y. Chang, Y. Tsao, C.-T. Chan
IEEE BHI, 2021
- 2021
Investigation of A Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-To-End Bengali Automatic Speech Recognition Under Unseen Noisy Conditions
M. E Noor, Y.-J. Lu, S.-S. Wang, S. Ghose, C.-Y. Chang, R. E. Zezario, S. Ahmed, W.-H. Chung, Y. Tsao and H.-M. Wang
Oriental COCOSDA, 2021
- 2021
MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder
Y.-J. Li, S.-S. Wang, Y. Tsao, and B. Su
APSIPA ASC, 2021
- 2021
A Study on Speech Enhancement Based on Diffusion Probabilistic Model
Y.-J. Lu, Y. Tsao, and S. Watanabe
APSIPA ASC, 2021
- 2021
Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice Conversion
Y.-S. Liou, W.-C. Huang, M.-C. Yen, S.-W. Tsai, Y.-H. Peng, T. Toda, Y. Tsao, and H.-M. Wang
APSIPA ASC, 2021
- 2021
Estimation and Correction of Relative Transfer Function for Binaural Speech Separation Networks to Preserve Spatial Cues
Z. Feng, Y. Tsao, and F. Chen
APSIPA ASC, 2021
- 2021
Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification
X. Lu, P. Shen, Y. Tsao, and H. Kawai
APSIPA ASC, 2021
- 2021
Unsupervised neural adaptation model based on optimal transport for spoken language identification
X. Lu, P. Shen, Y. Tsao, and H. Kawai
ICASSP, 2021
- 2021
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion
W.-C. Huang, K. Kobayashi, Y.-H. Peng, C.-F. Liu, Y. Tsao, H.-M. Wang, and T. Toda
Interspeech, 2021
- 2021
A Study of Incorporating Articulatory Movement Information in Speech Enhancement
Y.-W. Chen, K.-H. Hung, S.-Y. Chuang, J. Sherman, X. Lu, and Y. Tsao
EUSIPCO, 2021
- 2021
Speech Enhancement with Zero-Shot Model Selection
R. E. Zezario, C.-S. Fuh, H.-M. Wang, Y. Tsao
EUSIPCO, 2021
- 2021
One shot learning for speech separation
Y.-K. Wu, K.-P. Huang, Y. Tsao, and H.-Y. Lee
ICASSP, 2021
- 2021
QISTA-Net-Audio: Audio Super-resolution via Non-Convex Lq-norm Minimization
G.-X. Lin, S.-W. Hu, Y.-J. Lu, Y. Tsao, and C.-S. Lu
Interspeech, 2021
- 2021
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
S.-W. Fu, C. Yu, T.-A. Hsieh, P. Plantinga, M. Ravanelli, X. Lu, and Y. Tsao
Interspeech, 2021
- 2021
Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement
T.-A. Hsieh, C. Yu, S.-W. Fu, X. Lu, and Y. Tsao
Interspeech, 2021
- 2021
Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder
Y.-C. Wu, C.-H. Hu, H.-S. Lee, Y.-H. Peng, W.-C. Huang, Y. Tsao, H.-M. Wang, and T. Toda
Interspeech, 2021
- 2021
EMA2S: An End-to-End Multimodal Articulatory-to-Speech System
Y.-W. Chen, K.-H. Hung, S.-Y. Chuang, J. Sherman, W.-C. Huang, X. Lu, and Y. Tsao
ISCAS, 2021
- 2021
Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario
C.-J. Peng, Y.-J. Chan, C. Yu, S.-S. Wang, Y. Tsao, and T.-S. Chi
ISCAS, 2021
- 2021
MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration
Y.-T. Chang, Y.-H. Yang, Y.-H. Peng, S.-S. Wang, T.-S. Chi, Y. Tsao and H.-M. Wang
ISCSLP, 2021
2020
- 2020
Enhancing Intelligibility of Dysarthric Speech Using Gated Convolutional-based Voice Conversion System
C.-Y. Chen, W.-Z. Zheng, S.-S. Wang, Y. Tsao, P.-C. Li, and Y.-H. Lai
Interspeech, 2020
- 2020
Lite Audio-Visual Speech Enhancement
S.-Y. Chuang, Y. Tsao, C.-C. Lo, and H.-M. Wang
Interspeech, 2020
- 2020
STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model
R. E. Zezario, S.-W. Fu, C.-S. Fuh, Y. Tsao, and H.-M. Wang
APSIPA, 2020
- 2020
Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing
S.-W. Fu et al.
APSIPA, 2020
- 2020
SERIL: Noise Adaptive Speech Enhancement using Regularization-based Incremental Learning
C.-C. Lee, Y.-C. Lin, H.-T. Lin, H.-M. Wang, and Y. Tsao
Interspeech, 2020
- 2020
iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning
H. Li, S.-W. Fu, Y. Tsao, and J. Yamagishi
Interspeech, 2020
- 2020
Incorporating Broad Phonetic Information for Speech Enhancement
Y.-J. Lu, C.-F. Liao, X. Lu, J.-w. Hung, and Y. Tsao
Interspeech, 2020
- 2020
Self-supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement
R. E. Zezario, T. Hussain, X. Lu, H.-M. Wang, and Y. Tsao
ICASSP, 2020
2019
- 2019
Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature Enhancement
W.-C. Lin, Y. Tsao, F. Chen, and H.-M. Wang
APSIPA, 2019
- 2019
Compressed Multimodal Hierarchical Extreme Learning Machine for Speech Enhancement
T. Hussain, Y. Tsao, H.-M. Wang, J.-C. Wang, S. M. Siniscalchi, and W.-H. Liao
ICASSP, 2019
- 2019
Subjective Feedback-based Neural Network Pruning for Speech Enhancement
W.-C. Lin, Y. Tsao, F. Chen, and H.-M. Wang
APSIPA, 2019
- 2019
Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking
K.-Y. Liu, S.-S. Wang, Y. Tsao, and J.-w. Hung
ROCLING, 2019
- 2019
Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion
W.-C. Huang, Y.-C. Wu, K. Kobayashi, Y.-H. Peng, H.-T. Hwang, P. L. Tobing, Y. Tsao, H.-M. Wang, and T. Toda
ISCA SSW 10, 2019
- 2019
Speaker-aware Deep Denoising Autoencoder with Embedded Speaker Identity for Speech Enhancement
F.-K. Chuang, S.-S. Wang, J.-w. Hung, Y. Tsao, and S.-H. Fang
Interspeech, 2019
- 2019
IA-NET: Acceleration and Compression of Speech Enhancement using Integer-adder Deep Neural Network
Y.-C. Lin, Y.-T. Hsu, S.-W. Fu, Y. Tsao, and T.-W. Kuo
Interspeech, 2019
- 2019
MOSNet: Deep Learning-based Objective Assessment for Voice Conversion
C.-C. Lo, S.-w. Fu, W.-C. Huang, X. Wang, J. Yamagishi, Y. Tsao, and H.-M. Wang
Interspeech, 2019
- 2019
Class-wise Centroid Distance Metric Learning for Acoustic Event Detection
X. Lu, P. Shen, S. Li, Y. Tsao, and H. Kawai
Interspeech, 2019
- 2019
Incorporating Symbolic Sequential Modeling for Speech Enhancement
C.-F. Liao, Y. Tsao, X. Lu, and H. Kawai
Interspeech, 2019
with ISCA Travel Grant
- 2019
Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric
R. E. Zezario, S.-W. Fu, X. Lu, H.-M. Wang, and Y. Tsao
Interspeech, 2019
- 2019
Noise Adaptive Speech Enhancement using Domain Adversarial Training
C.-F. Liao, Y. Tsao, H.-y. Lee, and H.-M. Wang
Interspeech, 2019
with ISCA Travel Grant
- 2019
Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech
L.-W. Chen, H.-Y. Lee, and Y. Tsao
Interspeech, 2019
- 2019
Exploring the Encoder Layers of Discriminative Autoencoders for LVCSR
P.-T. Huang, H.-S. Lee, S.-S. Wang, K.-Y. Chen, Y. Tsao, and H.-M. Wang
Interspeech, 2019
with ISCA Travel Grant
- 2019
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion
W.-C. Huang, Y.-C. Wu, H.-T. Hwang, P. L. Tobing, T. Hayashi, K. Kobayashi, T. Toda, Y. Tsao, and H.-M. Wang
European Signal Processing Conference (EUSIPCO), 2019
- 2019
Audio-Visual Speech Enhancement Using Hierarchical Extreme Learning Machine
T. Hussain, Y. Tsao, H.-M. Wang, J.-C. Wang, S. M. Siniscalchi, and W.-H. Liao
European Signal Processing Conference (EUSIPCO), 2019
- 2019
Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion
W.-C. Huang, Y.-C. Wu, C.-C. Lo, P. L. Tobing, T. Hayashi, K. Kobayashi, T. Toda, Y. Tsao, and H.-M. Wang
Interspeech, 2019
- 2019
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement
S.-W. Fu, C.-F. Liao, Y. Tsao, and S.-D. Lin
ICML, 2019
- 2019
Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition
Y.-L. Shen, C.-Y. Huang, S.-S. Wang, Y. Tsao, H.-M. Wang, and T.-S. Chi
ICASSP, 2019
- 2019
Reducing noise and reverberation in speech signals via the integration of denoising autoencoder and temporal lowpass filtering
K.-Y. Liu, S.-k. Lee, S.-S. Wang, Y. Tsao, and J.-W. Hung
IEEE International Conference on Applied System Innovation (ICASI), 2019
- 2019
Bone-conducted Speech Enhancement using Hierarchical Extreme Learning Machine
T. Hussain, Y. Tsao, S. M. Siniscalchi, J.-C. Wang, H.-M. Wang, and W.-H. Liao
International Workshop on Spoken Dialogue Systems Technology (IWSDS), 2019
2018
- 2018
An Abnormal Detection Strategy of Rotating Electric Machine based on Frequency Distribution
S.-C. Lin, Y. Tsao, S.-F. Su, Yennun Huang, and Z.-Q. Zhong
The 39th Symposium on Electrical Power Engineering, 2018
- 2018
Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement
R. E. Zezario, J.-W. Huang, X. Lu, Y. Tsao, H.-T. Hwang, and H.-M. Wang
APSIPA, 2018
- 2018
A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN)
Y.-T. Hsu, Y.-C. Lin, S.-W. Fu, Y. Tsao, and T.-W. Kuo
IEEE Spoken Language Technology (SLT), 2018
- 2018
Robustness against the channel effect in pathological voice detection
Y.-T. Hsu, Z. Zhu, C.-T. Wang, S.-H. Fang, F. Rudzicz, and Y. Tsao
NeurIPS 2018, Machine Learning for Health (ML4H) Workshop
- 2018
An Industrial IoT Analysis System Based on Machining Data of Metal Materials
S.-C. Lin, Y. Tsao, S.-F. Su, and Yennun Huang
International Conference on Fuzzy Theory and Its Applications, 2018
- 2018
Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders
W.-C. Huang, H.-T. Hwang, Y.-H. Peng, Y. Tsao, and H.-M. Wang
ISCSLP, 2018
Best Student Paper Award
- 2018
Speech Enhancement based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via Discrete Wavelet Transform
S.-k. Lee, S.-S. Wang, Y. Tsao, and J.-w. Hung
ISCSLP, 2018
- 2018
FIS-based Domestic Milling Machine PHM System Considering Multi-speed Frequency Variation
S.-C. Lin, C.-H. Su, Y. Tsao, S.-F. Su, H.-Y. Mark Liao, and Yennun Huang
IEEE International Conference on Advanced Manufacturing, 2018
Best Paper Award (recommended for submission to an SCI journal; extended study under revision)
- 2018
A Supervised Learning Algorithm Considering Light Conditions for Visual Inspection of Metal Objects
H.-C. Li, S.-C. Lin, Y. Tsao, S.-F. Su, P.-L. Sun, and Yennun Huang
The 54th Annual Conference of Chinese Society for Quality 2018 International Symposium of Quality Management
Makalot Industry-Academic Collaboration Award (recommended for submission to an EI journal; extended study under revision)
- 2018
Exemplar-Based Spectral Detail Compensation for Voice Conversion
Y.-H. Peng, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang
Interspeech, 2018
- 2018
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM
S.-W. Fu, Y. Tsao, H.-T. Hwang, and H.-M. Wang
Interspeech, 2018
- 2018
Temporal Attentive Pooling for Acoustic Event Detection
X. Lu, P. Shen, S. Li, Y. Tsao, and H. Kawai
Interspeech, 2018
- 2018
Automatic Detection of Speech Under Cold Using Discriminative Autoencoders and Strength Modeling with Multiple Sub-Dictionary Generation
Y.-Y. Kao, H.-P. Hsu, C.-F. Liao, Y. Tsao, H.-C. Yang, J.-L. Li, C.-C. Lee, H.-S. Lee, and H.-M. Wang
IEEE IWAENC, 2018
- 2018
Architecture Design of Convolutional Neural Networks for Face Detection on an FPGA Platform
S.-K. Lee, S.-S. Wang, Y. Tsao, and J.-W. Hung
IEEE SiPS, 2018
- 2018
Improving the Performance of Hearing Aids in Noisy Environments based on Deep Learning Technology
Y.-H. Lai, W.-Z. Zheng, S.-T. Tang, S.-H. Fang, W.-H. Liao, and Y. Tsao
IEEE Engineering in Medicine and Biology Society (EMBC), 2018
- 2018
A Novel LSTM-based Speech Preprocessor For Speaker Diarization in Realistic Mismatch Conditions
L. Sun, J. Du, T. Gao, Y.-D. Lu, Y. Tsao, C.-H. Lee, and N. Ryant
ICASSP, 2018
- 2018
Speech Dereverberation Based on Integrated Deep and Ensemble Learning Algorithm
W.-J. Lee, S.-S. Wang, F. Chen, X. Lu, S.-Y. Chien, and Y. Tsao
ICASSP, 2018
2017
- 2017
Computing Biodiversity Change via a Soundscape Monitoring Network
T.-H. Lin, Y.-H. Wang, S.-S. Lu, H.-W. Yen, and Y. Tsao
PNC, 2017
- 2017
Raw Waveform-based Speech Enhancement by Fully Convolutional Networks
S.-W. Fu, Y. Tsao, X. Lu, and H. Kawai
APSIPA, 2017
- 2017
A Deep Learning based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients in the Presence of Competing Speech Noise
S.-S. Wang, Y. Tsao, H.-L. S. Wang, Y.-H. Lai, and L. P.-H. Li
APSIPA, 2017
- 2017
Fast Locally Linear Embedding Algorithm for Exemplar-based Voice Conversion
Y.-H. Peng, C.-C. Hsu, Y.-C. Wu, H.-T. Hwang, Y.-W. Liu, Y. Tsao, and H.-M. Wang
APSIPA, 2017
Poster Presentation Award
- 2017
Complex Spectrogram Enhancement by Convolutional Neural Network with Multi-metrics Learning
S.-W. Fu, T.-y. Hu, Y. Tsao, and X. Lu
IEEE International Workshop on Machine Learning for Signal Processing (MLSP), 2017
- 2017
Deblending of Simultaneous-source Seismic Data via Periodicity-coded Nonnegative Matrix Factorization
T.-H. Lin and Y. Tsao
IEEE Dataport, 2017
- 2017
A Post-filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement
Y.-C. Wu, H.-T. Hwang, S.-S. Wang, C.-C. Hsu, Y. Tsao, and H.-M. Wang
Interspeech, 2017
- 2017
Wavelet Speech Enhancement Based on Robust Principal Component Analysis
C.-L. Wu, H.-P. Hsu, S.-S. Wang, J.-W. Hung, Y.-H. Lai, H.-M. Wang, and Y. Tsao
Interspeech, 2017
- 2017
Discriminative Autoencoders for Acoustic Modeling
M.-H. Yang, H.-S. Lee, Y.-D. Lu, K.-Y. Chen, Y. Tsao, B. Chen, and H.-M. Wang
Interspeech, 2017
- 2017
Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks
C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang
Interspeech, 2017
- 2017
Object-based on-line video summarization for internet of video things
S.-T. Lin, Y.-H. Liao, Y. Tsao, and S.-Y. Chien
IEEE International Symposium on Circuits & Systems (ISCAS), 2017
- 2017
Discriminative Autoencoders for Speaker Verification
H.-S. Lee, Y.-D. Lu, C.-C. Hsu, Y. Tsao, H.-M. Wang, and S.-K. Jeng
ICASSP, 2017
- 2017
A Locally Linear Embedding Based Postfiltering Approach for Speech Enhancement
Y.-C. Wu, H.-T. Hwang, S.-S. Wang, C.-C. Hsu, Y.-H. Lai, Y. Tsao, and H.-M. Wang
ICASSP, 2017
2016
- 2016
Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder
C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao and H.-M. Wang
APSIPA, 2016
- 2016
Audio-Visual Speech Enhancement using Deep Neural Networks
J.-C. Hou, S.-S. Wang, Y.-H. Lai, J.-C. Lin, Y. Tsao, H.-W. Chang, and H.-M. Wang
APSIPA, 2016
- 2016
Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network
C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang
ISCSLP, 2016
- 2016
A Linear Regression Model with Dynamic Pulse Transit Time Features for Noninvasive Blood Pressure Prediction
Y.-Y. Hsieh, C.-D. Wu, Y. Tsao, and S.-S. Lu
Biomedical Circuits and Systems Conference (BioCAS), 2016
- 2016
Incorporating Local Environment Information with Ensemble Neural Networks to Robust Automatic Speech Recognition
C.-Y. Hsu, R. E. Zezario, J.-C. Wang, X. Lu, and Y. Tsao
ISCSLP, 2016
- 2016
Improving the Performance of Speech Perception in Noisy Environment based on a FAME Strategy
Y.-H. Lai, S.-S. Wang, Y.-T. Su, H.-C. Cheng, F. K. Fu, and Y. Tsao
ISCSLP, 2016
- 2016
Pair-wise Distance Metric Learning of Neural Network Model for Spoken Language Identification
X. Lu, P. Shen, Y. Tsao, and H. Kawai
Interspeech, 2016
- 2016
DCASE Report for Task 3: Sound Event Detection in Real Life Audio
Y.-H. Lai, C.-H. Wang, S.-Y. Hou, B.-Y. Chen, Y. Tsao, and Y.-W. Liu
Detection and Classification of Acoustic Scenes and Events (DCASE) workshop, 2016
- 2016
Locally Linear Embedding for Exemplar-Based Spectral Conversion
Y.-C. Wu, H.-T. Hwang, C.-C. Hsu, Y. Tsao, and H.-M. Wang
Interspeech, 2016
- 2016
Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation
H.-S. Lee, Y. Tsao, C.-C. Lee, H.-M. Wang, W.-C. Lin, W.-C. Chen, S.-W. Hsiao, S.-K. Jeng
Interspeech, 2016
- 2016
SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement
S.-W. Fu, Y. Tsao, and X. Lu
Interspeech, 2016
- 2016
Track-clustering Error Evaluation for Track-based Multi-camera Tracking System Employing Human Re-identification
C.-W. Wu, M.-T. Zhong, Y. Tsao, S.-W. Yang, Y.-K. Chen, and S.-Y. Chien
Computer Vision and Pattern Recognition (CVPR), 2016
- 2016
Nonnegative Matrix Factorization-based Frequency Lowering Technology for Mandarin-speaking Hearing Aid Users
Y.-T. Liu, Y. Tsao, and R.-Y. Chang
ICASSP, 2016
- 2016
Speech Enhancement via Ensemble Modeling NMF Adaptation
Jeremy Chiaming Yang, S.-S. Wang, Y. Tsao, and J.-W. Hung
IEEE International Conference on Consumer Electronics (ICCE), 2016
- 2016
Leveraging Nonnegative Matrix Factorization in Processing the Temporal Modulation Spectrum for Speech Enhancement
S.-S. Wang, Jeremy Chiaming Yang, Y. Tsao, and J.-W. Hung
IEEE International Conference on Consumer Electronics (ICCE), 2016
- 2016
Temporal Modulation Spectral Restoration for Robust Speech Recognition
S.-S. Wang and Y. Tsao
IEEE International Conference on Multimedia Big Data, 2016
- 2016
Improving the Performance of Noise Reduction in Hearing Aids Based on the Genetic Algorithm
Y.-H. Lai, C.-H. Chen, S.-T. Tang, Z.-M. Yeh, and Y. Tsao
IFMBE Proceedings 57, 2016
2015
- 2015
A Probabilistic Interpretation for Artificial Neural Network-based Voice Conversion
H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen
APSIPA, 2015
- 2015
A New Frequency Lowering Technique for Mandarin-Speaking Hearing Aid Users
Y.-T. Liu, R. Y. Chang, Y. Tsao, and Y.-P. Chang
IEEE Global Conference on Signal and Information Processing (GlobalSIP), 2015
- 2015
Improving Denoising Auto-encoder Based Speech Enhancement With the Speech Parameter Generation Algorithm
S.-S. Wang, H.-T. Hwang, Y.-H. Lai, Y. Tsao, X. Lu, H.-M. Wang, and B. Su
APSIPA, 2015
- 2015
Temporal Alignment for Deep Neural Networks
P. Lin, D.-C. Lyu, Y.-F. Chang, and Y. Tsao
IEEE Global Conference on Signal and Information Processing (GlobalSIP), 2015
- 2015
Speech Recognition with Temporal Neural Networks
P. Lin, D.-C. Lyu, Y.-F. Chang, and Y. Tsao
Interspeech, 2015
- 2015
Sparse Representation with Temporal Max-Smoothing for Acoustic Event Detection
X. Lu, P. Shen, Y. Tsao, C. Hori, and H. Kawai
Interspeech, 2015
- 2015
Temporal Information in Tone Recognition
P. Lin, S.-S. Wang, and Y. Tsao
IEEE International Conference on Consumer Electronics (ICCE), 2015
- 2015
Multimodal Arousal Rating using Unsupervised Fusion Technique
C. Ma, Y. Tsao, and C.-H. Lee
ICASSP, 2015
- 2015
A Discriminative Post-filter for Speech Enhancement in Hearing Aids
Y.-H. Lai, S.-S. Wang, P.-C. Li, and Y. Tsao
ICASSP, 2015
2014
- 2014
Robust Anchorperson Detection Based on Audio Streams using a Hybrid I-vector and DNN System
Y.-F. Chang, P. Lin, S.-H. Cheng, K.-H. Chan, Y.-C. Zeng, C.-W. Liao, W.-T. Chang, Y.-C. Wang, and Y. Tsao
APSIPA, 2014
- 2014
Effect of Adaptive Envelope Compression in Simulated Electric Hearing in Reverberation
Y.-H. Lai, F. Chen, and Y. Tsao
ISIC, 2014
- 2014
A Transfer Probabilistic Collective Factorization Model to Handle Sparse Data in Collaborative Filtering
H. Jing, A.-C. Liang, S.-D. Lin, and Y. Tsao
ICDM, 2014
- 2014
Clustering-Based I-Vector Formulation for Speaker Recognition
H.-S. Lee, Y. Tsao, H.-M. Wang, and S.-K. Jeng
Interspeech, 2014
- 2014
Spectral Patch Based Sparse Coding for Acoustic Event Detection
X. Lu, Y. Tsao, P. Shen, and C. Hori
ISCSLP, 2014
- 2014
Ensemble Modeling of Denoising Autoencoder for Speech Spectrum Restoration
X. Lu, Y. Tsao, S. Matsuda, and C. Hori
Interspeech, 2014
- 2014
Ensemble of Machine Learning Algorithms for Cognitive and Physical Speaker Load Detection
C. Ma, Y. Tsao, and C.-H. Lee
Interspeech, 2014
- 2014
Automatic Speech Recognition with Primarily Temporal Envelope Information
P. Lin, F. Chen, S.-S. Wang, Y. Tsao and Y.-H. Lai
Interspeech, 2014
- 2014
An Adaptive Envelope Compression Strategy for Speech Processing in Cochlear Implants
Y.-H. Lai, F. Chen, and Y. Tsao
Interspeech, 2014
- 2014
Acoustic Feature Conversion using a Polynomial based Feature Transferring Algorithm
S.-S. Wang, P. Lin, D.-C. Lyu, Y. Tsao, H.-T. Hwang, B. Su, and H.-M. Wang
ISCSLP, 2014
- 2014
Speaker Verification Using Kernel-Based Binary Classifiers with Binary Operation Derived Features
H.-S. Lee, Y. Tsao, Y.-F. Chang, H.-M. Wang, and S.-K. Jeng
ICASSP, 2014
- 2014
Sparse Representation Based on a Bag of Spectral Exemplars for Acoustic Event Detection
X. Lu, Y. Tsao, S. Matsuda, and C. Hori
ICASSP, 2014
- 2014
Speech Enhancement using Segmental Nonnegative Matrix Factorization
H.-T. Fan, J.-W. Hung, X. Lu, S.-S. Wang, and Y. Tsao
ICASSP, 2014
2013
- 2013
Semantic Naïve Bayes Classifier for Document Classification
H.-S. Lee, Y. Tsao, Y.-F. Chang, H.-M. Wang, and S.-K. Jeng
International Joint Conference on Natural Language Processing (IJCNLP), 2013
- 2013
Incorporating Global Variance in the Training Phase of GMM-based Voice Conversion
H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen
APSIPA, 2013
- 2013
Robust Wi-Fi Location Fingerprinting Against Device Diversity based on Spatial Mean Normalization
C.-H. Wang, T.-W. Kao, S.-H. Fang, Y. Tsao, L.-C. Kuo, S.-W. Kao, and N.-C. Lin
APSIPA, 2013
- 2013
Ensemble of Machine Learning and Acoustic Segment Model Techniques for Speech Emotion and Autism Spectrum Disorders Recognition
H.-Y. Lee, T.-Y. Hu, How Jing, Y.-F. Chang, Y. Tsao, Y.-C. Kao, and T.-L. Pao
Interspeech, 2013
- 2013
Recurrent Neural Network Based Language Model Personalization by Social Network Crowdsourcing
T.-H. Wen, Aaron Heidel, H.-y. Lee, Y. Tsao, and L.-S. Lee
Interspeech, 2013
Best Student Paper Award Nomination
- 2013
Alleviating the Over-Smoothing Problem in GMM-Based Voice Conversion with Discriminative Training
H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen
Interspeech, 2013
- 2013
An Investigation of Spectral Restoration Algorithms for Deep Neural Networks based Noise Robust Speech Recognition
B. Li, Y. Tsao, and Khe Chai Sim
Interspeech, 2013
- 2013
Speech enhancement based on deep denoising autoencoder
X. Lu, Y. Tsao, Shigeki Matsuda and Chiori Hori
Interspeech, 2013
- 2013
Evaluation of Generalized Maximum a Posteriori Spectral Amplitude (GMAPA) Speech Enhancement Algorithm in Hearing Aids
Y.-H. Lai, Y.-C. Su, Y. Tsao, S.-T. Young
ISCE, 2013
- 2013
Filtering on the Temporal Probability Sequence in Histogram Equalization for Robust Speech Recognition
S.-S. Wang, Y. Tsao, and J.-W. Hung
ICASSP, 2013
- 2013
Speech Enhancement using Generalized Maximum a Posteriori Spectral Amplitude Estimator
Y.-C. Su, Y. Tsao, J.-E. Wu, and F.-R. Jean
ICASSP, 2013
2012
- 2012
Exploring Mutual Information for GMM-Based Spectral Conversion
H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen
ISCSLP, 2012
- 2012
A Study on Cepstral Subband Normalization for Robust ASR
S.-S. Wang, J.-W. Hung, and Y. Tsao
ISCSLP, 2012
- 2012
Acoustic Space Partition based on Broad Phonetic Class for Ensemble Acoustic Modeling
X. Lu, Y. Tsao, S. Matsuda, C. Hori, and H. Kashioka
ISCSLP, 2012
- 2012
Discriminative Fuzzy Clustering Maximum a Posterior Linear Regression for Speaker Adaptation
T.-Y. Hu, Y. Tsao, and L.-S. Lee
Interspeech, 2012
- 2012
A Study of Mutual Information for GMM-Based Spectral Conversion
H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen
Interspeech, 2012
- 2012
A Linear Projection Approach to Environment Modeling for Robust Speech Recognition
Y. Tsao, C.-L. Huang, S. Matsuda, C. Hori, and H. Kashioka
ICASSP, 2012
2011
- 2011
Feature Normalization and Selection for Robust Speaker State Recognition
C.-L. Huang, Y. Tsao, and C. Hori
International Committee for Co-ordination and Standardisation of Speech Databases (COCOSDA), 2011
- 2011
Incorporating Regional Information to Enhance MAP-based Stochastic Feature Compensation for Robust Speech Recognition
Y. Tsao, P. R. Dixon, C. Hori, and H. Kawai
Interspeech, 2011
- 2011
A Sampling-based Environment Population Projection Approach for Rapid Acoustic Model Adaptation
Y. Tsao, R. Isotani, H. Kawai, and S. Nakamura
ICASSP, 2011
- 2011
Increasing Discriminative Capability on MAP-based Mapping Function Estimation for Acoustic Model Adaptation
Y. Tsao, R. Isotani, H. Kawai, and S. Nakamura
ICASSP, 2011
2010
- 2010
Shrinkage Model Adaptation in Automatic Speech Recognition
J. Li, Y. Tsao, and C.-H. Lee
Interspeech, 2010
- 2010
A Particle Filter Feature Compensation Approach to Robust Speech Recognition
A. Mushtaq, Y. Tsao, and C.-H. Lee
Interspeech, 2010
- 2010
An Acoustic Segment Model Approach to Incorporating Temporal Information into Speaker Modeling for Text-Independent Speaker Recognition
Y. Tsao, H. Sun, H. Li, and C.-H. Lee
ICASSP, 2010
2009
- 2009
MAP Estimation of Online Mapping Parameters in Ensemble Speaker and Speaking Environment Modeling
Y. Tsao, S. Matsuda, S. Nakamura, and C.-H. Lee
IEEE Automatic Speech Recognition and Understanding (ASRU), 2009
- 2009
Soft Margin Estimation on Improving Environment Structures for Ensemble Speaker and Speaking Environment Modeling
Y. Tsao, J. Li, C.-H. Lee, and S. Nakamura
International Universal Communication Symposium (IUCS), 2009
- 2009
A Study on Soft Margin Estimation of Linear Regression Parameters for Speaker Adaptation
S. Matsuda, Y. Tsao, J. Li, S. Nakamura, and C.-H. Lee
Interspeech, 2009
- 2009
Ensemble Speaker and Speaking Environment Modeling Approach with Advanced Online Estimation Process
Y. Tsao, J. Li, and C.-H. Lee
ICASSP, 2009
2008
- 2008
A Programmable Analog Radial-Basis-Function Based Classifier
S.-Y. Peng, Y. Tsao, P. E. Hasler, and D. V. Anderson
ICASSP, 2008
- 2008
Improving the Ensemble Speaker and Speaking Environment Modeling Approach by Enhancing the Precision of the Online Estimation Process
Y. Tsao and C.-H. Lee
Interspeech, 2008
2007
- 2007
Two Extensions to Ensemble Speaker and Speaking Environment Modeling for Robust Automatic Speech Recognition
Y. Tsao and C.-H. Lee
IEEE Automatic Speech Recognition and Understanding (ASRU), 2007
- 2007
Detection-based ASR in the Automatic Speech Attribute Transcription Project
I. Bromberg, Q. Fu, J. Hou, J. Li, C. Ma, B. Matthews, A. Moreno-Daniel, J. Morris, S. M. Siniscalchi, Y. Tsao, and Y. Wang
Interspeech, 2007
- 2007
An Ensemble Modeling Approach to Joint Characterization of Speaker and Speaking Environments
Y. Tsao and C.-H. Lee
Interspeech, 2007
2006
- 2006
A Study on Detection Based Automatic Speech Recognition
C. Ma, Y. Tsao, and C.-H. Lee
Interspeech, 2006
- 2006
A Vector Space Approach to Environment Modeling for Robust Speech Recognition
Y. Tsao and C.-H. Lee
Interspeech, 2006
2005
- 2005
A Study on Separation between Acoustic Models and Its Applications
Y. Tsao, J. Li, and C.-H. Lee
Eurospeech, 2005
- 2005
A Study on Knowledge Source Integration for Candidate Rescoring in Automatic Speech Recognition
J. Li, Y. Tsao, and C.-H. Lee
ICASSP, 2005
2001
- 2001
Segmental Eigenvoice for Rapid Speaker Adaptation
Y. Tsao, S.-M. Lee, and L.-S. Lee
Eurospeech, 2001