:: Library Catalog

Bibliographic Details
Main Authors:	Sung, Ching-Chih; Hsin, Cheng-Hung; Shiah, Yu-Anne; Lin, Bo-Jyun; Lai, Yi-Xuan; Lee, Chia-Ying; Wang, Yu-Te; Su, Borchin; Tsao, Yu
Format:	Preprint
Published:	2025
Subjects:	Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2508.15473

Similar Items

Linguistic Knowledge Transfer Learning for Speech Enhancement
by: Hung, Kuo-Hsuan, et al.
Published: (2025)

Condition-Invariant fMRI Decoding of Speech Intelligibility with Deep State Space Model
by: Sung, Ching-Chih, et al.
Published: (2025)

From Evaluation to Optimization: Neural Speech Assessment for Downstream Applications
by: Tsao, Yu
Published: (2025)

Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech
by: Fu, Szu-Wei, et al.
Published: (2024)

LLM-Guided Reinforcement Learning for Audio-Visual Speech Enhancement
by: Chen, Chih-Ning, et al.
Published: (2026)

Robust Audio-Visual Speech Enhancement: Correcting Misassignments in Complex Environments with Advanced Post-Processing
by: Ren, Wenze, et al.
Published: (2024)

Cincinnati; Our Convention City
by: Borchin, Anna
Published: (1970)

Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features
by: Zezario, Ryandhimas E., et al.
Published: (2021)

72‐1: The PathSync Intelligent Transparent Display Navigation System
by: Chao-Ming Yu, et al.
Published: (2025)

SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models
by: Yin, Chun, et al.
Published: (2024)

Preoperative Prognosis Assessment of Lumbar Spinal Surgery for Low Back Pain and Sciatica Patients based on Multimodalities and Multimodal Learning
by: Chen, Li-Chin, et al.
Published: (2023)

A Study on Speech Assessment with Visual Cues
by: Ahmed, Shafique, et al.
Published: (2025)

A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models
by: Zezario, Ryandhimas E., et al.
Published: (2024)

A Study on Incorporating Whisper for Robust Speech Assessment
by: Zezario, Ryandhimas E., et al.
Published: (2023)

HighRateMOS: Sampling-Rate Aware Modeling for Speech Quality Assessment
by: Ren, Wenze, et al.
Published: (2025)

A Comparative Study on Proactive and Passive Detection of Deepfake Speech
by: Wu, Chia-Hua, et al.
Published: (2025)

Speech Intelligibility Assessment with Uncertainty-Aware Whisper Embeddings and sLSTM
by: Zezario, Ryandhimas E., et al.
Published: (2025)

Leveraging Mamba with Full-Face Vision for Audio-Visual Speech Enhancement
by: Chao, Rong, et al.
Published: (2025)

Bridging the Gap: Integrating Pre-trained Speech Enhancement and Recognition Models for Robust Speech Recognition
by: Wang, Kuan-Chen, et al.
Published: (2024)

Using knowledge building and flipped learning to enhance students' learning performance in a hands‐on STEM activity
by: Jyun‐Chen Chen, et al.
Published: (2024)

Developing an Interdisciplinary Hands‐On Learning Activity With the 6E Model to Improve Students' STEM Knowledge, Learning Motivation and Creativity
by: Jyun‐Chen Chen, et al.
Published: (2025)

Universal Speech Enhancement with Regression and Generative Mamba
by: Chao, Rong, et al.
Published: (2025)

Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement
by: Ren, Wenze, et al.
Published: (2024)

TrustEMG-Net: Using Representation-Masking Transformer with U-Net for Surface Electromyography Enhancement
by: Wang, Kuan-Chen, et al.
Published: (2024)

MOS-Bias: From Hidden Gender Bias to Gender-Aware Speech Quality Assessment
by: Ren, Wenze, et al.
Published: (2026)

Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues
by: Hussain, Tassadaq, et al.
Published: (2024)

Hypoxia‐induced translation of collagen‐modifying enzymes PLOD2 and P4HA1 is dependent on RBM4 and eIF4E2 in human colon cancer HCT116 cells
by: Hung‐Hsuan Li, et al.
Published: (2024)

Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model
by: Zezario, Ryandhimas E., et al.
Published: (2023)

Stabilizing Physics-Informed Consistency Models via Structure-Preserving Training
by: Chang, Che-Chia, et al.
Published: (2026)

Consistency Training with Physical Constraints
by: Chang, Che-Chia, et al.
Published: (2025)

TINNs: Time-Induced Neural Networks for Solving Time-Dependent PDEs
by: Dai, Chen-Yang, et al.
Published: (2026)

Unsupervised Face-Masked Speech Enhancement Using Generative Adversarial Networks With Human-in-the-Loop Assessment Metrics
by: Wang, Syu-Siang, et al.
Published: (2024)

A Systematic Review and Meta‐Analysis of Footbath Effects and Optimal Procedures to Improve Sleep in Older Adults
by: Shih‐Yu Chang, et al.
Published: (2025)

Bridging The Multi-Modality Gaps of Audio, Visual and Linguistic for Speech Enhancement
by: Lin, Meng-Ping, et al.
Published: (2025)

HAAQI-Net: A Non-intrusive Neural Music Audio Quality Assessment Model for Hearing Aids
by: Wisnu, Dyah A. M. G., et al.
Published: (2024)

Rethinking Training Targets, Architectures and Data Quality for Universal Speech Enhancement
by: Fu, Szu-Wei, et al.
Published: (2026)

Anti‐Apoptotic and Anti‐Oxidative Effects of DDX24 Through HO‐1 Transcriptional Regulation
by: Yu‐Xiu Lin, et al.
Published: (2025)

The miRNAs 203a/210‐3p/5001‐5p regulate the androgen/androgen receptor/YAP‐induced migration in prostate cancer cells
by: Chieh Huo, et al.
Published: (2024)

Exploiting Consistency-Preserving Loss and Perceptual Contrast Stretching to Boost SSL-based Speech Enhancement
by: Khan, Muhammad Salman, et al.
Published: (2024)

Learned Image Compression with Text Quality Enhancement
by: Lai, Chih-Yu, et al.
Published: (2024)