Saved in:
| Main Authors: | Sung, Ching-Chih, Hsin, Cheng-Hung, Shiah, Yu-Anne, Lin, Bo-Jyun, Lai, Yi-Xuan, Lee, Chia-Ying, Wang, Yu-Te, Su, Borchin, Tsao, Yu |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.15473 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Linguistic Knowledge Transfer Learning for Speech Enhancement
by: Hung, Kuo-Hsuan, et al.
Published: (2025)
by: Hung, Kuo-Hsuan, et al.
Published: (2025)
Condition-Invariant fMRI Decoding of Speech Intelligibility with Deep State Space Model
by: Sung, Ching-Chih, et al.
Published: (2025)
by: Sung, Ching-Chih, et al.
Published: (2025)
From Evaluation to Optimization: Neural Speech Assessment for Downstream Applications
by: Tsao, Yu
Published: (2025)
by: Tsao, Yu
Published: (2025)
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech
by: Fu, Szu-Wei, et al.
Published: (2024)
by: Fu, Szu-Wei, et al.
Published: (2024)
LLM-Guided Reinforcement Learning for Audio-Visual Speech Enhancement
by: Chen, Chih-Ning, et al.
Published: (2026)
by: Chen, Chih-Ning, et al.
Published: (2026)
Robust Audio-Visual Speech Enhancement: Correcting Misassignments in Complex Environments with Advanced Post-Processing
by: Ren, Wenze, et al.
Published: (2024)
by: Ren, Wenze, et al.
Published: (2024)
Cincinnati; Our Convention City
by: Borchin, Anna
Published: (1970)
by: Borchin, Anna
Published: (1970)
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features
by: Zezario, Ryandhimas E., et al.
Published: (2021)
by: Zezario, Ryandhimas E., et al.
Published: (2021)
72‐1: The PathSync Intelligent Transparent Display Navigation System
by: Chao-Ming Yu, et al.
Published: (2025)
by: Chao-Ming Yu, et al.
Published: (2025)
SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models
by: Yin, Chun, et al.
Published: (2024)
by: Yin, Chun, et al.
Published: (2024)
Preoperative Prognosis Assessment of Lumbar Spinal Surgery for Low Back Pain and Sciatica Patients based on Multimodalities and Multimodal Learning
by: Chen, Li-Chin, et al.
Published: (2023)
by: Chen, Li-Chin, et al.
Published: (2023)
A Study on Speech Assessment with Visual Cues
by: Ahmed, Shafique, et al.
Published: (2025)
by: Ahmed, Shafique, et al.
Published: (2025)
A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models
by: Zezario, Ryandhimas E., et al.
Published: (2024)
by: Zezario, Ryandhimas E., et al.
Published: (2024)
A Study on Incorporating Whisper for Robust Speech Assessment
by: Zezario, Ryandhimas E., et al.
Published: (2023)
by: Zezario, Ryandhimas E., et al.
Published: (2023)
HighRateMOS: Sampling-Rate Aware Modeling for Speech Quality Assessment
by: Ren, Wenze, et al.
Published: (2025)
by: Ren, Wenze, et al.
Published: (2025)
A Comparative Study on Proactive and Passive Detection of Deepfake Speech
by: Wu, Chia-Hua, et al.
Published: (2025)
by: Wu, Chia-Hua, et al.
Published: (2025)
Speech Intelligibility Assessment with Uncertainty-Aware Whisper Embeddings and sLSTM
by: Zezario, Ryandhimas E., et al.
Published: (2025)
by: Zezario, Ryandhimas E., et al.
Published: (2025)
Leveraging Mamba with Full-Face Vision for Audio-Visual Speech Enhancement
by: Chao, Rong, et al.
Published: (2025)
by: Chao, Rong, et al.
Published: (2025)
Bridging the Gap: Integrating Pre-trained Speech Enhancement and Recognition Models for Robust Speech Recognition
by: Wang, Kuan-Chen, et al.
Published: (2024)
by: Wang, Kuan-Chen, et al.
Published: (2024)
Using knowledge building and flipped learning to enhance students' learning performance in a hands‐on STEM activity
by: Jyun‐Chen Chen, et al.
Published: (2024)
by: Jyun‐Chen Chen, et al.
Published: (2024)
Developing an Interdisciplinary Hands‐On Learning Activity With the 6E Model to Improve Students' STEM Knowledge, Learning Motivation and Creativity
by: Jyun‐Chen Chen, et al.
Published: (2025)
by: Jyun‐Chen Chen, et al.
Published: (2025)
Universal Speech Enhancement with Regression and Generative Mamba
by: Chao, Rong, et al.
Published: (2025)
by: Chao, Rong, et al.
Published: (2025)
Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement
by: Ren, Wenze, et al.
Published: (2024)
by: Ren, Wenze, et al.
Published: (2024)
TrustEMG-Net: Using Representation-Masking Transformer with U-Net for Surface Electromyography Enhancement
by: Wang, Kuan-Chen, et al.
Published: (2024)
by: Wang, Kuan-Chen, et al.
Published: (2024)
MOS-Bias: From Hidden Gender Bias to Gender-Aware Speech Quality Assessment
by: Ren, Wenze, et al.
Published: (2026)
by: Ren, Wenze, et al.
Published: (2026)
Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues
by: Hussain, Tassadaq, et al.
Published: (2024)
by: Hussain, Tassadaq, et al.
Published: (2024)
Hypoxia‐induced translation of collagen‐modifying enzymes PLOD2 and P4HA1 is dependent on RBM4 and eIF4E2 in human colon cancer HCT116 cells
by: Hung‐Hsuan Li, et al.
Published: (2024)
by: Hung‐Hsuan Li, et al.
Published: (2024)
Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model
by: Zezario, Ryandhimas E., et al.
Published: (2023)
by: Zezario, Ryandhimas E., et al.
Published: (2023)
Stabilizing Physics-Informed Consistency Models via Structure-Preserving Training
by: Chang, Che-Chia, et al.
Published: (2026)
by: Chang, Che-Chia, et al.
Published: (2026)
Consistency Training with Physical Constraints
by: Chang, Che-Chia, et al.
Published: (2025)
by: Chang, Che-Chia, et al.
Published: (2025)
TINNs: Time-Induced Neural Networks for Solving Time-Dependent PDEs
by: Dai, Chen-Yang, et al.
Published: (2026)
by: Dai, Chen-Yang, et al.
Published: (2026)
Unsupervised Face-Masked Speech Enhancement Using Generative Adversarial Networks With Human-in-the-Loop Assessment Metrics
by: Wang, Syu-Siang, et al.
Published: (2024)
by: Wang, Syu-Siang, et al.
Published: (2024)
A Systematic Review and Meta‐Analysis of Footbath Effects and Optimal Procedures to Improve Sleep in Older Adults
by: Shih‐Yu Chang, et al.
Published: (2025)
by: Shih‐Yu Chang, et al.
Published: (2025)
Bridging The Multi-Modality Gaps of Audio, Visual and Linguistic for Speech Enhancement
by: Lin, Meng-Ping, et al.
Published: (2025)
by: Lin, Meng-Ping, et al.
Published: (2025)
HAAQI-Net: A Non-intrusive Neural Music Audio Quality Assessment Model for Hearing Aids
by: Wisnu, Dyah A. M. G., et al.
Published: (2024)
by: Wisnu, Dyah A. M. G., et al.
Published: (2024)
Rethinking Training Targets, Architectures and Data Quality for Universal Speech Enhancement
by: Fu, Szu-Wei, et al.
Published: (2026)
by: Fu, Szu-Wei, et al.
Published: (2026)
Anti‐Apoptotic and Anti‐Oxidative Effects of DDX24 Through HO‐1 Transcriptional Regulation
by: Yu‐Xiu Lin, et al.
Published: (2025)
by: Yu‐Xiu Lin, et al.
Published: (2025)
The miRNAs 203a/210‐3p/5001‐5p regulate the androgen/androgen receptor/YAP‐induced migration in prostate cancer cells
by: Chieh Huo, et al.
Published: (2024)
by: Chieh Huo, et al.
Published: (2024)
Exploiting Consistency-Preserving Loss and Perceptual Contrast Stretching to Boost SSL-based Speech Enhancement
by: Khan, Muhammad Salman, et al.
Published: (2024)
by: Khan, Muhammad Salman, et al.
Published: (2024)
Learned Image Compression with Text Quality Enhancement
by: Lai, Chih-Yu, et al.
Published: (2024)
by: Lai, Chih-Yu, et al.
Published: (2024)
Similar Items
-
Linguistic Knowledge Transfer Learning for Speech Enhancement
by: Hung, Kuo-Hsuan, et al.
Published: (2025) -
Condition-Invariant fMRI Decoding of Speech Intelligibility with Deep State Space Model
by: Sung, Ching-Chih, et al.
Published: (2025) -
From Evaluation to Optimization: Neural Speech Assessment for Downstream Applications
by: Tsao, Yu
Published: (2025) -
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech
by: Fu, Szu-Wei, et al.
Published: (2024) -
LLM-Guided Reinforcement Learning for Audio-Visual Speech Enhancement
by: Chen, Chih-Ning, et al.
Published: (2026)