:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Jeong, Seung Gyu, Kim, Seong Eun
Format:	Preprint
Published:	2025
Subjects:	Sound Artificial Intelligence
Online Access:	https://arxiv.org/abs/2509.09262
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Patient Domain Supervised Contrastive Learning for Lung Sound Classification Using Mobile Phone
by: Jeong, Seung Gyu, et al.
Published: (2025)

PC-MCL: Patient-Consistent Multi-Cycle Learning with multi-label bias correction for respiratory sound classification
by: Jeong, Seung Gyu, et al.
Published: (2026)

Patient-Aware Feature Alignment for Robust Lung Sound Classification:Cohesion-Separation and Global Alignment Losses
by: Jeong, Seung Gyu, et al.
Published: (2025)

Spotlight-TTS: Spotlighting the Style via Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech
by: Kim, Nam-Gyu, et al.
Published: (2025)

Deep Space Separable Distillation for Lightweight Acoustic Scene Classification
by: Ye, ShuQi, et al.
Published: (2024)

Listen Like a Teacher: Mitigating Whisper Hallucinations using Adaptive Layer Attention and Knowledge Distillation
by: Tripathi, Kumud, et al.
Published: (2025)

Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech
by: Kim, Nam-Gyu
Published: (2025)

Adaptive Knowledge Distillation for Device-Directed Speech Detection
by: Chi, Hyung Gun, et al.
Published: (2025)

Toward Complex-Valued Neural Networks for Waveform Generation
by: Oh, Hyung-Seok, et al.
Published: (2026)

Device-Robust Acoustic Scene Classification via Impulse Response Augmentation
by: Morocutti, Tobias, et al.
Published: (2023)

DDSC: Dynamic Dual-Signal Curriculum for Data-Efficient Acoustic Scene Classification under Domain Shift
by: Zhang, Peihong, et al.
Published: (2025)

An Entropy-Guided Curriculum Learning Strategy for Data-Efficient Acoustic Scene Classification under Domain Shift
by: Zhang, Peihong, et al.
Published: (2025)

Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification
by: Morocutti, Tobias, et al.
Published: (2025)

DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech
by: Cho, Deok-Hyeon, et al.
Published: (2025)

Improving Respiratory Sound Classification with Architecture-Agnostic Knowledge Distillation from Ensembles
by: Toikkanen, Miika, et al.
Published: (2025)

EmoSphere-SER: Enhancing Speech Emotion Recognition Through Spherical Representation with Auxiliary Classification
by: Cho, Deok-Hyeon, et al.
Published: (2025)

EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector
by: Cho, Deok-Hyeon, et al.
Published: (2024)

Modality-Specific Speech Enhancement and Noise-Adaptive Fusion for Acoustic and Body-Conduction Microphone Framework
by: Kim, Yunsik, et al.
Published: (2025)

DOA Estimation with Lightweight Network on LLM-Aided Simulated Acoustic Scenes
by: Li, Haowen, et al.
Published: (2025)

ASKD-Whisper: Adaptive Self-knowledge Distillation for Efficient and Low-Latency Automatic Speech Recognition
by: Lee, Junseok, et al.
Published: (2026)

TinyMusician: On-Device Music Generation with Knowledge Distillation and Mixed Precision Quantization
by: Wang, Hainan, et al.
Published: (2025)

Low-Complexity Acoustic Scene Classification with Device Information in the DCASE 2025 Challenge
by: Schmid, Florian, et al.
Published: (2025)

SoundCompass: Navigating Target Sound Extraction With Effective Directional Clue Integration In Complex Acoustic Scenes
by: Choi, Dayun, et al.
Published: (2025)

Ensemble-Guided Distillation for Compact and Robust Acoustic Scene Classification on Edge Devices
by: Sharify, Hossein, et al.
Published: (2025)

Quantum-Enhanced Transformers for Robust Acoustic Scene Classification in IoT Environments
by: Quan, Minh K., et al.
Published: (2025)

Adaptive Vehicle Speed Classification via BMCNN with Reinforcement Learning-Enhanced Acoustic Processing
by: Zhang, Yuli, et al.
Published: (2025)

Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning
by: Han, Bing, et al.
Published: (2024)

Real-Time Object Tracking with On-Device Deep Learning for Adaptive Beamforming in Dynamic Acoustic Environments
by: Ortigoso-Narro, Jorge, et al.
Published: (2025)

S-SONDO: Self-Supervised Knowledge Distillation for General Audio Foundation Models
by: Adlouni, Mohammed Ali El, et al.
Published: (2026)

FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching
by: Yun, Jun-Hak, et al.
Published: (2025)

Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility
by: Liu, Xiaoyu, et al.
Published: (2024)

VividVoice: A Unified Framework for Scene-Aware Visually-Driven Speech Synthesis
by: Ma, Chengyuan, et al.
Published: (2026)

MATER: Multi-level Acoustic and Textual Emotion Representation for Interpretable Speech Emotion Recognition
by: Jon, Hyo Jin, et al.
Published: (2025)

IS${}^3$ : Generic Impulsive--Stationary Sound Separation in Acoustic Scenes using Deep Filtering
by: Berger, Clémentine, et al.
Published: (2025)

Cross-Corpus Validation of Speech Emotion Recognition in Urdu using Domain-Knowledge Acoustic Features
by: Talpur, Unzela, et al.
Published: (2025)

Self-supervised Learning for Acoustic Few-Shot Classification
by: Liang, Jingyong, et al.
Published: (2024)

Neural Speech Embeddings for Speech Synthesis Based on Deep Generative Networks
by: Lee, Seo-Hyun, et al.
Published: (2023)

Domain-Agnostic Causal-Aware Audio Transformer for Infant Cry Classification
by: Owino, Geofrey, et al.
Published: (2025)

BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification
by: Kim, June-Woo, et al.
Published: (2024)

EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech
by: Cho, Deok-Hyeon, et al.
Published: (2024)