| Main Authors: | Jeong, Seung Gyu, Kim, Seong Eun |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2509.09262 |
Similar Items
Patient Domain Supervised Contrastive Learning for Lung Sound Classification Using Mobile Phone
by: Jeong, Seung Gyu, et al.
Published: (2025)
PC-MCL: Patient-Consistent Multi-Cycle Learning with multi-label bias correction for respiratory sound classification
by: Jeong, Seung Gyu, et al.
Published: (2026)
Patient-Aware Feature Alignment for Robust Lung Sound Classification: Cohesion-Separation and Global Alignment Losses
by: Jeong, Seung Gyu, et al.
Published: (2025)
Spotlight-TTS: Spotlighting the Style via Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech
by: Kim, Nam-Gyu, et al.
Published: (2025)
Deep Space Separable Distillation for Lightweight Acoustic Scene Classification
by: Ye, ShuQi, et al.
Published: (2024)
Listen Like a Teacher: Mitigating Whisper Hallucinations using Adaptive Layer Attention and Knowledge Distillation
by: Tripathi, Kumud, et al.
Published: (2025)
Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech
by: Kim, Nam-Gyu
Published: (2025)
Adaptive Knowledge Distillation for Device-Directed Speech Detection
by: Chi, Hyung Gun, et al.
Published: (2025)
Toward Complex-Valued Neural Networks for Waveform Generation
by: Oh, Hyung-Seok, et al.
Published: (2026)
Device-Robust Acoustic Scene Classification via Impulse Response Augmentation
by: Morocutti, Tobias, et al.
Published: (2023)
DDSC: Dynamic Dual-Signal Curriculum for Data-Efficient Acoustic Scene Classification under Domain Shift
by: Zhang, Peihong, et al.
Published: (2025)
An Entropy-Guided Curriculum Learning Strategy for Data-Efficient Acoustic Scene Classification under Domain Shift
by: Zhang, Peihong, et al.
Published: (2025)
Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification
by: Morocutti, Tobias, et al.
Published: (2025)
DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech
by: Cho, Deok-Hyeon, et al.
Published: (2025)
Improving Respiratory Sound Classification with Architecture-Agnostic Knowledge Distillation from Ensembles
by: Toikkanen, Miika, et al.
Published: (2025)
EmoSphere-SER: Enhancing Speech Emotion Recognition Through Spherical Representation with Auxiliary Classification
by: Cho, Deok-Hyeon, et al.
Published: (2025)
EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector
by: Cho, Deok-Hyeon, et al.
Published: (2024)
Modality-Specific Speech Enhancement and Noise-Adaptive Fusion for Acoustic and Body-Conduction Microphone Framework
by: Kim, Yunsik, et al.
Published: (2025)
DOA Estimation with Lightweight Network on LLM-Aided Simulated Acoustic Scenes
by: Li, Haowen, et al.
Published: (2025)
ASKD-Whisper: Adaptive Self-knowledge Distillation for Efficient and Low-Latency Automatic Speech Recognition
by: Lee, Junseok, et al.
Published: (2026)
TinyMusician: On-Device Music Generation with Knowledge Distillation and Mixed Precision Quantization
by: Wang, Hainan, et al.
Published: (2025)
Low-Complexity Acoustic Scene Classification with Device Information in the DCASE 2025 Challenge
by: Schmid, Florian, et al.
Published: (2025)
SoundCompass: Navigating Target Sound Extraction With Effective Directional Clue Integration In Complex Acoustic Scenes
by: Choi, Dayun, et al.
Published: (2025)
Ensemble-Guided Distillation for Compact and Robust Acoustic Scene Classification on Edge Devices
by: Sharify, Hossein, et al.
Published: (2025)
Quantum-Enhanced Transformers for Robust Acoustic Scene Classification in IoT Environments
by: Quan, Minh K., et al.
Published: (2025)
Adaptive Vehicle Speed Classification via BMCNN with Reinforcement Learning-Enhanced Acoustic Processing
by: Zhang, Yuli, et al.
Published: (2025)
Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning
by: Han, Bing, et al.
Published: (2024)
Real-Time Object Tracking with On-Device Deep Learning for Adaptive Beamforming in Dynamic Acoustic Environments
by: Ortigoso-Narro, Jorge, et al.
Published: (2025)
S-SONDO: Self-Supervised Knowledge Distillation for General Audio Foundation Models
by: Adlouni, Mohammed Ali El, et al.
Published: (2026)
FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching
by: Yun, Jun-Hak, et al.
Published: (2025)
Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility
by: Liu, Xiaoyu, et al.
Published: (2024)
VividVoice: A Unified Framework for Scene-Aware Visually-Driven Speech Synthesis
by: Ma, Chengyuan, et al.
Published: (2026)
MATER: Multi-level Acoustic and Textual Emotion Representation for Interpretable Speech Emotion Recognition
by: Jon, Hyo Jin, et al.
Published: (2025)
IS$^3$: Generic Impulsive-Stationary Sound Separation in Acoustic Scenes using Deep Filtering
by: Berger, Clémentine, et al.
Published: (2025)
Cross-Corpus Validation of Speech Emotion Recognition in Urdu using Domain-Knowledge Acoustic Features
by: Talpur, Unzela, et al.
Published: (2025)
Self-supervised Learning for Acoustic Few-Shot Classification
by: Liang, Jingyong, et al.
Published: (2024)
Neural Speech Embeddings for Speech Synthesis Based on Deep Generative Networks
by: Lee, Seo-Hyun, et al.
Published: (2023)
Domain-Agnostic Causal-Aware Audio Transformer for Infant Cry Classification
by: Owino, Geofrey, et al.
Published: (2025)
BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification
by: Kim, June-Woo, et al.
Published: (2024)
EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech
by: Cho, Deok-Hyeon, et al.
Published: (2024)