| Main Authors: | Tan, Ee-Leng, Karnapi, Furi Andi, Ng, Linus Junjia, Ooi, Kenneth, Gan, Woon-Seng |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.05721 |
Similar Items
Improving Stereo 3D Sound Event Localization and Detection: Perceptual Features, Stereo-specific Data Augmentation, and Distance Normalization
by: Yeow, Jun-Wei, et al.
Published: (2025)
MAGENTA: Magnitude and Geometry-ENhanced Training Approach for Robust Long-Tailed Sound Event Localization and Detection
by: Yeow, Jun-Wei, et al.
Published: (2025)
Squeeze-and-Excite ResNet-Conformers for Sound Event Localization, Detection, and Distance Estimation for DCASE 2024 Challenge
by: Yeow, Jun Wei, et al.
Published: (2024)
Enhancing Situational Awareness in Wearable Audio Devices Using a Lightweight Sound Event Localization and Detection System
by: Yeow, Jun-Wei, et al.
Published: (2025)
Transformer-based End-to-End Control Filter Generation for Active Noise Control
by: Yang, Ziyi, et al.
Published: (2026)
Automating Urban Soundscape Enhancements with AI: In-situ Assessment of Quality and Restorativeness in Traffic-Exposed Residential Areas
by: Lam, Bhan, et al.
Published: (2024)
ARAUS: A Large-Scale Dataset and Baseline Models of Affective Responses to Augmented Urban Soundscapes
by: Ooi, Kenneth, et al.
Published: (2022)
Acoustic Scene Classification Using CNN-GRU Model Without Knowledge Distillation
by: Tan, Ee-Leng, et al.
Published: (2025)
Autonomous Soundscape Augmentation with Multimodal Fusion of Visual and Participant-linked Inputs
by: Ooi, Kenneth, et al.
Published: (2023)
Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction
by: Yeo, Jin Jie Sean, et al.
Published: (2024)
Joint Feature and Output Distillation for Low-complexity Acoustic Scene Classification
by: Li, Haowen, et al.
Published: (2025)
Do neonates hear what we measure? Assessing neonatal ward soundscapes at the neonates ears
by: Lam, Bhan, et al.
Published: (2025)
FRCRN: Boosting Feature Representation using Frequency Recurrence for Monaural Speech Enhancement
by: Zhao, Shengkui, et al.
Published: (2022)
Mixed-gradients Distributed Filtered Reference Least Mean Square Algorithm -- A Robust Distributed Multichannel Active Noise Control Algorithm
by: Ji, Junwei, et al.
Published: (2025)
IoT-based Noise Monitoring using Mobile Nodes for Smart Cities
by: Manthina, Bhima Sankar, et al.
Published: (2025)
Self-Boosted Weight-Constrained FxLMS: A Robustness Distributed Active Noise Control Algorithm Without Internode Communication
by: Ji, Junwei, et al.
Published: (2025)
Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music
by: Yin, Han, et al.
Published: (2024)
A Stabilized Hybrid Active Noise Control Algorithm of GFANC and FxNLMS with Online Clustering
by: Luo, Zhengding, et al.
Published: (2026)
AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning
by: Bai, Jisheng, et al.
Published: (2023)
Period Singer: Integrating Periodic and Aperiodic Variational Autoencoders for Natural-Sounding End-to-End Singing Voice Synthesis
by: Kim, Taewoo, et al.
Published: (2024)
VBx for End-to-End Neural and Clustering-based Diarization
by: Pálka, Petr, et al.
Published: (2025)
DNCASR: End-to-End Training for Speaker-Attributed ASR
by: Zheng, Xianrui, et al.
Published: (2025)
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT
by: Dai, Dongyang, et al.
Published: (2025)
An End-To-End Stuttering Detection Method Based On Conformer And BILSTM
by: Liu, Xiaokang, et al.
Published: (2024)
Distributed Multichannel Active Noise Control with Asynchronous Communication
by: Ji, Junwei, et al.
Published: (2026)
Transferable Selective Virtual Sensing Active Noise Control Technique Based on Metric Learning
by: Wang, Boxiang, et al.
Published: (2024)
Predictive Directional Selective Fixed-Filter Active Noise Control for Moving Sources via a Convolutional Recurrent Neural Network
by: Wang, Boxiang, et al.
Published: (2026)
A Real-Time Platform for Portable and Scalable Active Noise Mitigation for Construction Machinery
by: Gan, Woon-Seng, et al.
Published: (2024)
End-to-End Speech Recognition with Pre-trained Masked Language Model
by: Higuchi, Yosuke, et al.
Published: (2024)
AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models
by: Bai, Jisheng, et al.
Published: (2024)
End-to-End Direction-Aware Keyword Spotting with Spatial Priors in Noisy Environments
by: Wang, Rui, et al.
Published: (2026)
End-to-End DOA-Guided Speech Extraction in Noisy Multi-Talker Scenarios
by: Jing, Kangqi, et al.
Published: (2025)
Using Adapters to Overcome Catastrophic Forgetting in End-to-End Automatic Speech Recognition
by: Eeckt, Steven Vander, et al.
Published: (2022)
End-to-End Diarization utilizing Attractor Deep Clustering
by: Palzer, David, et al.
Published: (2025)
Speaker Adaptation for Quantised End-to-End ASR Models
by: Zhao, Qiuming, et al.
Published: (2024)
An Investigation on Speaker Augmentation for End-to-End Speaker Extraction
by: You, Zhenghai, et al.
Published: (2025)
Lightweight and Robust Multi-Channel End-to-End Speech Recognition with Spherical Harmonic Transform
by: Kong, Xiangzhu, et al.
Published: (2025)
Dissecting the Segmentation Model of End-to-End Diarization with Vector Clustering
by: Plaquet, Alexis, et al.
Published: (2025)
A Robust Proactive Communication Strategy for Distributed Active Noise Control Systems
by: Ji, Junwei, et al.
Published: (2025)
Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement
by: Dai, Wang, et al.
Published: (2024)