:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Tan, Ee-Leng, Karnapi, Furi Andi, Ng, Linus Junjia, Ooi, Kenneth, Gan, Woon-Seng
Format:	Preprint
Published:	2024
Subjects:	Audio and Speech Processing Sound
Online Access:	https://arxiv.org/abs/2408.05721
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Improving Stereo 3D Sound Event Localization and Detection: Perceptual Features, Stereo-specific Data Augmentation, and Distance Normalization
by: Yeow, Jun-Wei, et al.
Published: (2025)

MAGENTA: Magnitude and Geometry-ENhanced Training Approach for Robust Long-Tailed Sound Event Localization and Detection
by: Yeow, Jun-Wei, et al.
Published: (2025)

Squeeze-and-Excite ResNet-Conformers for Sound Event Localization, Detection, and Distance Estimation for DCASE 2024 Challenge
by: Yeow, Jun Wei, et al.
Published: (2024)

Enhancing Situational Awareness in Wearable Audio Devices Using a Lightweight Sound Event Localization and Detection System
by: Yeow, Jun-Wei, et al.
Published: (2025)

Transformer-based End-to-End Control Filter Generation for Active Noise Control
by: Yang, Ziyi, et al.
Published: (2026)

Automating Urban Soundscape Enhancements with AI: In-situ Assessment of Quality and Restorativeness in Traffic-Exposed Residential Areas
by: Lam, Bhan, et al.
Published: (2024)

ARAUS: A Large-Scale Dataset and Baseline Models of Affective Responses to Augmented Urban Soundscapes
by: Ooi, Kenneth, et al.
Published: (2022)

Acoustic Scene Classification Using CNN-GRU Model Without Knowledge Distillation
by: Tan, Ee-Leng, et al.
Published: (2025)

Autonomous Soundscape Augmentation with Multimodal Fusion of Visual and Participant-linked Inputs
by: Ooi, Kenneth, et al.
Published: (2023)

Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction
by: Yeo, Jin Jie Sean, et al.
Published: (2024)

Joint Feature and Output Distillation for Low-complexity Acoustic Scene Classification
by: Li, Haowen, et al.
Published: (2025)

Do neonates hear what we measure? Assessing neonatal ward soundscapes at the neonates ears
by: Lam, Bhan, et al.
Published: (2025)

FRCRN: Boosting Feature Representation using Frequency Recurrence for Monaural Speech Enhancement
by: Zhao, Shengkui, et al.
Published: (2022)

Mixed-gradients Distributed Filtered Reference Least Mean Square Algorithm -- A Robust Distributed Multichannel Active Noise Control Algorithm
by: Ji, Junwei, et al.
Published: (2025)

IoT-based Noise Monitoring using Mobile Nodes for Smart Cities
by: Manthina, Bhima Sankar, et al.
Published: (2025)

Self-Boosted Weight-Constrained FxLMS: A Robustness Distributed Active Noise Control Algorithm Without Internode Communication
by: Ji, Junwei, et al.
Published: (2025)

Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music
by: Yin, Han, et al.
Published: (2024)

A Stabilized Hybrid Active Noise Control Algorithm of GFANC and FxNLMS with Online Clustering
by: Luo, Zhengding, et al.
Published: (2026)

AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning
by: Bai, Jisheng, et al.
Published: (2023)

Period Singer: Integrating Periodic and Aperiodic Variational Autoencoders for Natural-Sounding End-to-End Singing Voice Synthesis
by: Kim, Taewoo, et al.
Published: (2024)

VBx for End-to-End Neural and Clustering-based Diarization
by: Pálka, Petr, et al.
Published: (2025)

DNCASR: End-to-End Training for Speaker-Attributed ASR
by: Zheng, Xianrui, et al.
Published: (2025)

Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT
by: Dai, Dongyang, et al.
Published: (2025)

An End-To-End Stuttering Detection Method Based On Conformer And BILSTM
by: Liu, Xiaokang, et al.
Published: (2024)

Distributed Multichannel Active Noise Control with Asynchronous Communication
by: Ji, Junwei, et al.
Published: (2026)

Transferable Selective Virtual Sensing Active Noise Control Technique Based on Metric Learning
by: Wang, Boxiang, et al.
Published: (2024)

Predictive Directional Selective Fixed-Filter Active Noise Control for Moving Sources via a Convolutional Recurrent Neural Network
by: Wang, Boxiang, et al.
Published: (2026)

A Real-Time Platform for Portable and Scalable Active Noise Mitigation for Construction Machinery
by: Gan, Woon-Seng, et al.
Published: (2024)

End-to-End Speech Recognition with Pre-trained Masked Language Model
by: Higuchi, Yosuke, et al.
Published: (2024)

AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models
by: Bai, Jisheng, et al.
Published: (2024)

End-to-End Direction-Aware Keyword Spotting with Spatial Priors in Noisy Environments
by: Wang, Rui, et al.
Published: (2026)

End-to-End DOA-Guided Speech Extraction in Noisy Multi-Talker Scenarios
by: Jing, Kangqi, et al.
Published: (2025)

Using Adapters to Overcome Catastrophic Forgetting in End-to-End Automatic Speech Recognition
by: Eeckt, Steven Vander, et al.
Published: (2022)

End-to-End Diarization utilizing Attractor Deep Clustering
by: Palzer, David, et al.
Published: (2025)

Speaker Adaptation for Quantised End-to-End ASR Models
by: Zhao, Qiuming, et al.
Published: (2024)

An Investigation on Speaker Augmentation for End-to-End Speaker Extraction
by: You, Zhenghai, et al.
Published: (2025)

Lightweight and Robust Multi-Channel End-to-End Speech Recognition with Spherical Harmonic Transform
by: Kong, Xiangzhu, et al.
Published: (2025)

Dissecting the Segmentation Model of End-to-End Diarization with Vector Clustering
by: Plaquet, Alexis, et al.
Published: (2025)

A Robust Proactive Communication Strategy for Distributed Active Noise Control Systems
by: Ji, Junwei, et al.
Published: (2025)

Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement
by: Dai, Wang, et al.
Published: (2024)