:: Library Catalog

Bibliographic Details
Main Authors:	Son, Sang Won; Park, Jongyeon; Kim, Hong Kook; Vesal, Sulaiman; Lim, Jeong Eun
Format:	Preprint
Published:	2024
Subjects:	Audio and Speech Processing; Sound
Online Access:	https://arxiv.org/abs/2406.12721

Similar Items

Sound Scene Synthesis at the DCASE 2024 Challenge
by: Lagrange, Mathieu, et al.
Published: (2025)

Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4
by: Park, Jongyeon, et al.
Published: (2025)

DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels
by: Cornell, Samuele, et al.
Published: (2024)

Performance Improvement of Language-Queried Audio Source Separation Based on Caption Augmentation From Large Language Models for DCASE Challenge 2024 Task 9
by: Lee, Do Hyun, et al.
Published: (2024)

Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes
by: Yasuda, Masahiro, et al.
Published: (2025)

Description and Discussion on DCASE 2026 Challenge Task 2: Noise-aware Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
by: Nishida, Tomoya, et al.
Published: (2026)

Description and Discussion on DCASE 2025 Challenge Task 2: First-shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
by: Nishida, Tomoya, et al.
Published: (2025)

FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels
by: Xiao, Yang, et al.
Published: (2024)

Data-Efficient Low-Complexity Acoustic Scene Classification in the DCASE 2024 Challenge
by: Schmid, Florian, et al.
Published: (2024)

Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
by: Nishida, Tomoya, et al.
Published: (2024)

AISTAT lab system for DCASE2025 Task6: Language-based audio retrieval
by: Kim, Hyun Jun, et al.
Published: (2025)

Handling Domain Shifts for Anomalous Sound Detection: A Review of DCASE-Related Work
by: Wilkinghoff, Kevin, et al.
Published: (2025)

Description and analysis of novelties introduced in DCASE Task 4 2022 on the baseline system
by: Ronchini, Francesca, et al.
Published: (2022)

Low-Complexity Acoustic Scene Classification with Device Information in the DCASE 2025 Challenge
by: Schmid, Florian, et al.
Published: (2025)

BioDCASE 2026 Challenge Baseline for Cross-Domain Mosquito Species Classification
by: Hou, Yuanbo, et al.
Published: (2026)

A decade of DCASE: Achievements, practices, evaluations and future challenges
by: Mesaros, Annamaria, et al.
Published: (2024)

Sound event localization and detection based on crnn using rectangular filters and channel rotation data augmentation
by: Ronchini, Francesca, et al.
Published: (2020)

Patient Domain Supervised Contrastive Learning for Lung Sound Classification Using Mobile Phone
by: Jeong, Seung Gyu, et al.
Published: (2025)

Patient-Aware Feature Alignment for Robust Lung Sound Classification: Cohesion-Separation and Global Alignment Losses
by: Jeong, Seung Gyu, et al.
Published: (2025)

Solution for Temporal Sound Localisation Task of ECCV Second Perception Test Challenge 2024
by: Gu, Haowei, et al.
Published: (2024)

Frequency-aware convolution for sound event detection
by: Song, Tao, et al.
Published: (2024)

An Empirical Analysis of Task-Induced Encoder Bias in Fréchet Audio Distance
by: Jeong, Wonwoo
Published: (2026)

DiffSound: Differentiable Modal Sound Rendering and Inverse Rendering for Diverse Inference Tasks
by: Jin, Xutong, et al.
Published: (2024)

Onset and offset weighted loss function for sound event detection
by: Song, Tao
Published: (2024)

Fine-tune the pretrained ATST model for sound event detection
by: Shao, Nian, et al.
Published: (2023)

Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2
by: Ke, Weijie, et al.
Published: (2024)

XWSB: A Blend System Utilizing XLS-R and WavLM with SLS Classifier detection system for SVDD 2024 Challenge
by: Zhang, Qishan, et al.
Published: (2024)

The Sound Demixing Challenge 2023 – Cinematic Demixing Track
by: Uhlich, Stefan, et al.
Published: (2023)

Technical Report of Nomi Team in the Environmental Sound Deepfake Detection Challenge 2026
by: Mawalim, Candy Olivia, et al.
Published: (2025)

The Sound Demixing Challenge 2023 – Music Demixing Track
by: Fabbro, Giorgio, et al.
Published: (2023)

Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection
by: Yue, Haobo, et al.
Published: (2024)

Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling
by: Gao, Wenmiao, et al.
Published: (2025)

Performance and energy balance: a comprehensive study of state-of-the-art sound event detection systems
by: Ronchini, Francesca, et al.
Published: (2023)

Resnet-conformer network with shared weights and attention mechanism for sound event localization, detection, and distance estimation
by: Vo, Quoc Thinh, et al.
Published: (2025)

The impact of non-target events in synthetic soundscapes for sound event detection
by: Ronchini, Francesca, et al.
Published: (2021)

BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification
by: Kim, June-Woo, et al.
Published: (2024)

Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification
by: Bae, Sangmin, et al.
Published: (2023)

The Interspeech 2024 Challenge on Speech Processing Using Discrete Units
by: Chang, Xuankai, et al.
Published: (2024)

The IEEE-IS2 2024 Music Packet Loss Concealment Challenge
by: Mezza, Alessandro Ilic, et al.
Published: (2024)

The ICASSP 2024 Audio Deep Packet Loss Concealment Challenge
by: Diener, Lorenz, et al.
Published: (2024)