Saved in:
| Main Authors: | Son, Sang Won, Park, Jongyeon, Kim, Hong Kook, Vesal, Sulaiman, Lim, Jeong Eun |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.12721 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Sound Scene Synthesis at the DCASE 2024 Challenge
by: Lagrange, Mathieu, et al.
Published: (2025)
by: Lagrange, Mathieu, et al.
Published: (2025)
Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4
by: Park, Jongyeon, et al.
Published: (2025)
by: Park, Jongyeon, et al.
Published: (2025)
DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels
by: Cornell, Samuele, et al.
Published: (2024)
by: Cornell, Samuele, et al.
Published: (2024)
Performance Improvement of Language-Queried Audio Source Separation Based on Caption Augmentation From Large Language Models for DCASE Challenge 2024 Task 9
by: Lee, Do Hyun, et al.
Published: (2024)
by: Lee, Do Hyun, et al.
Published: (2024)
Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes
by: Yasuda, Masahiro, et al.
Published: (2025)
by: Yasuda, Masahiro, et al.
Published: (2025)
Description and Discussion on DCASE 2026 Challenge Task 2: Noise-aware Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
by: Nishida, Tomoya, et al.
Published: (2026)
by: Nishida, Tomoya, et al.
Published: (2026)
Description and Discussion on DCASE 2025 Challenge Task 2: First-shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
by: Nishida, Tomoya, et al.
Published: (2025)
by: Nishida, Tomoya, et al.
Published: (2025)
FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels
by: Xiao, Yang, et al.
Published: (2024)
by: Xiao, Yang, et al.
Published: (2024)
Data-Efficient Low-Complexity Acoustic Scene Classification in the DCASE 2024 Challenge
by: Schmid, Florian, et al.
Published: (2024)
by: Schmid, Florian, et al.
Published: (2024)
Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
by: Nishida, Tomoya, et al.
Published: (2024)
by: Nishida, Tomoya, et al.
Published: (2024)
AISTAT lab system for DCASE2025 Task6: Language-based audio retrieval
by: Kim, Hyun Jun, et al.
Published: (2025)
by: Kim, Hyun Jun, et al.
Published: (2025)
Handling Domain Shifts for Anomalous Sound Detection: A Review of DCASE-Related Work
by: Wilkinghoff, Kevin, et al.
Published: (2025)
by: Wilkinghoff, Kevin, et al.
Published: (2025)
Description and analysis of novelties introduced in DCASE Task 4 2022 on the baseline system
by: Ronchini, Francesca, et al.
Published: (2022)
by: Ronchini, Francesca, et al.
Published: (2022)
Low-Complexity Acoustic Scene Classification with Device Information in the DCASE 2025 Challenge
by: Schmid, Florian, et al.
Published: (2025)
by: Schmid, Florian, et al.
Published: (2025)
BioDCASE 2026 Challenge Baseline for Cross-Domain Mosquito Species Classification
by: Hou, Yuanbo, et al.
Published: (2026)
by: Hou, Yuanbo, et al.
Published: (2026)
A decade of DCASE: Achievements, practices, evaluations and future challenges
by: Mesaros, Annamaria, et al.
Published: (2024)
by: Mesaros, Annamaria, et al.
Published: (2024)
Sound event localization and detection based on crnn using rectangular filters and channel rotation data augmentation
by: Ronchini, Francesca, et al.
Published: (2020)
by: Ronchini, Francesca, et al.
Published: (2020)
Patient Domain Supervised Contrastive Learning for Lung Sound Classification Using Mobile Phone
by: Jeong, Seung Gyu, et al.
Published: (2025)
by: Jeong, Seung Gyu, et al.
Published: (2025)
Patient-Aware Feature Alignment for Robust Lung Sound Classification:Cohesion-Separation and Global Alignment Losses
by: Jeong, Seung Gyu, et al.
Published: (2025)
by: Jeong, Seung Gyu, et al.
Published: (2025)
Solution for Temporal Sound Localisation Task of ECCV Second Perception Test Challenge 2024
by: Gu, Haowei, et al.
Published: (2024)
by: Gu, Haowei, et al.
Published: (2024)
Frequency-aware convolution for sound event detection
by: Song, Tao, et al.
Published: (2024)
by: Song, Tao, et al.
Published: (2024)
An Empirical Analysis of Task-Induced Encoder Bias in Fréchet Audio Distance
by: Jeong, Wonwoo
Published: (2026)
by: Jeong, Wonwoo
Published: (2026)
DiffSound: Differentiable Modal Sound Rendering and Inverse Rendering for Diverse Inference Tasks
by: Jin, Xutong, et al.
Published: (2024)
by: Jin, Xutong, et al.
Published: (2024)
Onset and offset weighted loss function for sound event detection
by: Song, Tao
Published: (2024)
by: Song, Tao
Published: (2024)
Fine-tune the pretrained ATST model for sound event detection
by: Shao, Nian, et al.
Published: (2023)
by: Shao, Nian, et al.
Published: (2023)
Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2
by: Ke, Weijie, et al.
Published: (2024)
by: Ke, Weijie, et al.
Published: (2024)
XWSB: A Blend System Utilizing XLS-R and WavLM with SLS Classifier detection system for SVDD 2024 Challenge
by: Zhang, Qishan, et al.
Published: (2024)
by: Zhang, Qishan, et al.
Published: (2024)
The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track
by: Uhlich, Stefan, et al.
Published: (2023)
by: Uhlich, Stefan, et al.
Published: (2023)
Technical Report of Nomi Team in the Environmental Sound Deepfake Detection Challenge 2026
by: Mawalim, Candy Olivia, et al.
Published: (2025)
by: Mawalim, Candy Olivia, et al.
Published: (2025)
The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track
by: Fabbro, Giorgio, et al.
Published: (2023)
by: Fabbro, Giorgio, et al.
Published: (2023)
Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection
by: Yue, Haobo, et al.
Published: (2024)
by: Yue, Haobo, et al.
Published: (2024)
Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling
by: Gao, Wenmiao, et al.
Published: (2025)
by: Gao, Wenmiao, et al.
Published: (2025)
Performance and energy balance: a comprehensive study of state-of-the-art sound event detection systems
by: Ronchini, Francesca, et al.
Published: (2023)
by: Ronchini, Francesca, et al.
Published: (2023)
Resnet-conformer network with shared weights and attention mechanism for sound event localization, detection, and distance estimation
by: Vo, Quoc Thinh, et al.
Published: (2025)
by: Vo, Quoc Thinh, et al.
Published: (2025)
The impact of non-target events in synthetic soundscapes for sound event detection
by: Ronchini, Francesca, et al.
Published: (2021)
by: Ronchini, Francesca, et al.
Published: (2021)
BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification
by: Kim, June-Woo, et al.
Published: (2024)
by: Kim, June-Woo, et al.
Published: (2024)
Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification
by: Bae, Sangmin, et al.
Published: (2023)
by: Bae, Sangmin, et al.
Published: (2023)
The Interspeech 2024 Challenge on Speech Processing Using Discrete Units
by: Chang, Xuankai, et al.
Published: (2024)
by: Chang, Xuankai, et al.
Published: (2024)
The IEEE-IS2 2024 Music Packet Loss Concealment Challenge
by: Mezza, Alessandro Ilic, et al.
Published: (2024)
by: Mezza, Alessandro Ilic, et al.
Published: (2024)
The ICASSP 2024 Audio Deep Packet Loss Concealment Challenge
by: Diener, Lorenz, et al.
Published: (2024)
by: Diener, Lorenz, et al.
Published: (2024)
Similar Items
-
Sound Scene Synthesis at the DCASE 2024 Challenge
by: Lagrange, Mathieu, et al.
Published: (2025) -
Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4
by: Park, Jongyeon, et al.
Published: (2025) -
DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels
by: Cornell, Samuele, et al.
Published: (2024) -
Performance Improvement of Language-Queried Audio Source Separation Based on Caption Augmentation From Large Language Models for DCASE Challenge 2024 Task 9
by: Lee, Do Hyun, et al.
Published: (2024) -
Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes
by: Yasuda, Masahiro, et al.
Published: (2025)