:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Hai, P. H., Minh, L. T., Son, L. H.
Format:	Preprint
Veröffentlicht:	2026
Schlagworte:	Sound Artificial Intelligence
Online-Zugang:	https://arxiv.org/abs/2605.03934
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

FlexSED: Towards Open-Vocabulary Sound Event Detection
von: Hai, Jiarui, et al.
Veröffentlicht: (2025)

Detect Any Sound: Open-Vocabulary Sound Event Detection with Multi-Modal Queries
von: Cai, Pengfei, et al.
Veröffentlicht: (2025)

SynSonic: Augmenting Sound Event Detection through Text-to-Audio Diffusion ControlNet and Effective Sample Filtering
von: Hai, Jiarui, et al.
Veröffentlicht: (2025)

Environmental Sound Deepfake Detection Using Deep-Learning Framework
von: Pham, Lam, et al.
Veröffentlicht: (2026)

Formula-Supervised Sound Event Detection: Pre-Training Without Real Data
von: Shibata, Yuto, et al.
Veröffentlicht: (2025)

Leveraging Language Model Capabilities for Sound Event Detection
von: Wang, Hualei, et al.
Veröffentlicht: (2023)

The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection
von: Bibbó, Gabriel, et al.
Veröffentlicht: (2024)

Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples
von: Wang, Zhenyu, et al.
Veröffentlicht: (2024)

Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection
von: Cai, Pengfei, et al.
Veröffentlicht: (2024)

Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapes
von: Roman, Adrian S., et al.
Veröffentlicht: (2024)

AUDETER: A Large-scale Dataset for Deepfake Audio Detection in Open Worlds
von: Wang, Qizhou, et al.
Veröffentlicht: (2025)

Hybrid Disagreement-Diversity Active Learning for Bioacoustic Sound Event Detection
von: Zhang, Shiqi, et al.
Veröffentlicht: (2025)

SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation
von: Mu, Da, et al.
Veröffentlicht: (2024)

MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection
von: Cai, Pengfei, et al.
Veröffentlicht: (2024)

Dual Knowledge Distillation for Efficient Sound Event Detection
von: Xiao, Yang, et al.
Veröffentlicht: (2024)

Sub-Band Spectral Matching with Localized Score Aggregation for Robust Anomalous Sound Detection
von: Saengthong, Phurich, et al.
Veröffentlicht: (2026)

Auditory Intelligence: Understanding the World Through Sound
von: Nam, Hyeonuk
Veröffentlicht: (2025)

Improving Anomalous Sound Detection with Attribute-aware Representation from Domain-adaptive Pre-training
von: Fang, Xin, et al.
Veröffentlicht: (2025)

Pediatric Asthma Detection with Googles HeAR Model: An AI-Driven Respiratory Sound Classifier
von: Ehtesham, Abul, et al.
Veröffentlicht: (2025)

MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection
von: Mu, Da, et al.
Veröffentlicht: (2024)

Machine Anomalous Sound Detection Using Spectral-temporal Modulation Representations Derived from Machine-specific Filterbanks
von: Li, Kai, et al.
Veröffentlicht: (2024)

AFSS: Artifact-Focused Self-Synthesis for Mitigating Bias in Audio Deepfake Detection
von: Nguyen-Le, Hai-Son, et al.
Veröffentlicht: (2026)

Let There Be Sound: Reconstructing High Quality Speech from Silent Videos
von: Kim, Ji-Hoon, et al.
Veröffentlicht: (2023)

DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection
von: Fujita, Yoto, et al.
Veröffentlicht: (2024)

Text Prompt is Not Enough: Sound Event Enhanced Prompt Adapter for Target Style Audio Generation
von: Xiong, Chenxu, et al.
Veröffentlicht: (2024)

Towards Robust Speech Deepfake Detection via Human-Inspired Reasoning
von: Dvirniak, Artem, et al.
Veröffentlicht: (2026)

'Studies for': A Human-AI Co-Creative Sound Artwork Using a Real-time Multi-channel Sound Generation Model
von: Nagashima, Chihiro, et al.
Veröffentlicht: (2025)

MARS-Sep: Multimodal-Aligned Reinforced Sound Separation
von: Zhang, Zihan, et al.
Veröffentlicht: (2025)

ASD-Diffusion: Anomalous Sound Detection with Diffusion Models
von: Zhang, Fengrun, et al.
Veröffentlicht: (2024)

EnvSDD: Benchmarking Environmental Sound Deepfake Detection
von: Yin, Han, et al.
Veröffentlicht: (2025)

Towards Explicit Acoustic Evidence Perception in Audio LLMs for Speech Deepfake Detection
von: Guo, Xiaoxuan, et al.
Veröffentlicht: (2026)

Analytic Incremental Learning For Sound Source Localization With Imbalance Rectification
von: Fan, Zexia, et al.
Veröffentlicht: (2026)

OpenSep: Leveraging Large Language Models with Textual Inversion for Open World Audio Separation
von: Mahmud, Tanvir, et al.
Veröffentlicht: (2024)

Toward Noise-Aware Audio Deepfake Detection: Survey, SNR-Benchmarks, and Practical Recipes
von: Sen, Udayon, et al.
Veröffentlicht: (2025)

SynthGuard: An Open Platform for Detecting AI-Generated Multimedia with Multimodal LLMs
von: Desai, Shail, et al.
Veröffentlicht: (2025)

Improving Audio Event Recognition with Consistency Regularization
von: Sadhu, Shanmuka, et al.
Veröffentlicht: (2025)

Contrastive Learning with Spectrum Information Augmentation in Abnormal Sound Detection
von: Meng, Xinxin, et al.
Veröffentlicht: (2025)

Deep Generic Representations for Domain-Generalized Anomalous Sound Detection
von: Saengthong, Phurich, et al.
Veröffentlicht: (2024)

LAMB: LLM-based Audio Captioning with Modality Gap Bridging via Cauchy-Schwarz Divergence
von: Lee, Hyeongkeun, et al.
Veröffentlicht: (2026)

One Prompt, Many Sounds: Modeling Listener Variability in LLM-Based Equalization
von: Stylianou, Ioannis, et al.
Veröffentlicht: (2026)