:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Xie, Yuan, Xu, Ji, Ren, Jiawei, Li, Junfeng
Format:	Preprint
Published:	2024
Subjects:	Sound Machine Learning
Online Access:	https://arxiv.org/abs/2411.02848
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Guiding the underwater acoustic target recognition with interpretable contrastive learning
by: Xie, Yuan, et al.
Published: (2024)

Advancing Robust Underwater Acoustic Target Recognition through Multi-task Learning and Multi-Gate Mixture-of-Experts
by: Xie, Yuan, et al.
Published: (2024)

Adaptive ship-radiated noise recognition with learnable fine-grained wavelet transform
by: Xie, Yuan, et al.
Published: (2023)

Underwater-Art: Expanding Information Perspectives With Text Templates For Underwater Acoustic Target Recognition
by: Xie, Yuan, et al.
Published: (2023)

DEMONet: Underwater Acoustic Target Recognition based on Multi-Expert Network and Cross-Temporal Variational Autoencoder
by: Xie, Yuan, et al.
Published: (2024)

Beyond saliency: enhancing explanation of speech emotion recognition with expert-referenced acoustic cues
by: Nasr, Seham, et al.
Published: (2025)

Underwater Acoustic Target Recognition based on Smoothness-inducing Regularization and Spectrogram-based Data Augmentation
by: Xu, Ji, et al.
Published: (2023)

IsoNet: Spatially-aware audio-visual target speech extraction in complex acoustic environments
by: Padhya, Dinanath, et al.
Published: (2026)

SeMaScore : a new evaluation metric for automatic speech recognition tasks
by: Sasindran, Zitha, et al.
Published: (2024)

Fusion approaches for emotion recognition from speech using acoustic and text-based features
by: Pepino, Leonardo, et al.
Published: (2024)

Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks
by: Maiti, Soumi, et al.
Published: (2023)

A noise-robust acoustic method for recognizing foraging activities of grazing cattle
by: Martinez-Rau, Luciano S., et al.
Published: (2023)

Training chord recognition models on artificially generated audio
by: Majchrzak, Martyna, et al.
Published: (2025)

DeepForestSound: a multi-species automatic detector for passive acoustic monitoring in African tropical forests, a case study in Kibale National Park
by: Dubus, Gabriel, et al.
Published: (2026)

Determining the severity of Parkinson's disease in patients using a multi task neural network
by: García-Ordás, María Teresa, et al.
Published: (2024)

Virtual boundary integral neural network for three-dimensional exterior acoustic problems
by: Li, Jiahao, et al.
Published: (2026)

Surface impedance inference via neural fields and sparse acoustic data obtained by a compact array
by: Xia, Yuanxin, et al.
Published: (2026)

Unraveling Complex Data Diversity in Underwater Acoustic Target Recognition through Convolution-based Mixture of Experts
by: Xie, Yuan, et al.
Published: (2024)

CR-CTC: Consistency regularization on CTC for improved speech recognition
by: Yao, Zengwei, et al.
Published: (2024)

Better audio representations are more brain-like: linking model-brain alignment with performance in downstream auditory tasks
by: Pepino, Leonardo, et al.
Published: (2025)

Introduction to speech recognition
by: Dauphin, Gabriel
Published: (2024)

Benchmarks and leaderboards for sound demixing tasks
by: Solovyev, Roman, et al.
Published: (2023)

A Systematic Evaluation of Adversarial Attacks against Speech Emotion Recognition Models
by: Facchinetti, Nicolas, et al.
Published: (2024)

Time-Varying Audio Effect Modeling by End-to-End Adversarial Training
by: Bourdin, Yann, et al.
Published: (2025)

Zipformer: A faster and better encoder for automatic speech recognition
by: Yao, Zengwei, et al.
Published: (2023)

Robustifying automatic speech recognition by extracting slowly varying features
by: Pizarro, Matías, et al.
Published: (2021)

Versatile audio-visual learning for emotion recognition
by: Goncalves, Lucas, et al.
Published: (2023)

Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning
by: Wu, Haibin, et al.
Published: (2021)

Late fusion ensembles for speech recognition on diverse input audio representations
by: Jezidžić, Marin, et al.
Published: (2024)

CAARMA: Class Augmentation with Adversarial Mixup Regularization
by: Baali, Massa, et al.
Published: (2025)

Generative Adversarial Post-Training Mitigates Reward Hacking in Live Human-AI Music Interaction
by: Wu, Yusong, et al.
Published: (2025)

Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification
by: Joshi, Sonal, et al.
Published: (2024)

BioSEN: A Bio-acoustic Signal Enhancement Network for Animal Vocalizations
by: Song, Tianyu, et al.
Published: (2026)

MAIA: An Inpainting-Based Approach for Music Adversarial Attacks
by: Liu, Yuxuan, et al.
Published: (2025)

DFKI-Speech System for WildSpoof Challenge: A robust framework for SASV In-the-Wild
by: Das, Arnab, et al.
Published: (2026)

Adversarial Data Augmentation for Robust Speaker Verification
by: Zhou, Zhenyu, et al.
Published: (2024)

A vector quantized masked autoencoder for audiovisual speech emotion recognition
by: Sadok, Samir, et al.
Published: (2023)

ALIGN: Adversarial Learning for Generalizable Speech Neuroprosthesis
by: Zhang, Zhanqi, et al.
Published: (2026)

LipsAM: Lipschitz-Continuous Amplitude Modifier for Audio Signal Processing and its Application to Plug-and-Play Dereverberation
by: Matsumoto, Kazuki, et al.
Published: (2026)

StrADiff: A Structured Source-Wise Adaptive Diffusion Framework for Linear and Nonlinear Blind Source Separation
by: Wei, Yuan-Hao
Published: (2026)