:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Gan, Yuan, Miao, Jiaxu, Wang, Yunze, Yang, Yi
Format:	Preprint
Published:	2025
Subjects:	Graphics Cryptography and Security Computer Vision and Pattern Recognition Sound Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2506.01591
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Hindi audio-video-Deepfake (HAV-DF): A Hindi language-based Audio-video Deepfake Dataset
by: Kaur, Sukhandeep, et al.
Published: (2024)

IO-RAE: Information-Obfuscation Reversible Adversarial Example for Audio Privacy Protection
by: Zhu, Jiajie, et al.
Published: (2026)

Boosting the Transferability of Audio Adversarial Examples with Acoustic Representation Optimization
by: Jin, Weifei, et al.
Published: (2025)

EveGuard: Defeating Vibration-based Side-Channel Eavesdropping with Audio Adversarial Perturbations
by: Chang, Jung-Woo, et al.
Published: (2024)

PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis
by: Xie, Yifan, et al.
Published: (2024)

Yours or Mine? Overwriting Attacks Against Neural Audio Watermarking
by: Yao, Lingfeng, et al.
Published: (2025)

ELGAR: Expressive Cello Performance Motion Generation for Audio Rendition
by: Qiu, Zhiping, et al.
Published: (2025)

READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation
by: Wang, Haotian, et al.
Published: (2025)

Adversarial Representation Learning for Robust Privacy Preservation in Audio
by: Gharib, Shayan, et al.
Published: (2023)

SilentCipher: Deep Audio Watermarking
by: Singh, Mayank Kumar, et al.
Published: (2024)

Gumbel Rao Monte Carlo based Bi-Modal Neural Architecture Search for Audio-Visual Deepfake Detection
by: PN, Aravinda Reddy, et al.
Published: (2024)

An Effective Energy Mask-based Adversarial Evasion Attacks against Misclassification in Speaker Recognition Systems
by: Park, Chanwoo, et al.
Published: (2026)

Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis
by: Shen, Shuai, et al.
Published: (2025)

Hybrid Audio Detection Using Fine-Tuned Audio Spectrogram Transformers: A Dataset-Driven Evaluation of Mixed AI-Human Speech
by: Huang, Kunyang, et al.
Published: (2025)

Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification
by: Joshi, Sonal, et al.
Published: (2024)

Interpretable Temporal Class Activation Representation for Audio Spoofing Detection
by: Li, Menglu, et al.
Published: (2024)

Adversarial Attacks and Defenses for Speech Recognition Systems
by: Żelasko, Piotr, et al.
Published: (2021)

Pitch Imperfect: Detecting Audio Deepfakes Through Acoustic Prosodic Analysis
by: Warren, Kevin, et al.
Published: (2025)

One-Class Learning with Adaptive Centroid Shift for Audio Deepfake Detection
by: Kim, Hyun Myung, et al.
Published: (2024)

A Preliminary Case Study on Long-Form In-the-Wild Audio Spoofing Detection
by: Liu, Xuechen, et al.
Published: (2024)

PITCH: AI-assisted Tagging of Deepfake Audio Calls using Challenge-Response
by: Mittal, Govind, et al.
Published: (2024)

When Good Sounds Go Adversarial: Jailbreaking Audio-Language Models with Benign Inputs
by: Dingeto, Hiskias, et al.
Published: (2025)

Representation Learning for Audio Privacy Preservation using Source Separation and Robust Adversarial Learning
by: Luong, Diep, et al.
Published: (2023)

AdvWave: Stealthy Adversarial Jailbreak Attack against Large Audio-Language Models
by: Kang, Mintong, et al.
Published: (2024)

Zero-Query Adversarial Attack on Black-box Automatic Speech Recognition Systems
by: Fang, Zheng, et al.
Published: (2024)

The Man Behind the Sound: Demystifying Audio Private Attribute Profiling via Multimodal Large Language Model Agents
by: Wang, Lixu, et al.
Published: (2025)

ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic Features
by: Cheng, Peng, et al.
Published: (2024)

PosCUDA: Position based Convolution for Unlearnable Audio Datasets
by: Gokul, Vignesh, et al.
Published: (2024)

AudioMarkBench: Benchmarking Robustness of Audio Watermarking
by: Liu, Hongbin, et al.
Published: (2024)

SpeechVerifier: Robust Acoustic Fingerprint against Tampering Attacks via Watermarking
by: Yao, Lingfeng, et al.
Published: (2025)

GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis
by: Liu, Weizhi, et al.
Published: (2024)

Phoneme-Based Proactive Anti-Eavesdropping with Controlled Recording Privilege
by: Huang, Peng, et al.
Published: (2024)

Attacker's Noise Can Manipulate Your Audio-based LLM in the Real World
by: Sadasivan, Vinu Sankar, et al.
Published: (2025)

Cross-Technology Generalization in Synthesized Speech Detection: Evaluating AST Models with Modern Voice Generators
by: Ustinov, Andrew, et al.
Published: (2025)

Sok: Comprehensive Security Overview, Challenges, and Future Directions of Voice-Controlled Systems
by: Xu, Haozhe, et al.
Published: (2024)

Multi-speaker Text-to-speech Training with Speaker Anonymized Data
by: Huang, Wen-Chin, et al.
Published: (2024)

Benchmarking Fake Voice Detection in the Fake Voice Generation Arms Race
by: Mao, Xutao, et al.
Published: (2025)

A Practical Survey on Emerging Threats from AI-driven Voice Attacks: How Vulnerable are Commercial Voice Control Systems?
by: Wang, Yuanda, et al.
Published: (2023)

Content and Style Aware Audio-Driven Facial Animation
by: Liu, Qingju, et al.
Published: (2024)

Why Speech Deepfake Detectors Won't Generalize: The Limits of Detection in an Open World
by: Berisha, Visar, et al.
Published: (2025)