Saved in:
| Main Authors: | Gan, Yuan, Miao, Jiaxu, Wang, Yunze, Yang, Yi |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.01591 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Hindi audio-video-Deepfake (HAV-DF): A Hindi language-based Audio-video Deepfake Dataset
by: Kaur, Sukhandeep, et al.
Published: (2024)
by: Kaur, Sukhandeep, et al.
Published: (2024)
IO-RAE: Information-Obfuscation Reversible Adversarial Example for Audio Privacy Protection
by: Zhu, Jiajie, et al.
Published: (2026)
by: Zhu, Jiajie, et al.
Published: (2026)
Boosting the Transferability of Audio Adversarial Examples with Acoustic Representation Optimization
by: Jin, Weifei, et al.
Published: (2025)
by: Jin, Weifei, et al.
Published: (2025)
EveGuard: Defeating Vibration-based Side-Channel Eavesdropping with Audio Adversarial Perturbations
by: Chang, Jung-Woo, et al.
Published: (2024)
by: Chang, Jung-Woo, et al.
Published: (2024)
PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis
by: Xie, Yifan, et al.
Published: (2024)
by: Xie, Yifan, et al.
Published: (2024)
Yours or Mine? Overwriting Attacks Against Neural Audio Watermarking
by: Yao, Lingfeng, et al.
Published: (2025)
by: Yao, Lingfeng, et al.
Published: (2025)
ELGAR: Expressive Cello Performance Motion Generation for Audio Rendition
by: Qiu, Zhiping, et al.
Published: (2025)
by: Qiu, Zhiping, et al.
Published: (2025)
READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation
by: Wang, Haotian, et al.
Published: (2025)
by: Wang, Haotian, et al.
Published: (2025)
Adversarial Representation Learning for Robust Privacy Preservation in Audio
by: Gharib, Shayan, et al.
Published: (2023)
by: Gharib, Shayan, et al.
Published: (2023)
SilentCipher: Deep Audio Watermarking
by: Singh, Mayank Kumar, et al.
Published: (2024)
by: Singh, Mayank Kumar, et al.
Published: (2024)
Gumbel Rao Monte Carlo based Bi-Modal Neural Architecture Search for Audio-Visual Deepfake Detection
by: PN, Aravinda Reddy, et al.
Published: (2024)
by: PN, Aravinda Reddy, et al.
Published: (2024)
An Effective Energy Mask-based Adversarial Evasion Attacks against Misclassification in Speaker Recognition Systems
by: Park, Chanwoo, et al.
Published: (2026)
by: Park, Chanwoo, et al.
Published: (2026)
Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis
by: Shen, Shuai, et al.
Published: (2025)
by: Shen, Shuai, et al.
Published: (2025)
Hybrid Audio Detection Using Fine-Tuned Audio Spectrogram Transformers: A Dataset-Driven Evaluation of Mixed AI-Human Speech
by: Huang, Kunyang, et al.
Published: (2025)
by: Huang, Kunyang, et al.
Published: (2025)
Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification
by: Joshi, Sonal, et al.
Published: (2024)
by: Joshi, Sonal, et al.
Published: (2024)
Interpretable Temporal Class Activation Representation for Audio Spoofing Detection
by: Li, Menglu, et al.
Published: (2024)
by: Li, Menglu, et al.
Published: (2024)
Adversarial Attacks and Defenses for Speech Recognition Systems
by: Żelasko, Piotr, et al.
Published: (2021)
by: Żelasko, Piotr, et al.
Published: (2021)
Pitch Imperfect: Detecting Audio Deepfakes Through Acoustic Prosodic Analysis
by: Warren, Kevin, et al.
Published: (2025)
by: Warren, Kevin, et al.
Published: (2025)
One-Class Learning with Adaptive Centroid Shift for Audio Deepfake Detection
by: Kim, Hyun Myung, et al.
Published: (2024)
by: Kim, Hyun Myung, et al.
Published: (2024)
A Preliminary Case Study on Long-Form In-the-Wild Audio Spoofing Detection
by: Liu, Xuechen, et al.
Published: (2024)
by: Liu, Xuechen, et al.
Published: (2024)
PITCH: AI-assisted Tagging of Deepfake Audio Calls using Challenge-Response
by: Mittal, Govind, et al.
Published: (2024)
by: Mittal, Govind, et al.
Published: (2024)
When Good Sounds Go Adversarial: Jailbreaking Audio-Language Models with Benign Inputs
by: Dingeto, Hiskias, et al.
Published: (2025)
by: Dingeto, Hiskias, et al.
Published: (2025)
Representation Learning for Audio Privacy Preservation using Source Separation and Robust Adversarial Learning
by: Luong, Diep, et al.
Published: (2023)
by: Luong, Diep, et al.
Published: (2023)
AdvWave: Stealthy Adversarial Jailbreak Attack against Large Audio-Language Models
by: Kang, Mintong, et al.
Published: (2024)
by: Kang, Mintong, et al.
Published: (2024)
Zero-Query Adversarial Attack on Black-box Automatic Speech Recognition Systems
by: Fang, Zheng, et al.
Published: (2024)
by: Fang, Zheng, et al.
Published: (2024)
The Man Behind the Sound: Demystifying Audio Private Attribute Profiling via Multimodal Large Language Model Agents
by: Wang, Lixu, et al.
Published: (2025)
by: Wang, Lixu, et al.
Published: (2025)
ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic Features
by: Cheng, Peng, et al.
Published: (2024)
by: Cheng, Peng, et al.
Published: (2024)
PosCUDA: Position based Convolution for Unlearnable Audio Datasets
by: Gokul, Vignesh, et al.
Published: (2024)
by: Gokul, Vignesh, et al.
Published: (2024)
AudioMarkBench: Benchmarking Robustness of Audio Watermarking
by: Liu, Hongbin, et al.
Published: (2024)
by: Liu, Hongbin, et al.
Published: (2024)
SpeechVerifier: Robust Acoustic Fingerprint against Tampering Attacks via Watermarking
by: Yao, Lingfeng, et al.
Published: (2025)
by: Yao, Lingfeng, et al.
Published: (2025)
GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis
by: Liu, Weizhi, et al.
Published: (2024)
by: Liu, Weizhi, et al.
Published: (2024)
Phoneme-Based Proactive Anti-Eavesdropping with Controlled Recording Privilege
by: Huang, Peng, et al.
Published: (2024)
by: Huang, Peng, et al.
Published: (2024)
Attacker's Noise Can Manipulate Your Audio-based LLM in the Real World
by: Sadasivan, Vinu Sankar, et al.
Published: (2025)
by: Sadasivan, Vinu Sankar, et al.
Published: (2025)
Cross-Technology Generalization in Synthesized Speech Detection: Evaluating AST Models with Modern Voice Generators
by: Ustinov, Andrew, et al.
Published: (2025)
by: Ustinov, Andrew, et al.
Published: (2025)
Sok: Comprehensive Security Overview, Challenges, and Future Directions of Voice-Controlled Systems
by: Xu, Haozhe, et al.
Published: (2024)
by: Xu, Haozhe, et al.
Published: (2024)
Multi-speaker Text-to-speech Training with Speaker Anonymized Data
by: Huang, Wen-Chin, et al.
Published: (2024)
by: Huang, Wen-Chin, et al.
Published: (2024)
Benchmarking Fake Voice Detection in the Fake Voice Generation Arms Race
by: Mao, Xutao, et al.
Published: (2025)
by: Mao, Xutao, et al.
Published: (2025)
A Practical Survey on Emerging Threats from AI-driven Voice Attacks: How Vulnerable are Commercial Voice Control Systems?
by: Wang, Yuanda, et al.
Published: (2023)
by: Wang, Yuanda, et al.
Published: (2023)
Content and Style Aware Audio-Driven Facial Animation
by: Liu, Qingju, et al.
Published: (2024)
by: Liu, Qingju, et al.
Published: (2024)
Why Speech Deepfake Detectors Won't Generalize: The Limits of Detection in an Open World
by: Berisha, Visar, et al.
Published: (2025)
by: Berisha, Visar, et al.
Published: (2025)
Similar Items
-
Hindi audio-video-Deepfake (HAV-DF): A Hindi language-based Audio-video Deepfake Dataset
by: Kaur, Sukhandeep, et al.
Published: (2024) -
IO-RAE: Information-Obfuscation Reversible Adversarial Example for Audio Privacy Protection
by: Zhu, Jiajie, et al.
Published: (2026) -
Boosting the Transferability of Audio Adversarial Examples with Acoustic Representation Optimization
by: Jin, Weifei, et al.
Published: (2025) -
EveGuard: Defeating Vibration-based Side-Channel Eavesdropping with Audio Adversarial Perturbations
by: Chang, Jung-Woo, et al.
Published: (2024) -
PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis
by: Xie, Yifan, et al.
Published: (2024)