Saved in:
| Main Authors: | Lan, Yuqin, Li, Gen, Hu, Yuanze, Shen, Weihao, Fan, Zhaoxin, Wu, Faguo, Zhang, Xiao, Yang, Laurence T., Zheng, Zhiming |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.09253 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Burger: Robust Graph Denoising-augmentation Fusion and Multi-semantic Modeling in Social Recommendation
by: Lan, Yuqin, et al.
Published: (2025)
by: Lan, Yuqin, et al.
Published: (2025)
State Beyond Appearance: Diagnosing and Improving State Consistency in Dial-Based Measurement Reading
by: Hu, Yuanze, et al.
Published: (2026)
by: Hu, Yuanze, et al.
Published: (2026)
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
by: Wang, Guojian, et al.
Published: (2023)
by: Wang, Guojian, et al.
Published: (2023)
Lyapunov Probes for Hallucination Detection in Large Foundation Models
by: Luan, Bozhi, et al.
Published: (2026)
by: Luan, Bozhi, et al.
Published: (2026)
HSF: Defending against Jailbreak Attacks with Hidden State Filtering
by: Qian, Cheng, et al.
Published: (2024)
by: Qian, Cheng, et al.
Published: (2024)
Can Structured Templates Facilitate LLMs in Tackling Harder Tasks? : An Exploration of Scaling Laws by Difficulty
by: Yang, Zhichao, et al.
Published: (2025)
by: Yang, Zhichao, et al.
Published: (2025)
Trajectory-Oriented Policy Optimization with Sparse Rewards
by: Wang, Guojian, et al.
Published: (2024)
by: Wang, Guojian, et al.
Published: (2024)
Universal Adversarial Attacks against Closed-Source MLLMs via Target-View Routed Meta Optimization
by: Lu, Hui, et al.
Published: (2026)
by: Lu, Hui, et al.
Published: (2026)
HalluSAE: Detecting Hallucinations in Large Language Models via Sparse Auto-Encoders
by: Chen, Boshui, et al.
Published: (2026)
by: Chen, Boshui, et al.
Published: (2026)
Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations
by: Wang, Guojian, et al.
Published: (2023)
by: Wang, Guojian, et al.
Published: (2023)
TinyAlign: Boosting Lightweight Vision-Language Models by Mitigating Modal Alignment Bottlenecks
by: Hu, Yuanze, et al.
Published: (2025)
by: Hu, Yuanze, et al.
Published: (2025)
EMG-UP: Unsupervised Personalization in Cross-User EMG Gesture Recognition
by: Wang, Nana, et al.
Published: (2025)
by: Wang, Nana, et al.
Published: (2025)
A Lightweight Framework for Trigger-Guided LoRA-Based Self-Adaptation in LLMs
by: Wei, Jiacheng, et al.
Published: (2025)
by: Wei, Jiacheng, et al.
Published: (2025)
Align is not Enough: Multimodal Universal Jailbreak Attack against Multimodal Large Language Models
by: Wang, Youze, et al.
Published: (2025)
by: Wang, Youze, et al.
Published: (2025)
You Only Anonymize What Is Not Intent-Relevant: Suppressing Non-Intent Privacy Evidence
by: Shen, Weihao, et al.
Published: (2026)
by: Shen, Weihao, et al.
Published: (2026)
Mosaic: Area-Closed Spherical Surface Mosaics Induced by Cartesian Grids
by: Counts, H. F., et al.
Published: (2026)
by: Counts, H. F., et al.
Published: (2026)
Jailbreaking Attack against Multimodal Large Language Model
by: Niu, Zhenxing, et al.
Published: (2024)
by: Niu, Zhenxing, et al.
Published: (2024)
Steering Dialogue Dynamics for Robustness against Multi-turn Jailbreaking Attacks
by: Hu, Hanjiang, et al.
Published: (2025)
by: Hu, Hanjiang, et al.
Published: (2025)
MLPHand: Real Time Multi-View 3D Hand Mesh Reconstruction via MLP Modeling
by: Yang, Jian, et al.
Published: (2024)
by: Yang, Jian, et al.
Published: (2024)
Jailbreaking LLMs & VLMs: Mechanisms, Evaluation, and Unified Defense
by: Chen, Zejian, et al.
Published: (2026)
by: Chen, Zejian, et al.
Published: (2026)
Tone Matters: The Impact of Linguistic Tone on Hallucination in VLMs
by: Hong, Weihao, et al.
Published: (2026)
by: Hong, Weihao, et al.
Published: (2026)
MSP-MVS: Multi-Granularity Segmentation Prior Guided Multi-View Stereo
by: Yuan, Zhenlong, et al.
Published: (2024)
by: Yuan, Zhenlong, et al.
Published: (2024)
AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks
by: Zeng, Yifan, et al.
Published: (2024)
by: Zeng, Yifan, et al.
Published: (2024)
Efficient LLM-Jailbreaking via Multimodal-LLM Jailbreak
by: Ji, Haoxuan, et al.
Published: (2024)
by: Ji, Haoxuan, et al.
Published: (2024)
Z-Erase: Enabling Concept Erasure in Single-Stream Diffusion Transformers
by: Jiang, Nanxiang, et al.
Published: (2026)
by: Jiang, Nanxiang, et al.
Published: (2026)
Learning Diverse Policies with Soft Self-Generated Guidance
by: Wang, Guojian, et al.
Published: (2024)
by: Wang, Guojian, et al.
Published: (2024)
Faster-GCG: Efficient Discrete Optimization Jailbreak Attacks against Aligned Large Language Models
by: Li, Xiao, et al.
Published: (2024)
by: Li, Xiao, et al.
Published: (2024)
PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks
by: Shen, Guobin, et al.
Published: (2025)
by: Shen, Guobin, et al.
Published: (2025)
Universally Unfiltered and Unseen:Input-Agnostic Multimodal Jailbreaks against Text-to-Image Model Safeguards
by: Yan, Song, et al.
Published: (2025)
by: Yan, Song, et al.
Published: (2025)
DVP-MVS: Synergize Depth-Edge and Visibility Prior for Multi-View Stereo
by: Yuan, Zhenlong, et al.
Published: (2024)
by: Yuan, Zhenlong, et al.
Published: (2024)
On the Perception Bottleneck of VLMs for Chart Understanding
by: Liu, Junteng, et al.
Published: (2025)
by: Liu, Junteng, et al.
Published: (2025)
MAC-AMP: A Closed-Loop Multi-Agent Collaboration System for Multi-Objective Antimicrobial Peptide Design
by: Zhou, Gen, et al.
Published: (2026)
by: Zhou, Gen, et al.
Published: (2026)
Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey
by: Liu, Xuannan, et al.
Published: (2024)
by: Liu, Xuannan, et al.
Published: (2024)
When Memory Becomes a Vulnerability: Towards Multi-turn Jailbreak Attacks against Text-to-Image Generation Systems
by: Zhao, Shiqian, et al.
Published: (2025)
by: Zhao, Shiqian, et al.
Published: (2025)
Implicit Jailbreak Attacks via Cross-Modal Information Concealment on Vision-Language Models
by: Wang, Zhaoxin, et al.
Published: (2025)
by: Wang, Zhaoxin, et al.
Published: (2025)
A New Deep-learning-Based Approach For mRNA Optimization: High Fidelity, Computation Efficiency, and Multiple Optimization Factors
by: Gong, Zheng, et al.
Published: (2025)
by: Gong, Zheng, et al.
Published: (2025)
On Optimizing Multimodal Jailbreaks for Spoken Language Models
by: Krishnan, Aravind, et al.
Published: (2026)
by: Krishnan, Aravind, et al.
Published: (2026)
TSAR-MVS: Textureless-aware Segmentation and Correlative Refinement Guided Multi-View Stereo
by: Yuan, Zhenlong, et al.
Published: (2023)
by: Yuan, Zhenlong, et al.
Published: (2023)
Token-Level Constraint Boundary Search for Jailbreaking Text-to-Image Models
by: Liu, Jiangtao, et al.
Published: (2025)
by: Liu, Jiangtao, et al.
Published: (2025)
Every Picture Tells a Dangerous Story: Memory-Augmented Multi-Agent Jailbreak Attacks on VLMs
by: Chen, Jianhao, et al.
Published: (2026)
by: Chen, Jianhao, et al.
Published: (2026)
Similar Items
-
Burger: Robust Graph Denoising-augmentation Fusion and Multi-semantic Modeling in Social Recommendation
by: Lan, Yuqin, et al.
Published: (2025) -
State Beyond Appearance: Diagnosing and Improving State Consistency in Dial-Based Measurement Reading
by: Hu, Yuanze, et al.
Published: (2026) -
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
by: Wang, Guojian, et al.
Published: (2023) -
Lyapunov Probes for Hallucination Detection in Large Foundation Models
by: Luan, Bozhi, et al.
Published: (2026) -
HSF: Defending against Jailbreak Attacks with Hidden State Filtering
by: Qian, Cheng, et al.
Published: (2024)