:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Jiang, Weipeng, Wang, Zhenting, Zhai, Juan, Ma, Shiqing, Zhao, Zhengyu, Shen, Chao
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2408.11313
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

TASO: Jailbreak LLMs via Alternative Template and Suffix Optimization
by: Wang, Yanting, et al.
Published: (2025)

Efficient DNN-Powered Software with Fair Sparse Models
by: Gao, Xuanqi, et al.
Published: (2024)

Holistic Audit Dataset Generation for LLM Unlearning via Knowledge Graph Traversal and Redundancy Removal
by: Jiang, Weipeng, et al.
Published: (2025)

False Friends in the Shell: Unveiling the Emoticon Semantic Confusion in Large Language Models
by: Jiang, Weipeng, et al.
Published: (2026)

Mitigating Stylistic Biases of Machine Translation Systems via Monolingual Corpora Only
by: Gao, Xuanqi, et al.
Published: (2025)

From Effectiveness to Efficiency: Uncovering Linguistic Bias in Large Language Model-based Code Generation
by: Jiang, Weipeng, et al.
Published: (2024)

Speculative Coreset Selection for Task-Specific Fine-tuning
by: Zhang, Xiaoyu, et al.
Published: (2024)

GASP: Efficient Black-Box Generation of Adversarial Suffixes for Jailbreaking LLMs
by: Basani, Advik Raj, et al.
Published: (2024)

TrapSuffix: Proactive Defense Against Adversarial Suffixes in Jailbreaking
by: Du, Mengyao, et al.
Published: (2026)

The Invisible Hand: Unveiling Provider Bias in Large Language Models for Code Generation
by: Zhang, Xiaoyu, et al.
Published: (2025)

Rapid Optimization for Jailbreaking LLMs via Subconscious Exploitation and Echopraxia
by: Shen, Guangyu, et al.
Published: (2024)

The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries
by: Jiang, Weipeng, et al.
Published: (2025)

DREAM: Debugging and Repairing AutoML Pipelines
by: Zhang, Xiaoyu, et al.
Published: (2023)

Data-centric NLP Backdoor Defense from the Lens of Memorization
by: Wang, Zhenting, et al.
Published: (2024)

Vript: A Video Is Worth Thousands of Words
by: Yang, Dongjie, et al.
Published: (2024)

Rethinking Technology Stack Selection with AI Coding Proficiency
by: Zhang, Xiaoyu, et al.
Published: (2025)

Token-Budget-Aware LLM Reasoning
by: Han, Tingxu, et al.
Published: (2024)

BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks
by: Zhao, Yunhan, et al.
Published: (2024)

Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization
by: Hu, Kai, et al.
Published: (2024)

PROMPTMINER: Black-Box Prompt Stealing against Text-to-Image Generative Models via Reinforcement Learning and Fuzz Optimization
by: Li, Mingzhe, et al.
Published: (2025)

An Embedding is Worth a Thousand Noisy Labels
by: Di Salvo, Francesco, et al.
Published: (2024)

A Video Is Not Worth a Thousand Words
by: Pollard, Sam, et al.
Published: (2025)

Constrained Efficient Global Optimization of Expensive Black-box Functions
by: Xu, Wenjie, et al.
Published: (2022)

CITADEL: Context Similarity Based Deep Learning Framework Bug Finding
by: Zhang, Xiaoyu, et al.
Published: (2024)

MCP-RADAR: A Multi-Dimensional Benchmark for Evaluating Tool Use Capabilities in Large Language Models
by: Gao, Xuanqi, et al.
Published: (2025)

ASSURE: Metamorphic Testing for AI-powered Browser Extensions
by: Gao, Xuanqi, et al.
Published: (2025)

How to Trace Latent Generative Model Generated Images without Artificial Watermark?
by: Wang, Zhenting, et al.
Published: (2024)

Universal Jailbreak Suffixes Are Strong Attention Hijackers
by: Ben-Tov, Matan, et al.
Published: (2025)

Task-free Adaptive Meta Black-box Optimization
by: Wang, Chao, et al.
Published: (2026)

Universal Multi-view Black-box Attack against Object Detectors via Layout Optimization
by: Wang, Donghua, et al.
Published: (2024)

JailPO: A Novel Black-box Jailbreak Framework via Preference Optimization against Aligned LLMs
by: Li, Hongyi, et al.
Published: (2024)

Adaptive Content Restriction for Large Language Models via Suffix Optimization
by: Li, Yige, et al.
Published: (2025)

Black-box Optimization of LLM Outputs by Asking for Directions
by: Zhang, Jie, et al.
Published: (2025)

A LoRA is Worth a Thousand Pictures
by: Liu, Chenxi, et al.
Published: (2024)

PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling
by: Ma, Avery, et al.
Published: (2025)

EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models
by: Li, Mingzhe, et al.
Published: (2025)

WordVIS: A Color Worth A Thousand Words
by: Khan, Umar, et al.
Published: (2024)

A Label is Worth a Thousand Images in Dataset Distillation
by: Qin, Tian, et al.
Published: (2024)

Route to Rome Attack: Directing LLM Routers to Expensive Models via Adversarial Suffix Optimization
by: Tang, Haochun, et al.
Published: (2026)

Efficient LLM-Jailbreaking via Multimodal-LLM Jailbreak
by: Ji, Haoxuan, et al.
Published: (2024)