Saved in:
| Main Authors: | Fu, Lucheng, Yu, Ye, Wang, Yiyang, Jin, Yiqiao, Jin, Haibo, Prakash, B. Aditya, Wang, Haohan |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.21318 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
UniSD: Towards a Unified Self-Distillation Framework for Large Language Models
by: Jin, Yiqiao, et al.
Published: (2026)
by: Jin, Yiqiao, et al.
Published: (2026)
Beyond Magic Words: Sharpness-Aware Prompt Evolving for Robust Large Language Models with TARE
by: Wan, Guancheng, et al.
Published: (2025)
by: Wan, Guancheng, et al.
Published: (2025)
Break Me If You Can: Self-Jailbreaking of Aligned LLMs via Lexical Insertion Prompting
by: Kulshreshtha, Devang, et al.
Published: (2026)
by: Kulshreshtha, Devang, et al.
Published: (2026)
Learning to Communicate: Toward End-to-End Optimization of Multi-Agent Language Systems
by: Yu, Ye, et al.
Published: (2026)
by: Yu, Ye, et al.
Published: (2026)
Now You Hear Me: Audio Narrative Attacks Against Large Audio-Language Models
by: Yu, Ye, et al.
Published: (2026)
by: Yu, Ye, et al.
Published: (2026)
SciEvo: A 2 Million, 30-Year Cross-disciplinary Dataset for Temporal Scientometric Analysis
by: Jin, Yiqiao, et al.
Published: (2024)
by: Jin, Yiqiao, et al.
Published: (2024)
PREMISE: Scalable and Strategic Prompt Optimization for Efficient Mathematical Reasoning in Large Models
by: Yu, Ye, et al.
Published: (2025)
by: Yu, Ye, et al.
Published: (2025)
Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems
by: Jin, Haibo, et al.
Published: (2026)
by: Jin, Haibo, et al.
Published: (2026)
Do Self-Evolving Agents Forget? Capability Degradation and Preservation in Lifelong LLM Agent Adaptation
by: Yu, Ye, et al.
Published: (2026)
by: Yu, Ye, et al.
Published: (2026)
MASCOT: Towards Multi-Agent Socio-Collaborative Companion Systems
by: Wang, Yiyang, et al.
Published: (2026)
by: Wang, Yiyang, et al.
Published: (2026)
SIPDO: Closed-Loop Prompt Optimization via Synthetic Data Feedback
by: Yu, Yaoning, et al.
Published: (2025)
by: Yu, Yaoning, et al.
Published: (2025)
Jailbreaking Large Language Models Against Moderation Guardrails via Cipher Characters
by: Jin, Haibo, et al.
Published: (2024)
by: Jin, Haibo, et al.
Published: (2024)
Mitigating Bias in Text Classification via Prompt-Based Text Transformation
by: Barker, Charmaine, et al.
Published: (2023)
by: Barker, Charmaine, et al.
Published: (2023)
Reasoning Can Hurt the Inductive Abilities of Large Language Models
by: Jin, Haibo, et al.
Published: (2025)
by: Jin, Haibo, et al.
Published: (2025)
GuardVal: Dynamic Large Language Model Jailbreak Evaluation for Comprehensive Safety Testing
by: Zhang, Peiyan, et al.
Published: (2025)
by: Zhang, Peiyan, et al.
Published: (2025)
Exploring the Vulnerability of the Content Moderation Guardrail in Large Language Models via Intent Manipulation
by: Zhuang, Jun, et al.
Published: (2025)
by: Zhuang, Jun, et al.
Published: (2025)
Structured Prompt Optimization for Few-Shot Text Classification via Semantic Alignment in Latent Space
by: Zheng, Jiasen, et al.
Published: (2026)
by: Zheng, Jiasen, et al.
Published: (2026)
InfoFlood: Jailbreaking Large Language Models with Information Overload
by: Yadav, Advait, et al.
Published: (2025)
by: Yadav, Advait, et al.
Published: (2025)
From Hallucinations to Jailbreaks: Rethinking the Vulnerability of Large Foundation Models
by: Jin, Haibo, et al.
Published: (2025)
by: Jin, Haibo, et al.
Published: (2025)
SimReg: Achieving Higher Performance in the Pretraining via Embedding Similarity Regularization
by: Sun, Yan, et al.
Published: (2026)
by: Sun, Yan, et al.
Published: (2026)
Learning to Conceal Risk: Controllable Multi-turn Red Teaming for LLMs in the Financial Domain
by: Cheng, Gang, et al.
Published: (2025)
by: Cheng, Gang, et al.
Published: (2025)
Prompt-tuning for Clickbait Detection via Text Summarization
by: Deng, Haoxiang, et al.
Published: (2024)
by: Deng, Haoxiang, et al.
Published: (2024)
AgentReview: Exploring Peer Review Dynamics with LLM Agents
by: Jin, Yiqiao, et al.
Published: (2024)
by: Jin, Yiqiao, et al.
Published: (2024)
GUARD: Role-playing to Generate Natural-language Jailbreakings to Test Guideline Adherence of Large Language Models
by: Jin, Haibo, et al.
Published: (2024)
by: Jin, Haibo, et al.
Published: (2024)
Prompt Stability Matters: Evaluating and Optimizing Auto-Generated Prompt in General-Purpose Systems
by: Chen, Ke, et al.
Published: (2025)
by: Chen, Ke, et al.
Published: (2025)
RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation
by: Cao, Yuefan, et al.
Published: (2025)
by: Cao, Yuefan, et al.
Published: (2025)
DISCO: DISCovering Overfittings as Causal Rules for Text Classification Models
by: Zhang, Zijian, et al.
Published: (2024)
by: Zhang, Zijian, et al.
Published: (2024)
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
by: Gao, Bingjie, et al.
Published: (2025)
by: Gao, Bingjie, et al.
Published: (2025)
DomainSum: A Hierarchical Benchmark for Fine-Grained Domain Shift in Abstractive Text Summarization
by: Yuan, Haohan, et al.
Published: (2024)
by: Yuan, Haohan, et al.
Published: (2024)
Universal Prompt Optimizer for Safe Text-to-Image Generation
by: Wu, Zongyu, et al.
Published: (2024)
by: Wu, Zongyu, et al.
Published: (2024)
Crafting Adversarial Inputs for Large Vision-Language Models Using Black-Box Optimization
by: Guan, Jiwei, et al.
Published: (2026)
by: Guan, Jiwei, et al.
Published: (2026)
Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks
by: Zhou, Andy, et al.
Published: (2024)
by: Zhou, Andy, et al.
Published: (2024)
REVOLVE: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization
by: Zhang, Peiyan, et al.
Published: (2024)
by: Zhang, Peiyan, et al.
Published: (2024)
GUARD: Guideline Upholding Test through Adaptive Role-play and Jailbreak Diagnostics for LLMs
by: Jin, Haibo, et al.
Published: (2025)
by: Jin, Haibo, et al.
Published: (2025)
SCI-Defense: Defending Manipulation Attacks from Generative Engine Optimization
by: Yu, Xucheng, et al.
Published: (2026)
by: Yu, Xucheng, et al.
Published: (2026)
Mitigating Heterogeneous Token Overfitting in LLM Knowledge Editing
by: Liu, Tianci, et al.
Published: (2025)
by: Liu, Tianci, et al.
Published: (2025)
The Master-Slave Encoder Model for Improving Patent Text Summarization: A New Approach to Combining Specifications and Claims
by: Zhou, Shu, et al.
Published: (2024)
by: Zhou, Shu, et al.
Published: (2024)
Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs
by: Puerto, Haritz, et al.
Published: (2024)
by: Puerto, Haritz, et al.
Published: (2024)
Optimizing Prompts for Text-to-Image Generation
by: Hao, Yaru, et al.
Published: (2022)
by: Hao, Yaru, et al.
Published: (2022)
Modality Bias in LVLMs: Analyzing and Mitigating Object Hallucination via Attention Lens
by: Zheng, Haohan, et al.
Published: (2025)
by: Zheng, Haohan, et al.
Published: (2025)
Similar Items
-
UniSD: Towards a Unified Self-Distillation Framework for Large Language Models
by: Jin, Yiqiao, et al.
Published: (2026) -
Beyond Magic Words: Sharpness-Aware Prompt Evolving for Robust Large Language Models with TARE
by: Wan, Guancheng, et al.
Published: (2025) -
Break Me If You Can: Self-Jailbreaking of Aligned LLMs via Lexical Insertion Prompting
by: Kulshreshtha, Devang, et al.
Published: (2026) -
Learning to Communicate: Toward End-to-End Optimization of Multi-Agent Language Systems
by: Yu, Ye, et al.
Published: (2026) -
Now You Hear Me: Audio Narrative Attacks Against Large Audio-Language Models
by: Yu, Ye, et al.
Published: (2026)