| Main Authors: | Deng, Yimo, Chen, Huangxun |
|---|---|
| Format: | Preprint |
| Published: | 2023 |
| Online Access: | https://arxiv.org/abs/2312.07130 |
Similar Items
Energy-Driven Adaptive Visual Token Pruning for Efficient Vision-Language Models
by: He, Jialuo, et al.
Published: (2026)
Towards Compact and Robust DNNs via Compression-aware Sharpness Minimization
by: He, Jialuo, et al.
Published: (2026)
BlindGuard: Safeguarding LLM-based Multi-Agent Systems under Unknown Attacks
by: Miao, Rui, et al.
Published: (2025)
RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
by: Chen, Xin, et al.
Published: (2025)
StyleGuard: Preventing Text-to-Image-Model-based Style Mimicry Attacks by Style Perturbations
by: Li, Yanjie, et al.
Published: (2025)
Autonomous Algorithm Discovery for Ptychography via Evolutionary LLM Reasoning
by: Yin, Xiangyu, et al.
Published: (2026)
Harnessing LLM Agents with Skill Programs
by: Liu, Hongjun, et al.
Published: (2026)
AutoBridge: Automating Smart Device Integration with Centralized Platform
by: Liu, Siyuan, et al.
Published: (2025)
Reason2Attack: Jailbreaking Text-to-Image Models via LLM Reasoning
by: Zhang, Chenyu, et al.
Published: (2025)
PersGuard: Preventing Malicious Personalization via Backdoor Attacks on Pre-trained Text-to-Image Diffusion Models
by: Liu, Xinwei, et al.
Published: (2025)
BenchGuard: Who Guards the Benchmarks? Automated Auditing of LLM Agent Benchmarks
by: Tu, Xinming, et al.
Published: (2026)
CodeGuard: Improving LLM Guardrails in CS Education
by: Raihan, Nishat, et al.
Published: (2026)
GuardReasoner: Towards Reasoning-based LLM Safeguards
by: Liu, Yue, et al.
Published: (2025)
Harnessing LLM for Noise-Robust Cognitive Diagnosis in Web-Based Intelligent Education Systems
by: Zhang, Guixian, et al.
Published: (2025)
Adapting the Interface, Not the Model: Runtime Harness Adaptation for Deterministic LLM Agents
by: Xu, Tianshi, et al.
Published: (2026)
Distillation Traps and Guards: A Calibration Knob for LLM Distillability
by: Zhan, Weixiao, et al.
Published: (2026)
ProbGuard: Probabilistic Runtime Monitoring for LLM Agent Safety
by: Wang, Haoyu, et al.
Published: (2025)
Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents
by: Lin, Minhua, et al.
Published: (2026)
PrefixGuard: From LLM-Agent Traces to Online Failure-Warning Monitors
by: Huang, Xinmiao, et al.
Published: (2026)
AlignGuard: Scalable Safety Alignment for Text-to-Image Generation
by: Liu, Runtao, et al.
Published: (2024)
HARIVO: Harnessing Text-to-Image Models for Video Generation
by: Kwon, Mingi, et al.
Published: (2024)
Privacy Guard & Token Parsimony by Prompt and Context Handling and LLM Routing
by: Langiu, Alessio
Published: (2026)
RouteGuard: Internal-Signal Detection of Skill Poisoning in LLM Agents
by: Xiao, Wenjie, et al.
Published: (2026)
DistillGuard: Evaluating Defenses Against LLM Knowledge Distillation
by: Jiang, Bo
Published: (2026)
CourtGuard: A Model-Agnostic Framework for Zero-Shot Policy Adaptation in LLM Safety
by: Suleymanov, Umid, et al.
Published: (2026)
Efficient Detection of LLM-generated Texts with a Bayesian Surrogate Model
by: Miao, Yibo, et al.
Published: (2023)
Judge Reliability Harness: Stress Testing the Reliability of LLM Judges
by: Dev, Sunishchal, et al.
Published: (2026)
Semantic-level Backdoor Attack against Text-to-Image Diffusion Models
by: Chen, Tianxin, et al.
Published: (2026)
MergeGuard: Efficient Thwarting of Trojan Attacks in Machine Learning Models
by: Shabgahi, Soheil Zibakhsh, et al.
Published: (2025)
Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors
by: Fang, Hao, et al.
Published: (2025)
Blind Spots in the Guard: How Domain-Camouflaged Injection Attacks Evade Detection in Multi-Agent LLM Systems
by: Pai, Aaditya
Published: (2026)
FlexGuard: Continuous Risk Scoring for Strictness-Adaptive LLM Content Moderation
by: Ding, Zhihao, et al.
Published: (2026)
QGuard: Question-based Zero-shot Guard for Multi-modal LLM Safety
by: Lee, Taegyeong, et al.
Published: (2025)
ASGuard: Activation-Scaling Guard to Mitigate Targeted Jailbreaking Attack
by: Park, Yein, et al.
Published: (2025)
GroupGuard: A Framework for Modeling and Defending Collusive Attacks in Multi-Agent Systems
by: Tao, Yiling, et al.
Published: (2026)
Stop Comparing LLM Agents Without Disclosing the Harness
by: Zhang, Yunbei, et al.
Published: (2026)
Harnessing Consistency for Robust Test-Time LLM Ensemble
by: Zeng, Zhichen, et al.
Published: (2025)
CheatAgent: Attacking LLM-Empowered Recommender Systems via LLM Agent
by: Ning, Liang-bo, et al.
Published: (2025)
Gala: Global LLM Agents for Text-to-Model Translation
by: Cai, Junyang, et al.
Published: (2025)
LLM2: Let Large Language Models Harness System 2 Reasoning
by: Yang, Cheng, et al.
Published: (2024)