:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Deng, Yimo, Chen, Huangxun
Format:	Preprint
Published:	2023
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2312.07130
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Energy-Driven Adaptive Visual Token Pruning for Efficient Vision-Language Models
by: He, Jialuo, et al.
Published: (2026)

Towards Compact and Robust DNNs via Compression-aware Sharpness Minimization
by: He, Jialuo, et al.
Published: (2026)

BlindGuard: Safeguarding LLM-based Multi-Agent Systems under Unknown Attacks
by: Miao, Rui, et al.
Published: (2025)

RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
by: Chen, Xin, et al.
Published: (2025)

StyleGuard: Preventing Text-to-Image-Model-based Style Mimicry Attacks by Style Perturbations
by: Li, Yanjie, et al.
Published: (2025)

Autonomous Algorithm Discovery for Ptychography via Evolutionary LLM Reasoning
by: Yin, Xiangyu, et al.
Published: (2026)

Harnessing LLM Agents with Skill Programs
by: Liu, Hongjun, et al.
Published: (2026)

AutoBridge: Automating Smart Device Integration with Centralized Platform
by: Liu, Siyuan, et al.
Published: (2025)

Reason2Attack: Jailbreaking Text-to-Image Models via LLM Reasoning
by: Zhang, Chenyu, et al.
Published: (2025)

PersGuard: Preventing Malicious Personalization via Backdoor Attacks on Pre-trained Text-to-Image Diffusion Models
by: Liu, Xinwei, et al.
Published: (2025)

BenchGuard: Who Guards the Benchmarks? Automated Auditing of LLM Agent Benchmarks
by: Tu, Xinming, et al.
Published: (2026)

CodeGuard: Improving LLM Guardrails in CS Education
by: Raihan, Nishat, et al.
Published: (2026)

GuardReasoner: Towards Reasoning-based LLM Safeguards
by: Liu, Yue, et al.
Published: (2025)

Harnessing LLM for Noise-Robust Cognitive Diagnosis in Web-Based Intelligent Education Systems
by: Zhang, Guixian, et al.
Published: (2025)

Adapting the Interface, Not the Model: Runtime Harness Adaptation for Deterministic LLM Agents
by: Xu, Tianshi, et al.
Published: (2026)

Distillation Traps and Guards: A Calibration Knob for LLM Distillability
by: Zhan, Weixiao, et al.
Published: (2026)

ProbGuard: Probabilistic Runtime Monitoring for LLM Agent Safety
by: Wang, Haoyu, et al.
Published: (2025)

Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents
by: Lin, Minhua, et al.
Published: (2026)

PrefixGuard: From LLM-Agent Traces to Online Failure-Warning Monitors
by: Huang, Xinmiao, et al.
Published: (2026)

AlignGuard: Scalable Safety Alignment for Text-to-Image Generation
by: Liu, Runtao, et al.
Published: (2024)

HARIVO: Harnessing Text-to-Image Models for Video Generation
by: Kwon, Mingi, et al.
Published: (2024)

Privacy Guard & Token Parsimony by Prompt and Context Handling and LLM Routing
by: Langiu, Alessio
Published: (2026)

RouteGuard: Internal-Signal Detection of Skill Poisoning in LLM Agents
by: Xiao, Wenjie, et al.
Published: (2026)

DistillGuard: Evaluating Defenses Against LLM Knowledge Distillation
by: Jiang, Bo
Published: (2026)

CourtGuard: A Model-Agnostic Framework for Zero-Shot Policy Adaptation in LLM Safety
by: Suleymanov, Umid, et al.
Published: (2026)

Efficient Detection of LLM-generated Texts with a Bayesian Surrogate Model
by: Miao, Yibo, et al.
Published: (2023)

Judge Reliability Harness: Stress Testing the Reliability of LLM Judges
by: Dev, Sunishchal, et al.
Published: (2026)

Semantic-level Backdoor Attack against Text-to-Image Diffusion Models
by: Chen, Tianxin, et al.
Published: (2026)

MergeGuard: Efficient Thwarting of Trojan Attacks in Machine Learning Models
by: Shabgahi, Soheil Zibakhsh, et al.
Published: (2025)

Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors
by: Fang, Hao, et al.
Published: (2025)

Blind Spots in the Guard: How Domain-Camouflaged Injection Attacks Evade Detection in Multi-Agent LLM Systems
by: Pai, Aaditya
Published: (2026)

FlexGuard: Continuous Risk Scoring for Strictness-Adaptive LLM Content Moderation
by: Ding, Zhihao, et al.
Published: (2026)

QGuard:Question-based Zero-shot Guard for Multi-modal LLM Safety
by: Lee, Taegyeong, et al.
Published: (2025)

ASGuard: Activation-Scaling Guard to Mitigate Targeted Jailbreaking Attack
by: Park, Yein, et al.
Published: (2025)

GroupGuard: A Framework for Modeling and Defending Collusive Attacks in Multi-Agent Systems
by: Tao, Yiling, et al.
Published: (2026)

Stop Comparing LLM Agents Without Disclosing the Harness
by: Zhang, Yunbei, et al.
Published: (2026)

Harnessing Consistency for Robust Test-Time LLM Ensemble
by: Zeng, Zhichen, et al.
Published: (2025)

CheatAgent: Attacking LLM-Empowered Recommender Systems via LLM Agent
by: Ning, Liang-bo, et al.
Published: (2025)

Gala: Global LLM Agents for Text-to-Model Translation
by: Cai, Junyang, et al.
Published: (2025)

LLM2: Let Large Language Models Harness System 2 Reasoning
by: Yang, Cheng, et al.
Published: (2024)