:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Elkins, Katherine, Chun, Jon
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2601.21433
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Syntactic Framing Fragility: An Audit of Robustness in LLM Ethical Decisions
by: Elkins, Katherine, et al.
Published: (2025)

The Paradox of Robustness: Decoupling Rule-Based Logic from Affective Noise in High-Stakes Decision-Making
by: Chun, Jon, et al.
Published: (2026)

Informed AI Regulation: Comparing the Ethical Frameworks of Leading LLM Chatbots Using an Ethics-Based Audit to Assess Moral Reasoning and Normative Values
by: Chun, Jon, et al.
Published: (2024)

AgenticSimLaw: A Juvenile Courtroom Multi-Agent Debate Simulation for Explainable High-Stakes Tabular Decision Making
by: Chun, Jon, et al.
Published: (2026)

The AI Fiction Paradox
by: Elkins, Katherine
Published: (2026)

Comparative Global AI Regulation: Policy Perspectives from the EU, China, and the US
by: Chun, Jon, et al.
Published: (2024)

Permissive Information-Flow Analysis for Large Language Models
by: Siddiqui, Shoaib Ahmed, et al.
Published: (2024)

When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models
by: Shamsi, Zafir, et al.
Published: (2026)

Permissioned LLMs: Enforcing Access Control in Large Language Models
by: Jayaraman, Bargav, et al.
Published: (2025)

Quantifying Gender Bias in Large Language Models: When ChatGPT Becomes a Hiring Manager
by: Gerszberg, Nina, et al.
Published: (2026)

When Emotion Becomes Trigger: Emotion-style dynamic Backdoor Attack Parasitising Large Language Models
by: Liu, Ziyu, et al.
Published: (2026)

Useful Memories Become Faulty When Continuously Updated by LLMs
by: Zhang, Dylan, et al.
Published: (2026)

Confidential Guardian: Cryptographically Prohibiting the Abuse of Model Abstention
by: Rabanser, Stephan, et al.
Published: (2025)

Permissive-Washing in the Open AI Supply Chain: A Large-Scale Audit of License Integrity
by: Jewitt, James, et al.
Published: (2026)

When Individually Calibrated Models Become Collectively Miscalibrated
by: Wang, Zhaohui
Published: (2026)

Generative Adversarial Reviews: When LLMs Become the Critic
by: Bougie, Nicolas, et al.
Published: (2024)

How Teachers Can Use Large Language Models and Bloom's Taxonomy to Create Educational Quizzes
by: Elkins, Sabina, et al.
Published: (2024)

Judicial Permission
by: Governatori, Guido, et al.
Published: (2025)

When Helpfulness Becomes Sycophancy: Sycophancy is a Boundary Failure Between Social Alignment and Epistemic Integrity in Large Language Models
by: Li, Jiechen, et al.
Published: (2026)

Permissible Knowledge Pooling
by: Dong, Huimin
Published: (2024)

PromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detection
by: Camarato, Steffen J., et al.
Published: (2026)

Agentive Permissions in Multiagent Systems
by: Shi, Qi
Published: (2024)

LLMs are Capable of Misaligned Behavior Under Explicit Prohibition and Surveillance
by: Ivanov, Igor
Published: (2025)

AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe Approach
by: Amirizaniani, Maryam, et al.
Published: (2024)

When Choices Become Priors: Contrastive Decoding for Scientific Figure Multiple-Choice QA
by: Roh, Taeyun, et al.
Published: (2026)

Maximally Permissive Reward Machines
by: Varricchione, Giovanni, et al.
Published: (2024)

Auditing Disability Representation in Vision-Language Models
by: Panda, Srikant, et al.
Published: (2026)

When Chain-of-Thought Backfires: Evaluating Prompt Sensitivity in Medical Language Models
by: Sadanandan, Binesh, et al.
Published: (2026)

Should AI Become an Intergenerational Civil Right?
by: Crowcroft, Jon, et al.
Published: (2025)

When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards
by: Alzahrani, Norah, et al.
Published: (2024)

When Noise Lowers The Loss: Rethinking Likelihood-Based Evaluation in Music Large Language Models
by: Li, Xiaosha, et al.
Published: (2026)

Style Attack Disguise: When Fonts Become a Camouflage for Adversarial Intent
by: Zhang, Yangshijie, et al.
Published: (2025)

AuditWen:An Open-Source Large Language Model for Audit
by: Huang, Jiajia, et al.
Published: (2024)

When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models
by: Hou, Jiacheng, et al.
Published: (2026)

When Reasoning Traces Become Performative: Step-Level Evidence that Chain-of-Thought Is an Imperfect Oversight Channel
by: Li, Wenkai, et al.
Published: (2026)

Weak Permission is not Well-Founded, Grounded and Stable
by: Governatori, Guido
Published: (2024)

AO-DETR: Anti-Overlapping DETR for X-Ray Prohibited Items Detection
by: Li, Mingyuan, et al.
Published: (2024)

Privacy Auditing of Large Language Models
by: Panda, Ashwinee, et al.
Published: (2025)

The Relic Condition: When Published Scholarship Becomes Material for Its Own Replacement
by: Deng, Lin, et al.
Published: (2026)

From Prohibition to Adoption: How Hong Kong Universities Are Navigating ChatGPT in Academic Workflows
by: Huang, Junjun, et al.
Published: (2024)