| Main Authors: | Elkins, Katherine, Chun, Jon |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.21433 |
Similar Items
Syntactic Framing Fragility: An Audit of Robustness in LLM Ethical Decisions
by: Elkins, Katherine, et al.
Published: (2025)
The Paradox of Robustness: Decoupling Rule-Based Logic from Affective Noise in High-Stakes Decision-Making
by: Chun, Jon, et al.
Published: (2026)
Informed AI Regulation: Comparing the Ethical Frameworks of Leading LLM Chatbots Using an Ethics-Based Audit to Assess Moral Reasoning and Normative Values
by: Chun, Jon, et al.
Published: (2024)
AgenticSimLaw: A Juvenile Courtroom Multi-Agent Debate Simulation for Explainable High-Stakes Tabular Decision Making
by: Chun, Jon, et al.
Published: (2026)
The AI Fiction Paradox
by: Elkins, Katherine
Published: (2026)
Comparative Global AI Regulation: Policy Perspectives from the EU, China, and the US
by: Chun, Jon, et al.
Published: (2024)
Permissive Information-Flow Analysis for Large Language Models
by: Siddiqui, Shoaib Ahmed, et al.
Published: (2024)
When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models
by: Shamsi, Zafir, et al.
Published: (2026)
Permissioned LLMs: Enforcing Access Control in Large Language Models
by: Jayaraman, Bargav, et al.
Published: (2025)
Quantifying Gender Bias in Large Language Models: When ChatGPT Becomes a Hiring Manager
by: Gerszberg, Nina, et al.
Published: (2026)
When Emotion Becomes Trigger: Emotion-style dynamic Backdoor Attack Parasitising Large Language Models
by: Liu, Ziyu, et al.
Published: (2026)
Useful Memories Become Faulty When Continuously Updated by LLMs
by: Zhang, Dylan, et al.
Published: (2026)
Confidential Guardian: Cryptographically Prohibiting the Abuse of Model Abstention
by: Rabanser, Stephan, et al.
Published: (2025)
Permissive-Washing in the Open AI Supply Chain: A Large-Scale Audit of License Integrity
by: Jewitt, James, et al.
Published: (2026)
When Individually Calibrated Models Become Collectively Miscalibrated
by: Wang, Zhaohui
Published: (2026)
Generative Adversarial Reviews: When LLMs Become the Critic
by: Bougie, Nicolas, et al.
Published: (2024)
How Teachers Can Use Large Language Models and Bloom's Taxonomy to Create Educational Quizzes
by: Elkins, Sabina, et al.
Published: (2024)
Judicial Permission
by: Governatori, Guido, et al.
Published: (2025)
When Helpfulness Becomes Sycophancy: Sycophancy is a Boundary Failure Between Social Alignment and Epistemic Integrity in Large Language Models
by: Li, Jiechen, et al.
Published: (2026)
Permissible Knowledge Pooling
by: Dong, Huimin
Published: (2024)
PromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detection
by: Camarato, Steffen J., et al.
Published: (2026)
Agentive Permissions in Multiagent Systems
by: Shi, Qi
Published: (2024)
LLMs are Capable of Misaligned Behavior Under Explicit Prohibition and Surveillance
by: Ivanov, Igor
Published: (2025)
AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe Approach
by: Amirizaniani, Maryam, et al.
Published: (2024)
When Choices Become Priors: Contrastive Decoding for Scientific Figure Multiple-Choice QA
by: Roh, Taeyun, et al.
Published: (2026)
Maximally Permissive Reward Machines
by: Varricchione, Giovanni, et al.
Published: (2024)
Auditing Disability Representation in Vision-Language Models
by: Panda, Srikant, et al.
Published: (2026)
When Chain-of-Thought Backfires: Evaluating Prompt Sensitivity in Medical Language Models
by: Sadanandan, Binesh, et al.
Published: (2026)
Should AI Become an Intergenerational Civil Right?
by: Crowcroft, Jon, et al.
Published: (2025)
When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards
by: Alzahrani, Norah, et al.
Published: (2024)
When Noise Lowers The Loss: Rethinking Likelihood-Based Evaluation in Music Large Language Models
by: Li, Xiaosha, et al.
Published: (2026)
Style Attack Disguise: When Fonts Become a Camouflage for Adversarial Intent
by: Zhang, Yangshijie, et al.
Published: (2025)
AuditWen: An Open-Source Large Language Model for Audit
by: Huang, Jiajia, et al.
Published: (2024)
When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models
by: Hou, Jiacheng, et al.
Published: (2026)
When Reasoning Traces Become Performative: Step-Level Evidence that Chain-of-Thought Is an Imperfect Oversight Channel
by: Li, Wenkai, et al.
Published: (2026)
Weak Permission is not Well-Founded, Grounded and Stable
by: Governatori, Guido
Published: (2024)
AO-DETR: Anti-Overlapping DETR for X-Ray Prohibited Items Detection
by: Li, Mingyuan, et al.
Published: (2024)
Privacy Auditing of Large Language Models
by: Panda, Ashwinee, et al.
Published: (2025)
The Relic Condition: When Published Scholarship Becomes Material for Its Own Replacement
by: Deng, Lin, et al.
Published: (2026)
From Prohibition to Adoption: How Hong Kong Universities Are Navigating ChatGPT in Academic Workflows
by: Huang, Junjun, et al.
Published: (2024)