:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Burnat, Florian A. D., Davidson, Brittany I.
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2605.06327
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

The Accountability Paradox: How Platform API Restrictions Undermine AI Transparency Mandates
by: Burnat, Florian A. D., et al.
Published: (2025)

Regulatory gray areas of LLM Terms
by: Davidson, Brittany I., et al.
Published: (2026)

Evaluating LLMs' Divergent Thinking Capabilities for Scientific Idea Generation with Minimal Context
by: Ruan, Kai, et al.
Published: (2024)

Bring Your Own Prompts: Use-Case-Specific Bias and Fairness Evaluation for LLMs
by: Bouchard, Dylan
Published: (2024)

Quotient Semivalues for False-Name-Resistant Data Attribution
by: Burnat, Florian A. D., et al.
Published: (2026)

Gaming the Metric, Not the Harm: Certifying Safety Audits against Strategic Platform Manipulation
by: Burnat, Florian A. D., et al.
Published: (2026)

A Benchmark for Strategic Auditee Gaming Under Continuous Compliance Monitoring
by: Burnat, Florian A. D., et al.
Published: (2026)

Narrative Context Protocol: An Open-Source Storytelling Framework for Generative AI
by: Gerba, Hank
Published: (2025)

Context Misleads LLMs: The Role of Context Filtering in Maintaining Safe Alignment of LLMs
by: Kim, Jinhwa, et al.
Published: (2025)

Language Statistics and False Belief Reasoning: Evidence from 41 Open-Weight LMs
by: Trott, Sean, et al.
Published: (2026)

Qworld: Question-Specific Evaluation Criteria for LLMs
by: Gao, Shanghua, et al.
Published: (2026)

Tamper-Resistant Safeguards for Open-Weight LLMs
by: Tamirisa, Rishub, et al.
Published: (2024)

Evaluating the Sensitivity of LLMs to Prior Context
by: Hankache, Robert, et al.
Published: (2025)

DETAIL Matters: Measuring the Impact of Prompt Specificity on Reasoning in Large Language Models
by: Kim, Olivia
Published: (2025)

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
by: Xu, Zhangchen, et al.
Published: (2024)

Humans and LLMs Diverge on Probabilistic Inferences
by: Kamath, Gaurav, et al.
Published: (2026)

On Evaluating LLM Alignment by Evaluating LLMs as Judges
by: Liu, Yixin, et al.
Published: (2025)

Adapting LLMs to Time Series Forecasting via Temporal Heterogeneity Modeling and Semantic Alignment
by: Sun, Yanru, et al.
Published: (2025)

Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges
by: Thakur, Aman Singh, et al.
Published: (2024)

Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language we Prompt them in
by: Agarwal, Utkarsh, et al.
Published: (2024)

Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions
by: Murugadoss, Bhuvanashree, et al.
Published: (2024)

SDA: Steering-Driven Distribution Alignment for Open LLMs without Fine-Tuning
by: Xia, Wei, et al.
Published: (2025)

OpenSanctions Pairs: Large-Scale Entity Matching with LLMs
by: Smith, Chandler, et al.
Published: (2026)

Haystack Engineering: Context Engineering for Heterogeneous and Agentic Long-Context Evaluation
by: Li, Mufei, et al.
Published: (2025)

Pair-In, Pair-Out: Latent Multi-Token Prediction for Efficient LLMs
by: Tan, Wenhui, et al.
Published: (2026)

Systematic Evaluation of Long-Context LLMs on Financial Concepts
by: Gupta, Lavanya, et al.
Published: (2024)

CompactQE: Interpretable Translation Quality Estimation via Small Open-Weight LLMs
by: Guttmann, Kamil, et al.
Published: (2026)

Beyond Prompts: Dynamic Conversational Benchmarking of Large Language Models
by: Castillo-Bolado, David, et al.
Published: (2024)

ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation
by: Zheng, Jingnan, et al.
Published: (2024)

How Far Can In-Context Alignment Go? Exploring the State of In-Context Alignment
by: Huang, Heyan, et al.
Published: (2024)

TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes
by: Shah, Raj Sanjay, et al.
Published: (2025)

Hallucination Detection in LLMs with Topological Divergence on Attention Graphs
by: Bazarova, Alexandra, et al.
Published: (2025)

Unveiling Divergent Inductive Biases of LLMs on Temporal Data
by: Kishore, Sindhu, et al.
Published: (2024)

LLM-Human Pipeline for Cultural Context Grounding of Conversations
by: Pujari, Rajkumar, et al.
Published: (2024)

Pipelined Decoder for Efficient Context-Aware Text Generation
by: Huang, Zixian, et al.
Published: (2025)

Adapting LLMs for Efficient Context Processing through Soft Prompt Compression
by: Wang, Cangqing, et al.
Published: (2024)

The Devil in the Details: Emergent Misalignment, Format and Coherence in Open-Weights LLMs
by: Dickson, Craig
Published: (2025)

Alignment Tuning for Large Language Models: A Data-Centric Lens on Alignment Data Pipelines
by: Song, Hwanjun
Published: (2026)

Beyond Textual Context: Structural Graph Encoding with Adaptive Space Alignment to alleviate the hallucination of LLMs
by: Zhang, Yifang, et al.
Published: (2025)

Diverge to Induce Prompting: Multi-Rationale Induction for Zero-Shot Reasoning
by: Chen, Po-Chun, et al.
Published: (2026)