:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Berlin, Konstantin, Swanda, Adam
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.24247
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

LLM Cyber Evaluations Don't Capture Real-World Risk
by: Lukošiūtė, Kamilė, et al.
Published: (2025)

Evaluating the role of `Constitutions' for learning from AI feedback
by: Redgate, Saskia, et al.
Published: (2024)

C3AI: Crafting and Evaluating Constitutions for Constitutional AI
by: Kyrychenko, Yara, et al.
Published: (2025)

A Framework for Rapidly Developing and Deploying Protection Against Large Language Model Attacks
by: Swanda, Adam, et al.
Published: (2025)

Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions
by: Mohammadi, Seyedali, et al.
Published: (2025)

Self-Training Meets Consistency: Improving LLMs' Reasoning with Consistency-Driven Rationale Evaluation
by: Lee, Jaehyeok, et al.
Published: (2024)

Inverse Constitutional AI: Compressing Preferences into Principles
by: Findeis, Arduin, et al.
Published: (2024)

Less is More for Improving Automatic Evaluation of Factual Consistency
by: Wang, Tong, et al.
Published: (2024)

Contradiction Detection in RAG Systems: Evaluating LLMs as Context Validators for Improved Information Consistency
by: Gokul, Vignesh, et al.
Published: (2025)

Improving Score Reliability of Multiple Choice Benchmarks with Consistency Evaluation and Altered Answer Choices
by: Cavalin, Paulo, et al.
Published: (2025)

Confidence Improves Self-Consistency in LLMs
by: Taubenfeld, Amir, et al.
Published: (2025)

Document-Level Event Extraction with Definition-Driven ICL
by: Liu, Zhuoyuan, et al.
Published: (2024)

Open Character Training: Shaping the Persona of AI Assistants through Constitutional AI
by: Maiya, Sharan, et al.
Published: (2025)

Reverse Constitutional AI: A Framework for Controllable Toxic Data Generation via Probability-Clamped RLAIF
by: Fang, Yuan, et al.
Published: (2026)

The Veln(ia)s is in the Details: Evaluating LLM Judgment on Latvian and Lithuanian Short Answer Matching
by: Kostiuk, Yevhen, et al.
Published: (2025)

FAIRE: Assessing Racial and Gender Bias in AI-Driven Resume Evaluations
by: Wen, Athena, et al.
Published: (2025)

Aligning with Logic: Measuring, Evaluating and Improving Logical Preference Consistency in Large Language Models
by: Liu, Yinhong, et al.
Published: (2024)

Semantic Role Labeling of NomBank Partitives
by: Meyers, Adam, et al.
Published: (2024)

DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models
by: Cui, Wendi, et al.
Published: (2024)

Optimizing Automatic Speech Assessment: W-RankSim Regularization and Hybrid Feature Fusion Strategies
by: Wu, Chung-Wen, et al.
Published: (2024)

WorkRB: A Community-Driven Evaluation Framework for AI in the Work Domain
by: De Lange, Matthias, et al.
Published: (2026)

Evaluating Role-Consistency in LLMs for Counselor Training
by: Rudolph, Eric, et al.
Published: (2026)

Select, Label, Evaluate: Active Testing in NLP
by: Purificato, Antonio, et al.
Published: (2026)

Improving Multi-turn Dialogue Consistency with Self-Recall Thinking
by: Pang, Renning, et al.
Published: (2026)

Improving Factual Consistency of News Summarization by Contrastive Preference Optimization
by: Feng, Huawen, et al.
Published: (2023)

Reducing Political Manipulation with Consistency Training
by: Phan, Long, et al.
Published: (2026)

Improving Task Diversity in Label Efficient Supervised Finetuning of LLMs
by: Arabelly, Abhinav, et al.
Published: (2025)

Does Claude's Constitution Have a Culture?
by: Pourdavood, Parham
Published: (2026)

Epistemic Constitutionalism Or: how to avoid coherence bias
by: Loi, Michele
Published: (2026)

Evaluating Consistency and Reasoning Capabilities of Large Language Models
by: Saxena, Yash, et al.
Published: (2024)

Improving Neural Topic Modeling with Semantically-Grounded Soft Label Distributions
by: Li, Raymond, et al.
Published: (2026)

Improving Non-autoregressive Machine Translation with Error Exposure and Consistency Regularization
by: Chen, Xinran, et al.
Published: (2024)

ConstitutionalExperts: Training a Mixture of Principle-based Prompts
by: Petridis, Savvas, et al.
Published: (2024)

Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification
by: Kumar, Adarsh, et al.
Published: (2025)

AXCEL: Automated eXplainable Consistency Evaluation using LLMs
by: Sreekar, P Aditya, et al.
Published: (2024)

ConsistencyChecker: Tree-based Evaluation of LLM Generalization Capabilities
by: Hong, Zhaochen, et al.
Published: (2025)

SaGE: Evaluating Moral Consistency in Large Language Models
by: Bonagiri, Vamshi Krishna, et al.
Published: (2024)

Beyond Modality Limitations: A Unified MLLM Approach to Automated Speaking Assessment with Effective Curriculum Learning
by: Fang, Yu-Hsuan, et al.
Published: (2025)

Enhancing Consistency of Werewolf AI through Dialogue Summarization and Persona Information
by: Tanaka, Yoshiki, et al.
Published: (2026)

Enhancing AI-Driven Education: Integrating Cognitive Frameworks, Linguistic Feedback Analysis, and Ethical Considerations for Improved Content Generation
by: Yaacoub, Antoun, et al.
Published: (2025)