| Main Author: | Shportun, Nazarii |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.30589 |
Similar Items
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)
Assessing Large Language Models on Islamic Legal Reasoning: Evidence from Inheritance Law Evaluation
by: Bouchekif, Abdessalam, et al.
Published: (2025)
SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models
by: Zhang, Yuxuan
Published: (2025)
Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation
by: Koc, Vincent
Published: (2025)
Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation
by: Lyu, Bohan, et al.
Published: (2024)
EvoIdeator: Evolving Scientific Ideas through Checklist-Grounded Reinforcement Learning
by: Sauter, Andreas, et al.
Published: (2026)
Machine Unlearning for Masked Diffusion Language Models
by: Lee, Georu, et al.
Published: (2026)
Why Models Know But Don't Say: Chain-of-Thought Faithfulness Divergence Between Thinking Tokens and Answers in Open-Weight Reasoning Models
by: Young, Richard J.
Published: (2026)
Truth as a Compression Artifact in Language Model Training
by: Krestnikov, Konstantin
Published: (2026)
Intention Collapse: Intention-Level Metrics for Reasoning in Language Models
by: Vera, Patricio
Published: (2026)
RMGAP: Benchmarking the Generalization of Reward Models across Diverse Preferences
by: Zhou, Yangyang, et al.
Published: (2026)
The Readout Shortcut: Positional Number Copying Dominates Arithmetic CoT Readout in Small Language Models
by: Liu, Ming
Published: (2026)
Beyond Hallucinations: A Composite Score for Measuring Reliability in Open-Source Large Language Models
by: Salla, Rohit Kumar, et al.
Published: (2025)
Distilling Self-Consistency into Verbal Confidence: A Pre-Registered Negative Result and Post-Hoc Rescue on Gemma 3 4B
by: Cacioli, Jon-Paul
Published: (2026)
Exemplar Retrieval Without Overhypothesis Induction: Limits of Distributional Sequence Learning in Early Word Learning
by: Cacioli, Jon-Paul
Published: (2026)
Align and Shine: Building High-Quality Sentence-Aligned Corpora for Multilingual Text Simplification
by: Hilasaca, Kenji, et al.
Published: (2026)
Whether, Not Which: Mechanistic Interpretability Reveals Dissociable Affect Reception and Emotion Categorization in LLMs
by: Keeman, Michael
Published: (2026)
The Pragmatic Persona: Discovering LLM Persona through Bridging Inference
by: Yang, Jisoo, et al.
Published: (2026)
UrduBench: An Urdu Reasoning Benchmark using Contextually Ensembled Translations with Human-in-the-Loop
by: Shafique, Muhammad Ali, et al.
Published: (2026)
Eyla: Toward an Identity-Anchored LLM Architecture with Integrated Biological Priors -- Vision, Implementation Attempt, and Lessons from AI-Assisted Development
by: Aditto, Arif
Published: (2026)
KAConvText: Novel Approach to Burmese Sentence Classification using Kolmogorov-Arnold Convolution
by: Thu, Ye Kyaw, et al.
Published: (2025)
When Persuasion Overrides Truth in Multi-Agent LLM Debates: Introducing a Confidence-Weighted Persuasion Override Rate (CW-POR)
by: Agarwal, Mahak, et al.
Published: (2025)
A Hierarchical Error Framework for Reliable Automated Coding in Communication Research: Applications to Health and Political Communication
by: Zhao, Zhilong, et al.
Published: (2025)
Can AI Read Between The Lines? Benchmarking LLMs On Financial Nuance
by: Kubica, Dominick, et al.
Published: (2025)
Causally Grounded Mechanistic Interpretability for LLMs with Faithful Natural-Language Explanations
by: Mahale, Ajay Pravin
Published: (2026)
CogCanvas: Verbatim-Grounded Artifact Extraction for Long LLM Conversations
by: An, Tao
Published: (2025)
MeMo: Towards Language Models with Associative Memory Mechanisms
by: Zanzotto, Fabio Massimo, et al.
Published: (2025)
Diagnosing and Addressing Pitfalls in KG-RAG Datasets: Toward More Reliable Benchmarking
by: Zhang, Liangliang, et al.
Published: (2025)
Emergent Lexical Semantics in Neural Language Models: Testing Martin's Law on LLM-Generated Text
by: Kugler, Kai
Published: (2025)
HyperPersona: A Multi-Level Hypergraph Framework for Text-Based Automatic Personality Prediction
by: Heydari, Sina, et al.
Published: (2026)
Council Mode: A Heterogeneous Multi-Agent Consensus Framework for Reducing LLM Hallucination and Bias
by: Wu, Shuai, et al.
Published: (2026)
Umwelt Engineering: Designing the Cognitive Worlds of Linguistic Agents
by: Jehu-Appiah, Rodney
Published: (2026)
Fuzzy, Symbolic, and Contextual: Enhancing LLM Instruction via Cognitive Scaffolding
by: Figueiredo, Vanessa
Published: (2025)
D-COT: Disciplined Chain-of-Thought Learning for Efficient Reasoning in Small Language Models
by: Ubukata, Shunsuke
Published: (2026)
Model Collapse as Cultural Evolution
by: Guo, Dongxin, et al.
Published: (2026)
Word Overuse and Alignment in Large Language Models: The Influence of Learning from Human Feedback
by: Juzek, Tom S., et al.
Published: (2025)
Prototype Transformer: Towards Language Model Architectures Interpretable by Design
by: Yordanov, Yordan, et al.
Published: (2026)
Towards Ontology-Enhanced Representation Learning for Large Language Models
by: Ronzano, Francesco, et al.
Published: (2024)
Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models
by: Zhang, Gongbo, et al.
Published: (2026)
Evaluating Large Language Models for IUCN Red List Species Information
by: Uryu, Shinya
Published: (2025)