| Main Authors: | Aljaafari, Nura, Valentino, Marco, Freitas, André |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.25520 |
Similar Items
Interpreting token compositionality in LLMs: A robustness analysis
by: Aljaafari, Nura, et al.
Published: (2024)
TRACE: Training and Inference-Time Interpretability Analysis for Language Models
by: Aljaafari, Nura, et al.
Published: (2025)
Emergence and Localisation of Semantic Role Circuits in LLMs
by: Aljaafari, Nura, et al.
Published: (2025)
The Mechanics of Conceptual Interpretation in GPT Models: Interpretative Insights
by: Aljaafari, Nura, et al.
Published: (2024)
TRACE for Tracking the Emergence of Semantic Representations in Transformers
by: Aljaafari, Nura, et al.
Published: (2025)
CARMA: Enhanced Compositionality in LLMs via Advanced Regularisation and Mutual Information Alignment
by: Aljaafari, Nura, et al.
Published: (2025)
From Circuit Evidence to Mechanistic Theory: An Inductive Logic Approach
by: Aljaafari, Nura, et al.
Published: (2026)
Reasoning Circuits in Language Models: A Mechanistic Interpretation of Syllogistic Inference
by: Kim, Geonhee, et al.
Published: (2024)
Autoformalization in the Wild: Assessing LLMs on Real-World Mathematical Definitions
by: Zhang, Lan, et al.
Published: (2025)
A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference
by: Thayaparan, Mokanarangan, et al.
Published: (2024)
SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Clinical Trials
by: Jullien, Mael, et al.
Published: (2024)
Inferring Latent Intentions: Attributional Natural Language Inference in LLM Agents
by: Quan, Xin, et al.
Published: (2026)
Reasoning with Natural Language Explanations
by: Valentino, Marco, et al.
Published: (2024)
Decompose-and-Formalise: Recursively Verifiable Natural Language Inference
by: Quan, Xin, et al.
Published: (2026)
Inference to the Best Explanation in Large Language Models
by: Dalal, Dhairya, et al.
Published: (2024)
Integrating Expert Knowledge into Logical Programs via LLMs
by: Górski, Franciszek, et al.
Published: (2025)
MASA: LLM-Driven Multi-Agent Systems for Autoformalization
by: Zhang, Lan, et al.
Published: (2025)
Monotonic Reference-Free Refinement for Autoformalization
by: Zhang, Lan, et al.
Published: (2026)
Estimating the Causal Effects of Natural Logic Features in Neural NLI Models
by: Rozanova, Julia, et al.
Published: (2023)
Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models
by: Rozanova, Julia, et al.
Published: (2024)
Beyond Gold Standards: Epistemic Ensemble of LLM Judges for Formal Mathematical Reasoning
by: Zhang, Lan, et al.
Published: (2025)
Improving Chain-of-Thought Reasoning via Quasi-Symbolic Abstractions
by: Ranaldi, Leonardo, et al.
Published: (2025)
Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders
by: Zhang, Yingji, et al.
Published: (2024)
Controlling Equational Reasoning in Large Language Models with Prompt Interventions
by: Meadows, Jordan, et al.
Published: (2023)
Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions
by: Valentino, Marco, et al.
Published: (2023)
Eliciting Critical Reasoning in Retrieval-Augmented Language Models via Contrastive Explanations
by: Ranaldi, Leonardo, et al.
Published: (2024)
Learning to Disentangle Latent Reasoning Rules with Language VAEs: A Systematic Study
by: Zhang, Yingji, et al.
Published: (2025)
Adaptive LLM-Symbolic Reasoning via Dynamic Logical Solver Composition
by: Xu, Lei, et al.
Published: (2025)
A Symbolic Framework for Evaluating Mathematical Reasoning and Generalisation with Transformers
by: Meadows, Jordan, et al.
Published: (2023)
Dissecting Clinical Reasoning in Language Models: A Comparative Study of Prompts and Model Adaptation Strategies
by: Jullien, Mael, et al.
Published: (2025)
Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving
by: Quan, Xin, et al.
Published: (2024)
Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement
by: Quan, Xin, et al.
Published: (2024)
SylloBio-NLI: Evaluating Large Language Models on Biomedical Syllogistic Reasoning
by: Wysocka, Magdalena, et al.
Published: (2024)
Faithful and Robust LLM-Driven Theorem Proving for NLI Explanations
by: Quan, Xin, et al.
Published: (2025)
Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering
by: Valentino, Marco, et al.
Published: (2025)
PEIRCE: Unifying Material and Formal Reasoning via LLM-Driven Neuro-Symbolic Refinement
by: Quan, Xin, et al.
Published: (2025)
Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective
by: Chandna, Bhavik, et al.
Published: (2025)
Mechanistic Interpretability of Emotion Inference in Large Language Models
by: Tak, Ala N., et al.
Published: (2025)
From Syntax to Emotion: A Mechanistic Analysis of Emotion Inference in LLMs
by: Shu, Bangzhao, et al.
Published: (2026)
Understanding Multimodal LLMs: the Mechanistic Interpretability of Llava in Visual Question Answering
by: Yu, Zeping, et al.
Published: (2024)