Saved in:
| Main Authors: | Aljaafari, Nura, Carvalho, Danilo S., Freitas, André |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.11066 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Emergence and Localisation of Semantic Role Circuits in LLMs
by: Aljaafari, Nura, et al.
Published: (2025)
by: Aljaafari, Nura, et al.
Published: (2025)
Interpreting token compositionality in LLMs: A robustness analysis
by: Aljaafari, Nura, et al.
Published: (2024)
by: Aljaafari, Nura, et al.
Published: (2024)
TRACE: Training and Inference-Time Interpretability Analysis for Language Models
by: Aljaafari, Nura, et al.
Published: (2025)
by: Aljaafari, Nura, et al.
Published: (2025)
TRACE for Tracking the Emergence of Semantic Representations in Transformers
by: Aljaafari, Nura, et al.
Published: (2025)
by: Aljaafari, Nura, et al.
Published: (2025)
The Mechanics of Conceptual Interpretation in GPT Models: Interpretative Insights
by: Aljaafari, Nura, et al.
Published: (2024)
by: Aljaafari, Nura, et al.
Published: (2024)
Is Inference Mediated by Distinct Semantic Structures in LLMs? A Mechanistic Interpretation
by: Aljaafari, Nura, et al.
Published: (2026)
by: Aljaafari, Nura, et al.
Published: (2026)
From Circuit Evidence to Mechanistic Theory: An Inductive Logic Approach
by: Aljaafari, Nura, et al.
Published: (2026)
by: Aljaafari, Nura, et al.
Published: (2026)
Bridging Compositional and Distributional Semantics: A Survey on Latent Semantic Geometry via AutoEncoder
by: Zhang, Yingji, et al.
Published: (2025)
by: Zhang, Yingji, et al.
Published: (2025)
Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks
by: Zhang, Yingji, et al.
Published: (2023)
by: Zhang, Yingji, et al.
Published: (2023)
Inductive Learning of Logical Theories with LLMs: An Expressivity-Graded Analysis
by: Gandarela, João Pedro, et al.
Published: (2024)
by: Gandarela, João Pedro, et al.
Published: (2024)
Quasi-symbolic Semantic Geometry over Transformer-based Variational AutoEncoder
by: Zhang, Yingji, et al.
Published: (2022)
by: Zhang, Yingji, et al.
Published: (2022)
Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions
by: Valentino, Marco, et al.
Published: (2023)
by: Valentino, Marco, et al.
Published: (2023)
Learning to Disentangle Latent Reasoning Rules with Language VAEs: A Systematic Study
by: Zhang, Yingji, et al.
Published: (2025)
by: Zhang, Yingji, et al.
Published: (2025)
PEIRCE: Unifying Material and Formal Reasoning via LLM-Driven Neuro-Symbolic Refinement
by: Quan, Xin, et al.
Published: (2025)
by: Quan, Xin, et al.
Published: (2025)
Towards Controllable Natural Language Inference through Lexical Inference Types
by: Zhang, Yingji, et al.
Published: (2023)
by: Zhang, Yingji, et al.
Published: (2023)
LangVAE and LangSpace: Building and Probing for Language Model VAEs
by: Carvalho, Danilo S., et al.
Published: (2025)
by: Carvalho, Danilo S., et al.
Published: (2025)
Montague semantics and modifier consistency measurement in neural language models
by: Carvalho, Danilo S., et al.
Published: (2022)
by: Carvalho, Danilo S., et al.
Published: (2022)
SylloBio-NLI: Evaluating Large Language Models on Biomedical Syllogistic Reasoning
by: Wysocka, Magdalena, et al.
Published: (2024)
by: Wysocka, Magdalena, et al.
Published: (2024)
Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders
by: Zhang, Yingji, et al.
Published: (2024)
by: Zhang, Yingji, et al.
Published: (2024)
CARMA: Comprehensive Automatically-annotated Reddit Mental Health Dataset for Arabic
by: Mankarious, Saad, et al.
Published: (2025)
by: Mankarious, Saad, et al.
Published: (2025)
Accelerating Antibiotic Discovery with Large Language Models and Knowledge Graphs
by: Delmas, Maxime, et al.
Published: (2025)
by: Delmas, Maxime, et al.
Published: (2025)
An Exploration of Self-Supervised Mutual Information Alignment for Multi-Task Settings
by: Govande, Soham V.
Published: (2024)
by: Govande, Soham V.
Published: (2024)
Adaptive LLM-Symbolic Reasoning via Dynamic Logical Solver Composition
by: Xu, Lei, et al.
Published: (2025)
by: Xu, Lei, et al.
Published: (2025)
Explanation Regularisation through the Lens of Attributions
by: Ferreira, Pedro, et al.
Published: (2024)
by: Ferreira, Pedro, et al.
Published: (2024)
Autoformalization in the Wild: Assessing LLMs on Real-World Mathematical Definitions
by: Zhang, Lan, et al.
Published: (2025)
by: Zhang, Lan, et al.
Published: (2025)
Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders
by: Wu, Xuansheng, et al.
Published: (2025)
by: Wu, Xuansheng, et al.
Published: (2025)
MAIN: Mutual Alignment Is Necessary for instruction tuning
by: Yang, Fanyi, et al.
Published: (2025)
by: Yang, Fanyi, et al.
Published: (2025)
Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels
by: Fränken, Jan-Philipp, et al.
Published: (2024)
by: Fränken, Jan-Philipp, et al.
Published: (2024)
Rethinking the Understanding Ability across LLMs through Mutual Information
by: Wang, Shaojie, et al.
Published: (2025)
by: Wang, Shaojie, et al.
Published: (2025)
Integrating Expert Knowledge into Logical Programs via LLMs
by: Górski, Franciszek, et al.
Published: (2025)
by: Górski, Franciszek, et al.
Published: (2025)
An LLM-based Knowledge Synthesis and Scientific Reasoning Framework for Biomedical Discovery
by: Wysocki, Oskar, et al.
Published: (2024)
by: Wysocki, Oskar, et al.
Published: (2024)
NeuroMax: Enhancing Neural Topic Modeling via Maximizing Mutual Information and Group Topic Regularization
by: Pham, Duy-Tung, et al.
Published: (2024)
by: Pham, Duy-Tung, et al.
Published: (2024)
Training LLMs Beyond Next Token Prediction -- Filling the Mutual Information Gap
by: Yang, Chun-Hao, et al.
Published: (2025)
by: Yang, Chun-Hao, et al.
Published: (2025)
PromptNCE: Pointwise Mutual Information Predictions Using Only LLMs and Contrastive Estimation Prompts
by: Woodrow, Juliette, et al.
Published: (2026)
by: Woodrow, Juliette, et al.
Published: (2026)
EACO: Enhancing Alignment in Multimodal LLMs via Critical Observation
by: Wang, Yongxin, et al.
Published: (2024)
by: Wang, Yongxin, et al.
Published: (2024)
Self-Alignment: Improving Alignment of Cultural Values in LLMs via In-Context Learning
by: Choenni, Rochelle, et al.
Published: (2024)
by: Choenni, Rochelle, et al.
Published: (2024)
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
by: Qi, Zhenting, et al.
Published: (2024)
by: Qi, Zhenting, et al.
Published: (2024)
Improving Chain-of-Thought Reasoning via Quasi-Symbolic Abstractions
by: Ranaldi, Leonardo, et al.
Published: (2025)
by: Ranaldi, Leonardo, et al.
Published: (2025)
TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs
by: Gu, Yuxuan, et al.
Published: (2025)
by: Gu, Yuxuan, et al.
Published: (2025)
Panacea: Pareto Alignment via Preference Adaptation for LLMs
by: Zhong, Yifan, et al.
Published: (2024)
by: Zhong, Yifan, et al.
Published: (2024)
Similar Items
-
Emergence and Localisation of Semantic Role Circuits in LLMs
by: Aljaafari, Nura, et al.
Published: (2025) -
Interpreting token compositionality in LLMs: A robustness analysis
by: Aljaafari, Nura, et al.
Published: (2024) -
TRACE: Training and Inference-Time Interpretability Analysis for Language Models
by: Aljaafari, Nura, et al.
Published: (2025) -
TRACE for Tracking the Emergence of Semantic Representations in Transformers
by: Aljaafari, Nura, et al.
Published: (2025) -
The Mechanics of Conceptual Interpretation in GPT Models: Interpretative Insights
by: Aljaafari, Nura, et al.
Published: (2024)