| Main Authors: | Soru, Tommaso, Marshall, Jim |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.04880 |
Similar Items
Large Language Models Are Not Strong Abstract Reasoners
by: Gendron, Gaël, et al.
Published: (2023)
DISCO: DISCovering Overfittings as Causal Rules for Text Classification Models
by: Zhang, Zijian, et al.
Published: (2024)
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers
by: Abramov, Roman, et al.
Published: (2025)
Can Large Language Models Learn Independent Causal Mechanisms?
by: Gendron, Gaël, et al.
Published: (2024)
NL2LOGIC: AST-Guided Translation of Natural Language into First-Order Logic with Large Language Models
by: Putra, Rizky Ramadhana, et al.
Published: (2026)
MIRAGE: Scaling Test-Time Inference with Parallel Graph-Retrieval-Augmented Reasoning Chains
by: Wei, Kaiwen, et al.
Published: (2025)
Training Language Models to Use Prolog as a Tool
by: Mellgren, Niklas, et al.
Published: (2025)
Gyan: An Explainable Neuro-Symbolic Language Model
by: Srinivasan, Venkat, et al.
Published: (2026)
By Their Fruits You Will Know Them: Comparing Formalizations of Law by the Decisions They Encode
by: Vernie, Julius, et al.
Published: (2026)
1GC-7RC: One Graphic Card -- Seven Research Challenges! How Good Are AI Agents at Doing Your Job?
by: Kampa, Robin-Nico, et al.
Published: (2026)
Robust Uncertainty Quantification for Factual Generation of Large Language Models
by: Zhang, Yuhao, et al.
Published: (2026)
SQL Query Engine: A Self-Healing LLM Pipeline for Natural Language to PostgreSQL Translation
by: Ijaz, Muhammad Adeel
Published: (2026)
Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds
by: Gendron, Gaël, et al.
Published: (2025)
Towards Leveraging Large Language Models for Automated Medical Q&A Evaluation
by: Krolik, Jack, et al.
Published: (2024)
Reinforced Language Models for Sequential Decision Making
by: Dilkes, Jim, et al.
Published: (2025)
MoDeGPT: Modular Decomposition for Large Language Model Compression
by: Lin, Chi-Heng, et al.
Published: (2024)
Understanding Syllogistic Reasoning in LLMs from Formal and Natural Language Perspectives
by: Poddar, Aheli, et al.
Published: (2025)
Advancing Natural Language Formalization to First Order Logic with Fine-tuned LLMs
by: Vossel, Felix, et al.
Published: (2025)
Future Token Prediction -- Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction
by: Walker, Nicholas
Published: (2024)
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)
Evolve: A Persistent Knowledge Lifecycle for Small Language Models
by: Hovagimian, Dikran
Published: (2026)
Advancing Explainability in Neural Machine Translation: Analytical Metrics for Attention and Alignment Consistency
by: Mishra, Anurag
Published: (2024)
Enhancing LLM Code Generation Capabilities through Test-Driven Development and Code Interpreter
by: Jalil, Sajed, et al.
Published: (2025)
Multi-Agent Systems Powered by Large Language Models: Applications in Swarm Intelligence
by: Jimenez-Romero, Cristian, et al.
Published: (2025)
Large Language Model (LLM) Bias Index -- LLMBI
by: Oketunji, Abiodun Finbarrs, et al.
Published: (2023)
Engineering A Large Language Model From Scratch
by: Oketunji, Abiodun Finbarrs
Published: (2024)
Unified Modeling Language Code Generation from Diagram Images Using Multimodal Large Language Models
by: Bates, Averi, et al.
Published: (2025)
Practical Design and Benchmarking of Generative AI Applications for Surgical Billing and Coding
by: Rollman, John C., et al.
Published: (2025)
Stratified Hazard Sampling: Minimal-Variance Event Scheduling for CTMC/DTMC Discrete Diffusion and Flow Models
by: Jang, Seunghwan, et al.
Published: (2026)
Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text
by: Oketunji, Abiodun Finbarrs
Published: (2023)
ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models
by: Feuer, Benjamin, et al.
Published: (2023)
Stroke Lesions as a Rosetta Stone for Language Model Interpretability
by: Fridriksson, Julius, et al.
Published: (2026)
Language Models Are Implicitly Continuous
by: Marro, Samuele, et al.
Published: (2025)
Sparsification and Reconstruction from the Perspective of Representation Geometry
by: Sun, Wenjie, et al.
Published: (2025)
Memory Architectures for Multi-Turn Text-to-SQL: A Benchmark and Empirical Study
by: Tummalapenta, Ravi Kumar, et al.
Published: (2026)
Language Models and Retrieval Augmented Generation for Automated Structured Data Extraction from Diagnostic Reports
by: Jabal, Mohamed Sobhi, et al.
Published: (2024)
Combining Language and Topic Models for Hierarchical Text Classification
by: Toit, Jaco du, et al.
Published: (2025)
Context-Aware Clustering using Large Language Models
by: Tipirneni, Sindhu, et al.
Published: (2024)
Memory Bank Compression for Continual Adaptation of Large Language Models
by: Katraouras, Thomas, et al.
Published: (2026)
Prototype Transformer: Towards Language Model Architectures Interpretable by Design
by: Yordanov, Yordan, et al.
Published: (2026)