:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Adarsh, Shivam, Shridhar, Kumar, Gulcehre, Caglar, Monath, Nicholas, Sachan, Mrinmaya
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2410.18574
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SMART: Self-learning Meta-strategy Agent for Reasoning Tasks
by: Liu, Rongxing, et al.
Published: (2024)

In Search for Architectures and Loss Functions in Multi-Objective Reinforcement Learning
by: Terekhov, Mikhail, et al.
Published: (2024)

Promises, Outlooks and Challenges of Diffusion Language Modeling
by: Deschenaux, Justin, et al.
Published: (2024)

Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing
by: Ozyurt, Yilmazcan, et al.
Published: (2025)

The Role of Deep Learning Regularizations on Actors in Offline RL
by: Tarasov, Denis, et al.
Published: (2024)

Probing for Arithmetic Errors in Language Models
by: Sun, Yucheng, et al.
Published: (2025)

Distilling LLMs' Decomposition Abilities into Compact Language Models
by: Tarasov, Denis, et al.
Published: (2024)

Self-rewarding correction for mathematical reasoning
by: Xiong, Wei, et al.
Published: (2025)

Uncovering Hidden Correctness in LLM Causal Reasoning via Symbolic Verification
by: He, Paul, et al.
Published: (2026)

Simulating Students or Sycophantic Problem Solving? On Misconception Faithfulness of LLM Simulators
by: Do, Heejin, et al.
Published: (2026)

Variational Classification
by: Dhuliawala, Shehzaad, et al.
Published: (2023)

How to Engage Your Readers? Generating Guiding Questions to Promote Active Reading
by: Cui, Peng, et al.
Published: (2024)

Self-Recognition in Language Models
by: Davidson, Tim R., et al.
Published: (2024)

Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
by: Deschenaux, Justin, et al.
Published: (2024)

PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer
by: Chen, Chang, et al.
Published: (2024)

Improving Large Language Model Safety with Contrastive Representation Learning
by: Simko, Samuel, et al.
Published: (2025)

Simple Hierarchical Planning with Diffusion
by: Chen, Chang, et al.
Published: (2024)

Control Tax: The Price of Keeping AI in Check
by: Terekhov, Mikhail, et al.
Published: (2025)

Fluid Representations in Reasoning Models
by: Kharlapenko, Dmitrii, et al.
Published: (2026)

Towards Aligning Language Models with Textual Feedback
by: Lloret, Saüc Abadal, et al.
Published: (2024)

Compose and Fuse: Revisiting the Foundational Bottlenecks in Multimodal Reasoning
by: Wang, Yucheng, et al.
Published: (2025)

Generating Pedagogically Meaningful Visuals for Math Word Problems: A New Benchmark and Analysis of Text-to-Image Models
by: Wang, Junling, et al.
Published: (2025)

Efficient Knowledge Distillation via Curriculum Extraction
by: Gupta, Shivam, et al.
Published: (2025)

Can Vision-Language Models Solve Visual Math Equations?
by: Choudhury, Monjoy Narayan, et al.
Published: (2025)

Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors
by: Daheim, Nico, et al.
Published: (2024)

Retro-li: Small-Scale Retrieval Augmented Generation Supporting Noisy Similarity Searches and Domain Shift Generalization
by: Rashiti, Gentiana, et al.
Published: (2024)

MathGAP: Out-of-Distribution Evaluation on Problems with Arbitrarily Complex Proofs
by: Opedal, Andreas, et al.
Published: (2024)

Multilingual Performance Biases of Large Language Models in Education
by: Gupta, Vansh, et al.
Published: (2025)

Tackling the Root of Misinformation by Teaching Laypeople about Logical Fallacies via Socratic Questioning and Critical Argumentation
by: Shi, Minjing, et al.
Published: (2026)

PRISM: Efficient Long-Range Reasoning With Short-Context LLMs
by: Jayalath, Dulhan, et al.
Published: (2024)

How Context Shapes Truth: Geometric Transformations of Statement-level Truth Representations in LLMs
by: Adarsh, Shivam, et al.
Published: (2026)

AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM Annotators
by: Ni, Jingwei, et al.
Published: (2024)

Chimera: Diagnosing Shortcut Learning in Visual-Language Understanding
by: Chi, Ziheng, et al.
Published: (2025)

On the Emergence of Induction Heads for In-Context Learning
by: Musat, Tiberiu, et al.
Published: (2025)

Post-Training Language Models for Crosslingual Consistency
by: Liu, Tianyu, et al.
Published: (2026)

Self Distillation via Iterative Constructive Perturbations
by: Dave, Maheak, et al.
Published: (2025)

From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning
by: Dinucu-Jianu, David, et al.
Published: (2025)

Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games
by: Piedrahita, David Guzman, et al.
Published: (2025)

Pointwise Mutual Information as a Performance Gauge for Retrieval-Augmented Generation
by: Liu, Tianyu, et al.
Published: (2024)

DIRAS: Efficient LLM Annotation of Document Relevance in Retrieval Augmented Generation
by: Ni, Jingwei, et al.
Published: (2024)