:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Galashov, Alexandre, Jones, Matt, Ke, Rosemary, Cao, Yuan, Nagarajan, Vaishnavh, Mozer, Michael C.
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.13879
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

The pitfalls of next-token prediction
by: Bachmann, Gregor, et al.
Published: (2024)

Deep sequence models tend to memorize geometrically; it is unclear why
by: Noroozizadeh, Shahriar, et al.
Published: (2025)

Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction
by: Nagarajan, Vaishnavh, et al.
Published: (2025)

Think before you speak: Training Language Models With Pause Tokens
by: Goyal, Sachin, et al.
Published: (2023)

Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings
by: Gopalakrishnan, Anand, et al.
Published: (2025)

On student-teacher deviations in distillation: does it pay to disobey?
by: Nagarajan, Vaishnavh, et al.
Published: (2023)

Analysis of Optimality of Large Language Models on Planning Problems
by: Bohnet, Bernd, et al.
Published: (2026)

Deep MMD Gradient Flow without adversarial training
by: Galashov, Alexandre, et al.
Published: (2024)

Clip Your Sequences Fairly: Enforcing Length Fairness for Sequence-Level RL
by: Mao, Hanyi, et al.
Published: (2025)

SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals
by: Yang, Ruihan, et al.
Published: (2024)

CoRefine: Confidence-Guided Self-Refinement for Adaptive Test-Time Compute
by: Jin, Chen, et al.
Published: (2026)

Think When You Need: Self-Adaptive Chain-of-Thought Learning
by: Yang, Junjie, et al.
Published: (2025)

Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena
by: Chen, Jiangjie, et al.
Published: (2023)

Catching Chameleons: Detecting Evolving Disinformation Generated using Large Language Models
by: Jiang, Bohan, et al.
Published: (2024)

SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence
by: Liu, Zhining, et al.
Published: (2025)

Self-Resource Allocation in Multi-Agent LLM Systems
by: Amayuelas, Alfonso, et al.
Published: (2025)

Self-supervised Preference Optimization: Enhance Your Language Model with Preference Degree Awareness
by: Li, Jian, et al.
Published: (2024)

Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference
by: Feng, Yuan, et al.
Published: (2024)

Efficient Self-Evaluation for Diffusion Language Models via Sequence Regeneration
by: Zhong, Linhao, et al.
Published: (2026)

Can AI Be as Creative as Humans?
by: Wang, Haonan, et al.
Published: (2024)

Attention Sinks: A 'Catch, Tag, Release' Mechanism for Embeddings
by: Zhang, Stephen, et al.
Published: (2025)

Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost
by: Belyi, Masha, et al.
Published: (2024)

Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning
by: Wang, Xinglin, et al.
Published: (2024)

Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs
by: Guo, Siyuan, et al.
Published: (2024)

SelECT-SQL: Self-correcting ensemble Chain-of-Thought for Text-to-SQL
by: Shen, Ke, et al.
Published: (2024)

Reflection Pretraining Enables Token-Level Self-Correction in Biological Sequence Models
by: Zhang, Xiang, et al.
Published: (2025)

Fast-weight Product Key Memory
by: Zhao, Tianyu, et al.
Published: (2026)

Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning
by: Li, Jiachun, et al.
Published: (2024)

Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training
by: Yang, Yanlai, et al.
Published: (2024)

Catch Me If You Can? Not Yet: LLMs Still Struggle to Imitate the Implicit Writing Styles of Everyday Authors
by: Wang, Zhengxiang, et al.
Published: (2025)

Adaptive Rectification Sampling for Test-Time Compute Scaling
by: Tan, Zhendong, et al.
Published: (2025)

Shorten After You're Right: Lazy Length Penalties for Reasoning RL
by: Yuan, Danlong, et al.
Published: (2025)

TALEC: Teach Your LLM to Evaluate in Specific Domain with In-house Criteria by Criteria Division and Zero-shot Plus Few-shot
by: Zhang, Kaiqi, et al.
Published: (2024)

Reflective Agreement: Combining Self-Mixture of Agents with a Sequence Tagger for Robust Event Extraction
by: Haji, Fatemeh, et al.
Published: (2025)

Adaptive Graph Refinement and Label Propagation with LLMs for Cost-Effective Entity Resolution
by: Wang, Hongtao, et al.
Published: (2026)

Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents
by: Shao, Shuai, et al.
Published: (2025)

DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines
by: Singhvi, Arnav, et al.
Published: (2023)

How Persuasive is Your Context?
by: Nguyen, Tu, et al.
Published: (2025)

Metaphors We Compute By: A Computational Audit of Cultural Translation vs. Thinking in LLMs
by: Chang, Yuan, et al.
Published: (2026)

SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning
by: Li, Zheng, et al.
Published: (2025)