Saved in:
| Main Authors: | Lombardo, Gianfranco, Trimigno, Giuseppe, Cagnoni, Stefano |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.09011 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Law of Next-Token Prediction in Large Language Models
by: He, Hangfeng, et al.
Published: (2024)
by: He, Hangfeng, et al.
Published: (2024)
Beyond Next Token Prediction: Patch-Level Training for Large Language Models
by: Shao, Chenze, et al.
Published: (2024)
by: Shao, Chenze, et al.
Published: (2024)
Next-Token Prediction Should be Ambiguity-Sensitive: A Meta-Learning Perspective
by: Gagnon, Leo, et al.
Published: (2025)
by: Gagnon, Leo, et al.
Published: (2025)
Cautious Next Token Prediction
by: Wang, Yizhou, et al.
Published: (2025)
by: Wang, Yizhou, et al.
Published: (2025)
Provable Long-Range Benefits of Next-Token Prediction
by: Cao, Xinyuan, et al.
Published: (2025)
by: Cao, Xinyuan, et al.
Published: (2025)
Beyond Next-Token Prediction: A Performance Characterization of Diffusion versus Autoregressive Language Models
by: Kim, Minseo, et al.
Published: (2025)
by: Kim, Minseo, et al.
Published: (2025)
Revealing Behavioral Plasticity in Large Language Models: A Token-Conditional Perspective
by: Mao, Liyuan, et al.
Published: (2026)
by: Mao, Liyuan, et al.
Published: (2026)
Next-Token Prediction and Regret Minimization
by: Mohri, Mehryar, et al.
Published: (2026)
by: Mohri, Mehryar, et al.
Published: (2026)
CoVeR: Conformal Calibration for Versatile and Reliable Autoregressive Next-Token Prediction
by: Chen, Yuzhu, et al.
Published: (2025)
by: Chen, Yuzhu, et al.
Published: (2025)
Mechanics of Next Token Prediction with Self-Attention
by: Li, Yingcong, et al.
Published: (2024)
by: Li, Yingcong, et al.
Published: (2024)
Reinforcement Learning with Promising Tokens for Large Language Models
by: Pang, Jing-Cheng, et al.
Published: (2026)
by: Pang, Jing-Cheng, et al.
Published: (2026)
GeoToken: Hierarchical Geolocalization of Images via Next Token Prediction
by: Ghasemi, Narges, et al.
Published: (2025)
by: Ghasemi, Narges, et al.
Published: (2025)
Geometric Dynamics of Agentic Loops in Large Language Models
by: Tacheny, Nicolas
Published: (2025)
by: Tacheny, Nicolas
Published: (2025)
A Causal World Model Underlying Next Token Prediction: Exploring GPT in a Controlled Environment
by: Rohekar, Raanan Y., et al.
Published: (2024)
by: Rohekar, Raanan Y., et al.
Published: (2024)
Dynamics of Spontaneous Topic Changes in Next Token Prediction with Self-Attention
by: Jia, Mumin, et al.
Published: (2025)
by: Jia, Mumin, et al.
Published: (2025)
Deception Abilities Emerged in Large Language Models
by: Hagendorff, Thilo
Published: (2023)
by: Hagendorff, Thilo
Published: (2023)
ProToken: Token-Level Attribution for Federated Large Language Models
by: Gill, Waris, et al.
Published: (2026)
by: Gill, Waris, et al.
Published: (2026)
Linear Representations of Political Perspective Emerge in Large Language Models
by: Kim, Junsol, et al.
Published: (2025)
by: Kim, Junsol, et al.
Published: (2025)
Manifold Trajectories in Next-Token Prediction: From Replicator Dynamics to Softmax Equilibrium
by: Lee-Jenkins, Christopher R.
Published: (2025)
by: Lee-Jenkins, Christopher R.
Published: (2025)
Counterfactual Token Generation in Large Language Models
by: Chatzi, Ivi, et al.
Published: (2024)
by: Chatzi, Ivi, et al.
Published: (2024)
Scaling Capability in Token Space: An Analysis of Large Vision Language Model
by: Li, Tenghui, et al.
Published: (2024)
by: Li, Tenghui, et al.
Published: (2024)
Large Language Models for Next Point-of-Interest Recommendation
by: Li, Peibo, et al.
Published: (2024)
by: Li, Peibo, et al.
Published: (2024)
Genomic Next-Token Predictors are In-Context Learners
by: Breslow, Nathan, et al.
Published: (2025)
by: Breslow, Nathan, et al.
Published: (2025)
Understanding and Enhancing the Planning Capability of Language Models via Multi-Token Prediction
by: Zhong, Qimin, et al.
Published: (2025)
by: Zhong, Qimin, et al.
Published: (2025)
A Probabilistic Perspective on Unlearning and Alignment for Large Language Models
by: Scholten, Yan, et al.
Published: (2024)
by: Scholten, Yan, et al.
Published: (2024)
High-Resolution Image Synthesis via Next-Token Prediction
by: Chen, Dengsheng, et al.
Published: (2024)
by: Chen, Dengsheng, et al.
Published: (2024)
Evolutionary Computation and Explainable AI: A Roadmap to Understandable Intelligent Systems
by: Zhou, Ryan, et al.
Published: (2024)
by: Zhou, Ryan, et al.
Published: (2024)
Emergent Causal-Geometric Dynamics Across Depth in Large Language Models
by: Haim, Shahar, et al.
Published: (2026)
by: Haim, Shahar, et al.
Published: (2026)
Revisiting Graph-Tokenizing Large Language Models: A Systematic Evaluation of Graph Token Understanding
by: Zhang, Zhongjian, et al.
Published: (2026)
by: Zhang, Zhongjian, et al.
Published: (2026)
On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models
by: Zhao, Chongyang, et al.
Published: (2026)
by: Zhao, Chongyang, et al.
Published: (2026)
Compositional Steering of Large Language Models with Steering Tokens
by: Radevski, Gorjan, et al.
Published: (2026)
by: Radevski, Gorjan, et al.
Published: (2026)
Alignment and Safety in Large Language Models: Safety Mechanisms, Training Paradigms, and Emerging Challenges
by: Lu, Haoran, et al.
Published: (2025)
by: Lu, Haoran, et al.
Published: (2025)
Probabilistic Token Alignment for Large Language Model Fusion
by: Zeng, Runjia, et al.
Published: (2025)
by: Zeng, Runjia, et al.
Published: (2025)
Token-Efficient Leverage Learning in Large Language Models
by: Zeng, Yuanhao, et al.
Published: (2024)
by: Zeng, Yuanhao, et al.
Published: (2024)
Geometric Analysis of Token Selection in Multi-Head Attention
by: Mudarisov, Timur, et al.
Published: (2026)
by: Mudarisov, Timur, et al.
Published: (2026)
Next Embedding Prediction Makes World Models Stronger
by: Bredis, George, et al.
Published: (2026)
by: Bredis, George, et al.
Published: (2026)
Engagement-Driven Content Generation with Large Language Models
by: Coppolillo, Erica, et al.
Published: (2024)
by: Coppolillo, Erica, et al.
Published: (2024)
Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models
by: Xue, Siqiao, et al.
Published: (2024)
by: Xue, Siqiao, et al.
Published: (2024)
Physics in Next-token Prediction
by: An, Hongjun, et al.
Published: (2024)
by: An, Hongjun, et al.
Published: (2024)
LBLLM: Lightweight Binarization of Large Language Models via Three-Stage Distillation
by: Song, Siqing, et al.
Published: (2026)
by: Song, Siqing, et al.
Published: (2026)
Similar Items
-
A Law of Next-Token Prediction in Large Language Models
by: He, Hangfeng, et al.
Published: (2024) -
Beyond Next Token Prediction: Patch-Level Training for Large Language Models
by: Shao, Chenze, et al.
Published: (2024) -
Next-Token Prediction Should be Ambiguity-Sensitive: A Meta-Learning Perspective
by: Gagnon, Leo, et al.
Published: (2025) -
Cautious Next Token Prediction
by: Wang, Yizhou, et al.
Published: (2025) -
Provable Long-Range Benefits of Next-Token Prediction
by: Cao, Xinyuan, et al.
Published: (2025)