Saved in:
| Main Authors: | Pham, Quoc Tuan, Jafari, Mehdi, Salim, Flora |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.06266 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Mechanistic Indicators of Steering Effectiveness in Large Language Models
by: Jafari, Mehdi, et al.
Published: (2026)
by: Jafari, Mehdi, et al.
Published: (2026)
Enhancing Conversational Agents with Theory of Mind: Aligning Beliefs, Desires, and Intentions for Human-Like Interaction
by: Jafari, Mehdi, et al.
Published: (2025)
by: Jafari, Mehdi, et al.
Published: (2025)
PyRQA -- Conducting Recurrence Quantification Analysis on Very Long Time Series Efficiently
by: Rawald, Tobias, et al.
Published: (2024)
by: Rawald, Tobias, et al.
Published: (2024)
Is my model perplexed for the right reason? Contrasting LLMs' Benchmark Behavior with Token-Level Perplexity
by: Prins, Zoë, et al.
Published: (2026)
by: Prins, Zoë, et al.
Published: (2026)
Interpreting token compositionality in LLMs: A robustness analysis
by: Aljaafari, Nura, et al.
Published: (2024)
by: Aljaafari, Nura, et al.
Published: (2024)
ClozeMath: Improving Mathematical Reasoning in Language Models by Learning to Fill Equations
by: Pham, Quang Hieu, et al.
Published: (2025)
by: Pham, Quang Hieu, et al.
Published: (2025)
What am I missing here?: Evaluating Large Language Models for Masked Sentence Prediction
by: Wyatt, Charlie, et al.
Published: (2025)
by: Wyatt, Charlie, et al.
Published: (2025)
LocalRQA: From Generating Data to Locally Training, Testing, and Deploying Retrieval-Augmented QA Systems
by: Yu, Xiao, et al.
Published: (2024)
by: Yu, Xiao, et al.
Published: (2024)
Harnessing Test-time Adaptation for NLU tasks Involving Dialects of English
by: Nguyen, Duke, et al.
Published: (2025)
by: Nguyen, Duke, et al.
Published: (2025)
Alternatives To Next Token Prediction In Text Generation -- A Survey
by: Wyatt, Charlie, et al.
Published: (2025)
by: Wyatt, Charlie, et al.
Published: (2025)
Who's Who: Large Language Models Meet Knowledge Conflicts in Practice
by: Pham, Quang Hieu, et al.
Published: (2024)
by: Pham, Quang Hieu, et al.
Published: (2024)
Explore Theory of Mind: Program-guided adversarial data generation for theory of mind reasoning
by: Sclar, Melanie, et al.
Published: (2024)
by: Sclar, Melanie, et al.
Published: (2024)
MAPLE: Mobile App Prediction Leveraging Large Language Model Embeddings
by: Khaokaew, Yonchanok, et al.
Published: (2023)
by: Khaokaew, Yonchanok, et al.
Published: (2023)
Interpretable Next-token Prediction via the Generalized Induction Head
by: Kim, Eunji, et al.
Published: (2024)
by: Kim, Eunji, et al.
Published: (2024)
Contextual morphologically-guided tokenization for Latin encoder models
by: Hudspeth, Marisa, et al.
Published: (2025)
by: Hudspeth, Marisa, et al.
Published: (2025)
Towards Reliable Medical Question Answering: Techniques and Challenges in Mitigating Hallucinations in Language Models
by: Pham, Duy Khoa, et al.
Published: (2024)
by: Pham, Duy Khoa, et al.
Published: (2024)
PERCORE: A Deep Learning-Based Framework for Persian Spelling Correction with Phonetic Analysis
by: Dashti, Seyed Mohammad Sadegh, et al.
Published: (2024)
by: Dashti, Seyed Mohammad Sadegh, et al.
Published: (2024)
CP-MoE: Consistency-Preserving Mixture-of-Experts for Continual Learning
by: Liu, Yang, et al.
Published: (2026)
by: Liu, Yang, et al.
Published: (2026)
Mitigating Data Scarcity in Psychological Defense Classification with Context-Aware Synthetic Augmentation
by: Vu, Hoang-Thuy-Duong, et al.
Published: (2026)
by: Vu, Hoang-Thuy-Duong, et al.
Published: (2026)
ZARA: Training-Free Motion Time-Series Reasoning via Evidence-Grounded LLM Agents
by: Li, Zechen, et al.
Published: (2025)
by: Li, Zechen, et al.
Published: (2025)
Do language models plan ahead for future tokens?
by: Wu, Wilson, et al.
Published: (2024)
by: Wu, Wilson, et al.
Published: (2024)
Visualizing token importance for black-box language models
by: Rauba, Paulius, et al.
Published: (2025)
by: Rauba, Paulius, et al.
Published: (2025)
Prompt Mining for Language-based Human Mobility Forecasting
by: Xue, Hao, et al.
Published: (2024)
by: Xue, Hao, et al.
Published: (2024)
Collaborative decoding of critical tokens for boosting factuality of large language models
by: Jin, Lifeng, et al.
Published: (2024)
by: Jin, Lifeng, et al.
Published: (2024)
SensorLLM: Aligning Large Language Models with Motion Sensors for Human Activity Recognition
by: Li, Zechen, et al.
Published: (2024)
by: Li, Zechen, et al.
Published: (2024)
AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR
by: Chu, The Chuong, et al.
Published: (2025)
by: Chu, The Chuong, et al.
Published: (2025)
Distributional reasoning in LLMs: Parallel reasoning processes in multi-hop reasoning
by: Shalev, Yuval, et al.
Published: (2024)
by: Shalev, Yuval, et al.
Published: (2024)
Leveraging Sentence-oriented Augmentation and Transformer-Based Architecture for Vietnamese-Bahnaric Translation
by: Nguyen, Tan Sang, et al.
Published: (2026)
by: Nguyen, Tan Sang, et al.
Published: (2026)
On the scaling relationship between cloze probabilities and language model next-token prediction
by: Jacobs, Cassandra L., et al.
Published: (2026)
by: Jacobs, Cassandra L., et al.
Published: (2026)
Where is the signal in tokenization space?
by: Geh, Renato Lui, et al.
Published: (2024)
by: Geh, Renato Lui, et al.
Published: (2024)
Automatic Real-word Error Correction in Persian Text
by: Dashti, Seyed Mohammad Sadegh, et al.
Published: (2024)
by: Dashti, Seyed Mohammad Sadegh, et al.
Published: (2024)
Byte-token Enhanced Language Models for Temporal Point Processes Analysis
by: Kong, Quyu, et al.
Published: (2025)
by: Kong, Quyu, et al.
Published: (2025)
On the token distance modeling ability of higher RoPE attention dimension
by: Hong, Xiangyu, et al.
Published: (2024)
by: Hong, Xiangyu, et al.
Published: (2024)
Why do LLMs attend to the first token?
by: Barbero, Federico, et al.
Published: (2025)
by: Barbero, Federico, et al.
Published: (2025)
Revisiting subword tokenization: A case study on affixal negation in large language models
by: Truong, Thinh Hung, et al.
Published: (2024)
by: Truong, Thinh Hung, et al.
Published: (2024)
Practical token pruning for foundation models in few-shot conversational virtual assistant systems
by: Qi, Haode, et al.
Published: (2024)
by: Qi, Haode, et al.
Published: (2024)
Is continuous CoT better suited for multi-lingual reasoning?
by: Bashir, Ali Hamza, et al.
Published: (2026)
by: Bashir, Ali Hamza, et al.
Published: (2026)
Language models are better than humans at next-token prediction
by: Shlegeris, Buck, et al.
Published: (2022)
by: Shlegeris, Buck, et al.
Published: (2022)
RELOOP: Recursive Retrieval with Multi-Hop Reasoner and Planners for Heterogeneous QA
by: Yang, Ruiyi, et al.
Published: (2025)
by: Yang, Ruiyi, et al.
Published: (2025)
Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent
by: Nusrat, Humza, et al.
Published: (2025)
by: Nusrat, Humza, et al.
Published: (2025)
Similar Items
-
Mechanistic Indicators of Steering Effectiveness in Large Language Models
by: Jafari, Mehdi, et al.
Published: (2026) -
Enhancing Conversational Agents with Theory of Mind: Aligning Beliefs, Desires, and Intentions for Human-Like Interaction
by: Jafari, Mehdi, et al.
Published: (2025) -
PyRQA -- Conducting Recurrence Quantification Analysis on Very Long Time Series Efficiently
by: Rawald, Tobias, et al.
Published: (2024) -
Is my model perplexed for the right reason? Contrasting LLMs' Benchmark Behavior with Token-Level Perplexity
by: Prins, Zoë, et al.
Published: (2026) -
Interpreting token compositionality in LLMs: A robustness analysis
by: Aljaafari, Nura, et al.
Published: (2024)