:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Pham, Quoc Tuan, Jafari, Mehdi, Salim, Flora
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2602.06266
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Mechanistic Indicators of Steering Effectiveness in Large Language Models
by: Jafari, Mehdi, et al.
Published: (2026)

Enhancing Conversational Agents with Theory of Mind: Aligning Beliefs, Desires, and Intentions for Human-Like Interaction
by: Jafari, Mehdi, et al.
Published: (2025)

PyRQA -- Conducting Recurrence Quantification Analysis on Very Long Time Series Efficiently
by: Rawald, Tobias, et al.
Published: (2024)

Is my model perplexed for the right reason? Contrasting LLMs' Benchmark Behavior with Token-Level Perplexity
by: Prins, Zoë, et al.
Published: (2026)

Interpreting token compositionality in LLMs: A robustness analysis
by: Aljaafari, Nura, et al.
Published: (2024)

ClozeMath: Improving Mathematical Reasoning in Language Models by Learning to Fill Equations
by: Pham, Quang Hieu, et al.
Published: (2025)

What am I missing here?: Evaluating Large Language Models for Masked Sentence Prediction
by: Wyatt, Charlie, et al.
Published: (2025)

LocalRQA: From Generating Data to Locally Training, Testing, and Deploying Retrieval-Augmented QA Systems
by: Yu, Xiao, et al.
Published: (2024)

Harnessing Test-time Adaptation for NLU tasks Involving Dialects of English
by: Nguyen, Duke, et al.
Published: (2025)

Alternatives To Next Token Prediction In Text Generation -- A Survey
by: Wyatt, Charlie, et al.
Published: (2025)

Who's Who: Large Language Models Meet Knowledge Conflicts in Practice
by: Pham, Quang Hieu, et al.
Published: (2024)

Explore Theory of Mind: Program-guided adversarial data generation for theory of mind reasoning
by: Sclar, Melanie, et al.
Published: (2024)

MAPLE: Mobile App Prediction Leveraging Large Language Model Embeddings
by: Khaokaew, Yonchanok, et al.
Published: (2023)

Interpretable Next-token Prediction via the Generalized Induction Head
by: Kim, Eunji, et al.
Published: (2024)

Contextual morphologically-guided tokenization for Latin encoder models
by: Hudspeth, Marisa, et al.
Published: (2025)

Towards Reliable Medical Question Answering: Techniques and Challenges in Mitigating Hallucinations in Language Models
by: Pham, Duy Khoa, et al.
Published: (2024)

PERCORE: A Deep Learning-Based Framework for Persian Spelling Correction with Phonetic Analysis
by: Dashti, Seyed Mohammad Sadegh, et al.
Published: (2024)

CP-MoE: Consistency-Preserving Mixture-of-Experts for Continual Learning
by: Liu, Yang, et al.
Published: (2026)

Mitigating Data Scarcity in Psychological Defense Classification with Context-Aware Synthetic Augmentation
by: Vu, Hoang-Thuy-Duong, et al.
Published: (2026)

ZARA: Training-Free Motion Time-Series Reasoning via Evidence-Grounded LLM Agents
by: Li, Zechen, et al.
Published: (2025)

Do language models plan ahead for future tokens?
by: Wu, Wilson, et al.
Published: (2024)

Visualizing token importance for black-box language models
by: Rauba, Paulius, et al.
Published: (2025)

Prompt Mining for Language-based Human Mobility Forecasting
by: Xue, Hao, et al.
Published: (2024)

Collaborative decoding of critical tokens for boosting factuality of large language models
by: Jin, Lifeng, et al.
Published: (2024)

SensorLLM: Aligning Large Language Models with Motion Sensors for Human Activity Recognition
by: Li, Zechen, et al.
Published: (2024)

AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR
by: Chu, The Chuong, et al.
Published: (2025)

Distributional reasoning in LLMs: Parallel reasoning processes in multi-hop reasoning
by: Shalev, Yuval, et al.
Published: (2024)

Leveraging Sentence-oriented Augmentation and Transformer-Based Architecture for Vietnamese-Bahnaric Translation
by: Nguyen, Tan Sang, et al.
Published: (2026)

On the scaling relationship between cloze probabilities and language model next-token prediction
by: Jacobs, Cassandra L., et al.
Published: (2026)

Where is the signal in tokenization space?
by: Geh, Renato Lui, et al.
Published: (2024)

Automatic Real-word Error Correction in Persian Text
by: Dashti, Seyed Mohammad Sadegh, et al.
Published: (2024)

Byte-token Enhanced Language Models for Temporal Point Processes Analysis
by: Kong, Quyu, et al.
Published: (2025)

On the token distance modeling ability of higher RoPE attention dimension
by: Hong, Xiangyu, et al.
Published: (2024)

Why do LLMs attend to the first token?
by: Barbero, Federico, et al.
Published: (2025)

Revisiting subword tokenization: A case study on affixal negation in large language models
by: Truong, Thinh Hung, et al.
Published: (2024)

Practical token pruning for foundation models in few-shot conversational virtual assistant systems
by: Qi, Haode, et al.
Published: (2024)

Is continuous CoT better suited for multi-lingual reasoning?
by: Bashir, Ali Hamza, et al.
Published: (2026)

Language models are better than humans at next-token prediction
by: Shlegeris, Buck, et al.
Published: (2022)

RELOOP: Recursive Retrieval with Multi-Hop Reasoner and Planners for Heterogeneous QA
by: Yang, Ruiyi, et al.
Published: (2025)

Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent
by: Nusrat, Humza, et al.
Published: (2025)