:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhao, Jiahao, Dong, Liwei
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2508.08243
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Medal Matters: Probing LLMs' Failure Cases Through Olympic Rankings
by: Choi, Juhwan, et al.
Published: (2024)

Balancing Knowledge Updates: Toward Unified Modular Editing in LLMs
by: Liu, Jiahao, et al.
Published: (2025)

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
by: Ma, Xuezhe, et al.
Published: (2024)

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
by: Qin, Zhen, et al.
Published: (2024)

Probing Multimodal Large Language Models for Global and Local Semantic Representations
by: Tao, Mingxu, et al.
Published: (2024)

Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs
by: Chen, Angelica, et al.
Published: (2023)

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
by: Ren, Liliang, et al.
Published: (2024)

Contrasting with Symile: Simple Model-Agnostic Representation Learning for Unlimited Modalities
by: Saporta, Adriel, et al.
Published: (2024)

What are They Thinking? Delineation, Probing and Tracking of Concepts in LLMs
by: Abdelwahab, Mohamed, et al.
Published: (2026)

LECTOR: LLM-Enhanced Concept-based Test-Oriented Repetition for Adaptive Spaced Learning
by: Zhao, Jiahao
Published: (2025)

RAG-E: Quantifying Retriever-Generator Alignment and Failure Modes
by: Randl, Korbinian, et al.
Published: (2026)

REAL: Response Embedding-based Alignment for LLMs
by: Zhang, Honggen, et al.
Published: (2024)

Teaching LLMs to Refine with Tools
by: Yu, Dian, et al.
Published: (2024)

Learning from Failures: Understanding LLM Alignment through Failure-Aware Inverse RL
by: Patel, Nyal, et al.
Published: (2025)

ProbeLLM: Automating Principled Diagnosis of LLM Failures
by: Huang, Yue, et al.
Published: (2026)

Forewarned is Forearmed: Leveraging LLMs for Data Synthesis through Failure-Inducing Exploration
by: Li, Qintong, et al.
Published: (2024)

Cultural Palette: Pluralising Culture Alignment via Multi-agent Palette
by: Yuan, Jiahao, et al.
Published: (2024)

Alignment at Pre-training! Towards Native Alignment for Arabic LLMs
by: Liang, Juhao, et al.
Published: (2024)

PACIFIC: Can LLMs Discern the Traits Influencing Your Preferences? Evaluating Personality-Driven Preference Alignment in LLMs
by: Zhao, Tianyu, et al.
Published: (2026)

HAL: Inducing Human-likeness in LLMs with Alignment
by: Hasan, Masum, et al.
Published: (2026)

PluralLLM: Pluralistic Alignment in LLMs via Federated Learning
by: Srewa, Mahmoud, et al.
Published: (2025)

Evaluating Alignment of Behavioral Dispositions in LLMs
by: Taubenfeld, Amir, et al.
Published: (2026)

Concept Space Alignment in Multilingual LLMs
by: Peng, Qiwei, et al.
Published: (2024)

Following the Autoregressive Nature of LLM Embeddings via Compression and Alignment
by: Deng, Jingcheng, et al.
Published: (2025)

Failure Modes of LLMs for Causal Reasoning on Narratives
by: Yamin, Khurram, et al.
Published: (2024)

DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life
by: Chiu, Yu Ying, et al.
Published: (2024)

Survey-to-Behavior: Downstream Alignment of Human Values in LLMs via Survey Questions
by: Nie, Shangrui, et al.
Published: (2025)

Self-Alignment: Improving Alignment of Cultural Values in LLMs via In-Context Learning
by: Choenni, Rochelle, et al.
Published: (2024)

AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning
by: Zhong, Yiwu, et al.
Published: (2024)

Improving the Distributional Alignment of LLMs using Supervision
by: Kambhatla, Gauri, et al.
Published: (2025)

Breaking Thought Patterns: A Multi-Dimensional Reasoning Framework for LLMs
by: Tang, Xintong, et al.
Published: (2025)

Hallucination as Commitment Failure: Larger LLMs Misfire Despite Knowing the Answer
by: Yeom, Jewon, et al.
Published: (2026)

Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing
by: Zhao, Raoyuan, et al.
Published: (2025)

How Order-Sensitive Are LLMs? OrderProbe for Deterministic Structural Reconstruction
by: He, Yingjie, et al.
Published: (2026)

CoT Vectors: Transferring and Probing the Reasoning Mechanisms of LLMs
by: Li, Li, et al.
Published: (2025)

LLM Probe: Evaluating LLMs for Low-Resource Languages
by: Teklehaymanot, Hailay Kidu, et al.
Published: (2026)

A Systematic Evaluation of Preference Aggregation in Federated RLHF for Pluralistic Alignment of LLMs
by: Srewa, Mahmoud, et al.
Published: (2025)

Probing the Limits of Stylistic Alignment in Vision-Language Models
by: Farajidizaji, Asma, et al.
Published: (2025)

Alignment is Localized: A Causal Probe into Preference Layers
by: Chaudhury, Archie
Published: (2025)

Can LLMs Express Personality Across Cultures? Introducing CulturalPersonas for Evaluating Trait Alignment
by: Dey, Priyanka, et al.
Published: (2025)