Saved in:
| Main Authors: | Zhao, Jiahao, Dong, Liwei |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.08243 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Medal Matters: Probing LLMs' Failure Cases Through Olympic Rankings
by: Choi, Juhwan, et al.
Published: (2024)
by: Choi, Juhwan, et al.
Published: (2024)
Balancing Knowledge Updates: Toward Unified Modular Editing in LLMs
by: Liu, Jiahao, et al.
Published: (2025)
by: Liu, Jiahao, et al.
Published: (2025)
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
by: Ma, Xuezhe, et al.
Published: (2024)
by: Ma, Xuezhe, et al.
Published: (2024)
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
by: Qin, Zhen, et al.
Published: (2024)
by: Qin, Zhen, et al.
Published: (2024)
Probing Multimodal Large Language Models for Global and Local Semantic Representations
by: Tao, Mingxu, et al.
Published: (2024)
by: Tao, Mingxu, et al.
Published: (2024)
Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs
by: Chen, Angelica, et al.
Published: (2023)
by: Chen, Angelica, et al.
Published: (2023)
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
by: Ren, Liliang, et al.
Published: (2024)
by: Ren, Liliang, et al.
Published: (2024)
Contrasting with Symile: Simple Model-Agnostic Representation Learning for Unlimited Modalities
by: Saporta, Adriel, et al.
Published: (2024)
by: Saporta, Adriel, et al.
Published: (2024)
What are They Thinking? Delineation, Probing and Tracking of Concepts in LLMs
by: Abdelwahab, Mohamed, et al.
Published: (2026)
by: Abdelwahab, Mohamed, et al.
Published: (2026)
LECTOR: LLM-Enhanced Concept-based Test-Oriented Repetition for Adaptive Spaced Learning
by: Zhao, Jiahao
Published: (2025)
by: Zhao, Jiahao
Published: (2025)
RAG-E: Quantifying Retriever-Generator Alignment and Failure Modes
by: Randl, Korbinian, et al.
Published: (2026)
by: Randl, Korbinian, et al.
Published: (2026)
REAL: Response Embedding-based Alignment for LLMs
by: Zhang, Honggen, et al.
Published: (2024)
by: Zhang, Honggen, et al.
Published: (2024)
Teaching LLMs to Refine with Tools
by: Yu, Dian, et al.
Published: (2024)
by: Yu, Dian, et al.
Published: (2024)
Learning from Failures: Understanding LLM Alignment through Failure-Aware Inverse RL
by: Patel, Nyal, et al.
Published: (2025)
by: Patel, Nyal, et al.
Published: (2025)
ProbeLLM: Automating Principled Diagnosis of LLM Failures
by: Huang, Yue, et al.
Published: (2026)
by: Huang, Yue, et al.
Published: (2026)
Forewarned is Forearmed: Leveraging LLMs for Data Synthesis through Failure-Inducing Exploration
by: Li, Qintong, et al.
Published: (2024)
by: Li, Qintong, et al.
Published: (2024)
Cultural Palette: Pluralising Culture Alignment via Multi-agent Palette
by: Yuan, Jiahao, et al.
Published: (2024)
by: Yuan, Jiahao, et al.
Published: (2024)
Alignment at Pre-training! Towards Native Alignment for Arabic LLMs
by: Liang, Juhao, et al.
Published: (2024)
by: Liang, Juhao, et al.
Published: (2024)
PACIFIC: Can LLMs Discern the Traits Influencing Your Preferences? Evaluating Personality-Driven Preference Alignment in LLMs
by: Zhao, Tianyu, et al.
Published: (2026)
by: Zhao, Tianyu, et al.
Published: (2026)
HAL: Inducing Human-likeness in LLMs with Alignment
by: Hasan, Masum, et al.
Published: (2026)
by: Hasan, Masum, et al.
Published: (2026)
PluralLLM: Pluralistic Alignment in LLMs via Federated Learning
by: Srewa, Mahmoud, et al.
Published: (2025)
by: Srewa, Mahmoud, et al.
Published: (2025)
Evaluating Alignment of Behavioral Dispositions in LLMs
by: Taubenfeld, Amir, et al.
Published: (2026)
by: Taubenfeld, Amir, et al.
Published: (2026)
Concept Space Alignment in Multilingual LLMs
by: Peng, Qiwei, et al.
Published: (2024)
by: Peng, Qiwei, et al.
Published: (2024)
Following the Autoregressive Nature of LLM Embeddings via Compression and Alignment
by: Deng, Jingcheng, et al.
Published: (2025)
by: Deng, Jingcheng, et al.
Published: (2025)
Failure Modes of LLMs for Causal Reasoning on Narratives
by: Yamin, Khurram, et al.
Published: (2024)
by: Yamin, Khurram, et al.
Published: (2024)
DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life
by: Chiu, Yu Ying, et al.
Published: (2024)
by: Chiu, Yu Ying, et al.
Published: (2024)
Survey-to-Behavior: Downstream Alignment of Human Values in LLMs via Survey Questions
by: Nie, Shangrui, et al.
Published: (2025)
by: Nie, Shangrui, et al.
Published: (2025)
Self-Alignment: Improving Alignment of Cultural Values in LLMs via In-Context Learning
by: Choenni, Rochelle, et al.
Published: (2024)
by: Choenni, Rochelle, et al.
Published: (2024)
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning
by: Zhong, Yiwu, et al.
Published: (2024)
by: Zhong, Yiwu, et al.
Published: (2024)
Improving the Distributional Alignment of LLMs using Supervision
by: Kambhatla, Gauri, et al.
Published: (2025)
by: Kambhatla, Gauri, et al.
Published: (2025)
Breaking Thought Patterns: A Multi-Dimensional Reasoning Framework for LLMs
by: Tang, Xintong, et al.
Published: (2025)
by: Tang, Xintong, et al.
Published: (2025)
Hallucination as Commitment Failure: Larger LLMs Misfire Despite Knowing the Answer
by: Yeom, Jewon, et al.
Published: (2026)
by: Yeom, Jewon, et al.
Published: (2026)
Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing
by: Zhao, Raoyuan, et al.
Published: (2025)
by: Zhao, Raoyuan, et al.
Published: (2025)
How Order-Sensitive Are LLMs? OrderProbe for Deterministic Structural Reconstruction
by: He, Yingjie, et al.
Published: (2026)
by: He, Yingjie, et al.
Published: (2026)
CoT Vectors: Transferring and Probing the Reasoning Mechanisms of LLMs
by: Li, Li, et al.
Published: (2025)
by: Li, Li, et al.
Published: (2025)
LLM Probe: Evaluating LLMs for Low-Resource Languages
by: Teklehaymanot, Hailay Kidu, et al.
Published: (2026)
by: Teklehaymanot, Hailay Kidu, et al.
Published: (2026)
A Systematic Evaluation of Preference Aggregation in Federated RLHF for Pluralistic Alignment of LLMs
by: Srewa, Mahmoud, et al.
Published: (2025)
by: Srewa, Mahmoud, et al.
Published: (2025)
Probing the Limits of Stylistic Alignment in Vision-Language Models
by: Farajidizaji, Asma, et al.
Published: (2025)
by: Farajidizaji, Asma, et al.
Published: (2025)
Alignment is Localized: A Causal Probe into Preference Layers
by: Chaudhury, Archie
Published: (2025)
by: Chaudhury, Archie
Published: (2025)
Can LLMs Express Personality Across Cultures? Introducing CulturalPersonas for Evaluating Trait Alignment
by: Dey, Priyanka, et al.
Published: (2025)
by: Dey, Priyanka, et al.
Published: (2025)
Similar Items
-
Medal Matters: Probing LLMs' Failure Cases Through Olympic Rankings
by: Choi, Juhwan, et al.
Published: (2024) -
Balancing Knowledge Updates: Toward Unified Modular Editing in LLMs
by: Liu, Jiahao, et al.
Published: (2025) -
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
by: Ma, Xuezhe, et al.
Published: (2024) -
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
by: Qin, Zhen, et al.
Published: (2024) -
Probing Multimodal Large Language Models for Global and Local Semantic Representations
by: Tao, Mingxu, et al.
Published: (2024)