| Main Authors: | Jiashen, Du, Yao, Jesse, Liu, Allen, Zhang, Zhekai |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.08106 |
Similar Items
SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
by: Wang, Hanrui, et al.
Published: (2020)
LLMs as Agentic Cooperative Players in Multiplayer UNO
by: Matinez, Yago Romano, et al.
Published: (2025)
Are complicated loss functions necessary for teaching LLMs to reason?
by: Carrino, Gabriele, et al.
Published: (2026)
LLMs as mirrors of societal moral standards: reflection of cultural divergence and agreement across ethical topics
by: Meijer, Mijntje, et al.
Published: (2024)
AI Act and Large Language Models (LLMs): When critical issues and privacy impact require human and ethical oversight
by: Fabiano, Nicola
Published: (2024)
From Blind Solvers to Logical Thinkers: Benchmarking LLMs' Logical Integrity on Faulty Mathematical Problems
by: Rahman, A M Muntasir, et al.
Published: (2024)
A Benchmark for the Detection of Metalinguistic Disagreements between LLMs and Knowledge Graphs
by: Allen, Bradley P., et al.
Published: (2025)
Investigating the structure of emotions by analyzing similarity and association of emotion words
by: Iwaki, Fumitaka, et al.
Published: (2026)
Cognitive Bias in Decision-Making with LLMs
by: Echterhoff, Jessica, et al.
Published: (2024)
LLM_annotate: A Python package for annotating and analyzing fiction characters
by: Rosenbusch, Hannes
Published: (2025)
Encyclo-K: Evaluating LLMs with Dynamically Composed Knowledge Statements
by: Liang, Yiming, et al.
Published: (2025)
Asking LLMs to Verify First is Almost Free Lunch
by: Wu, Shiguang, et al.
Published: (2025)
Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration
by: Pan, Yicheng, et al.
Published: (2025)
POLIS-Bench: Towards Multi-Dimensional Evaluation of LLMs for Bilingual Policy Tasks in Governmental Scenarios
by: Yang, Tingyue, et al.
Published: (2025)
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
by: Lin, Yujun, et al.
Published: (2024)
Mind the Ambiguity: Aleatoric Uncertainty Quantification in LLMs for Safe Medical Question Answering
by: Liu, Yaokun, et al.
Published: (2026)
Are Your LLMs Capable of Stable Reasoning?
by: Liu, Junnan, et al.
Published: (2024)
Enabling and Analyzing How to Efficiently Extract Information from Hybrid Long Documents with LLMs
by: Yue, Chongjian, et al.
Published: (2023)
Sample-Efficient Alignment for LLMs
by: Liu, Zichen, et al.
Published: (2024)
Extract Information from Hybrid Long Documents Leveraging LLMs: A Framework and Dataset
by: Yue, Chongjian, et al.
Published: (2024)
DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science
by: Shu, Fan, et al.
Published: (2026)
Graph-Augmented Relation Extraction Model with LLMs-Generated Support Document
by: Dong, Vicky, et al.
Published: (2024)
Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
by: Xiong, Kai, et al.
Published: (2024)
Understanding the Collapse of LLMs in Model Editing
by: Yang, Wanli, et al.
Published: (2024)
The Consensus Trap: Rescuing Multi-Agent LLMs from Adversarial Majorities via Token-Level Collaboration
by: Liu, Jiayuan, et al.
Published: (2026)
Kwai-STaR: Transform LLMs into State-Transition Reasoners
by: Lu, Xingyu, et al.
Published: (2024)
CHBench: A Cognitive Hierarchy Benchmark for Evaluating Strategic Reasoning Capability of LLMs
by: Liu, Hongtao, et al.
Published: (2025)
The Dual-use Dilemma in LLMs: Do Empowering Ethical Capacities Make a Degraded Utility?
by: Zhang, Yiyi, et al.
Published: (2025)
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems
by: Zhong, Qihuang, et al.
Published: (2024)
Evaluation Ethics of LLMs in Legal Domain
by: Zhang, Ruizhe, et al.
Published: (2024)
Iterative Formalization and Planning in Partially Observable Environments
by: Gong, Liancheng, et al.
Published: (2025)
Probing the Lack of Stable Internal Beliefs in LLMs
by: Luo, Yifan, et al.
Published: (2026)
Meituan Merchant Business Diagnosis via Policy-Guided Dual-Process User Simulation
by: Chen, Ziyang, et al.
Published: (2026)
Towards a Unified View of Large Language Model Post-Training
by: Lv, Xingtai, et al.
Published: (2025)
The Diminishing Returns of Early-Exit Decoding in Modern LLMs
by: Wei, Rui, et al.
Published: (2026)
Poisoned LangChain: Jailbreak LLMs by LangChain
by: Wang, Ziqiu, et al.
Published: (2024)
Towards Compositional Generalization of LLMs via Skill Taxonomy Guided Data Synthesis
by: Wei, Yifan, et al.
Published: (2026)
PAD: Personalized Alignment of LLMs at Decoding-Time
by: Chen, Ruizhe, et al.
Published: (2024)
PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs
by: Liu, An, et al.
Published: (2024)
Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
by: Yao, Jihan, et al.
Published: (2024)