:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Jiashen, Du, Yao, Jesse, Liu, Allen, Zhang, Zhekai
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2505.08106
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
by: Wang, Hanrui, et al.
Published: (2020)

LLMs as Agentic Cooperative Players in Multiplayer UNO
by: Matinez, Yago Romano, et al.
Published: (2025)

Are complicated loss functions necessary for teaching LLMs to reason?
by: Carrino, Gabriele, et al.
Published: (2026)

LLMs as mirrors of societal moral standards: reflection of cultural divergence and agreement across ethical topics
by: Meijer, Mijntje, et al.
Published: (2024)

AI Act and Large Language Models (LLMs): When critical issues and privacy impact require human and ethical oversight
by: Fabiano, Nicola
Published: (2024)

From Blind Solvers to Logical Thinkers: Benchmarking LLMs' Logical Integrity on Faulty Mathematical Problems
by: Rahman, A M Muntasir, et al.
Published: (2024)

A Benchmark for the Detection of Metalinguistic Disagreements between LLMs and Knowledge Graphs
by: Allen, Bradley P., et al.
Published: (2025)

Investigating the structure of emotions by analyzing similarity and association of emotion words
by: Iwaki, Fumitaka, et al.
Published: (2026)

Cognitive Bias in Decision-Making with LLMs
by: Echterhoff, Jessica, et al.
Published: (2024)

LLM_annotate: A Python package for annotating and analyzing fiction characters
by: Rosenbusch, Hannes
Published: (2025)

Encyclo-K: Evaluating LLMs with Dynamically Composed Knowledge Statements
by: Liang, Yiming, et al.
Published: (2025)

Asking LLMs to Verify First is Almost Free Lunch
by: Wu, Shiguang, et al.
Published: (2025)

Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration
by: Pan, Yicheng, et al.
Published: (2025)

POLIS-Bench: Towards Multi-Dimensional Evaluation of LLMs for Bilingual Policy Tasks in Governmental Scenarios
by: Yang, Tingyue, et al.
Published: (2025)

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
by: Lin, Yujun, et al.
Published: (2024)

Mind the Ambiguity: Aleatoric Uncertainty Quantification in LLMs for Safe Medical Question Answering
by: Liu, Yaokun, et al.
Published: (2026)

Are Your LLMs Capable of Stable Reasoning?
by: Liu, Junnan, et al.
Published: (2024)

Enabling and Analyzing How to Efficiently Extract Information from Hybrid Long Documents with LLMs
by: Yue, Chongjian, et al.
Published: (2023)

Sample-Efficient Alignment for LLMs
by: Liu, Zichen, et al.
Published: (2024)

Extract Information from Hybrid Long Documents Leveraging LLMs: A Framework and Dataset
by: Yue, Chongjian, et al.
Published: (2024)

DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science
by: Shu, Fan, et al.
Published: (2026)

Graph-Augmented Relation Extraction Model with LLMs-Generated Support Document
by: Dong, Vicky, et al.
Published: (2024)

Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
by: Xiong, Kai, et al.
Published: (2024)

Understanding the Collapse of LLMs in Model Editing
by: Yang, Wanli, et al.
Published: (2024)

The Consensus Trap: Rescuing Multi-Agent LLMs from Adversarial Majorities via Token-Level Collaboration
by: Liu, Jiayuan, et al.
Published: (2026)

Kwai-STaR: Transform LLMs into State-Transition Reasoners
by: Lu, Xingyu, et al.
Published: (2024)

CHBench: A Cognitive Hierarchy Benchmark for Evaluating Strategic Reasoning Capability of LLMs
by: Liu, Hongtao, et al.
Published: (2025)

The Dual-use Dilemma in LLMs: Do Empowering Ethical Capacities Make a Degraded Utility?
by: Zhang, Yiyi, et al.
Published: (2025)

Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems
by: Zhong, Qihuang, et al.
Published: (2024)

Evaluation Ethics of LLMs in Legal Domain
by: Zhang, Ruizhe, et al.
Published: (2024)

Iterative Formalization and Planning in Partially Observable Environments
by: Gong, Liancheng, et al.
Published: (2025)

Probing the Lack of Stable Internal Beliefs in LLMs
by: Luo, Yifan, et al.
Published: (2026)

Meituan Merchant Business Diagnosis via Policy-Guided Dual-Process User Simulation
by: Chen, Ziyang, et al.
Published: (2026)

Towards a Unified View of Large Language Model Post-Training
by: Lv, Xingtai, et al.
Published: (2025)

The Diminishing Returns of Early-Exit Decoding in Modern LLMs
by: Wei, Rui, et al.
Published: (2026)

Poisoned LangChain: Jailbreak LLMs by LangChain
by: Wang, Ziqiu, et al.
Published: (2024)

Towards Compositional Generalization of LLMs via Skill Taxonomy Guided Data Synthesis
by: Wei, Yifan, et al.
Published: (2026)

PAD: Personalized Alignment of LLMs at Decoding-Time
by: Chen, Ruizhe, et al.
Published: (2024)

PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs
by: Liu, An, et al.
Published: (2024)

Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
by: Yao, Jihan, et al.
Published: (2024)