:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ying, Jiahao, Cao, Yixin, Xiong, Kai, He, Yidong, Cui, Long, Liu, Yongbin
Format:	Preprint
Published:	2023
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2309.17415
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
by: Xiong, Kai, et al.
Published: (2024)

Bailicai: A Domain-Optimized Retrieval-Augmented Generation Framework for Medical Applications
by: Long, Cui, et al.
Published: (2024)

Model Utility Law: Evaluating LLMs beyond Performance through Mechanism Interpretable Metric
by: Cao, Yixin, et al.
Published: (2025)

A + B: A General Generator-Reader Framework for Optimizing LLMs to Unleash Synergy Potential
by: Tang, Wei, et al.
Published: (2024)

Beyond Semantic Similarity: Reducing Unnecessary API Calls via Behavior-Aligned Retriever
by: Chen, Yixin, et al.
Published: (2025)

Beyond Benchmarks: Understanding Mixture-of-Experts Models through Internal Mechanisms
by: Ying, Jiahao, et al.
Published: (2025)

LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement
by: Ying, Jiahao, et al.
Published: (2024)

EffiEval: Efficient and Generalizable Model Evaluation via Capability Coverage Maximization
by: Wang, Yaoning, et al.
Published: (2025)

Towards LLMs Robustness to Changes in Prompt Format Styles
by: Ngweta, Lilian, et al.
Published: (2025)

Do LLMs Signal When They're Right? Evidence from Neuron Agreement
by: Chen, Kang, et al.
Published: (2025)

From Latent Signals to Reflection Behavior: Tracing Meta-Cognitive Activation Trajectory in R1-Style LLMs
by: Du, Yanrui, et al.
Published: (2026)

RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
by: Liu, Yantao, et al.
Published: (2024)

Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via Debate
by: Xiong, Kai, et al.
Published: (2023)

Style-Compress: An LLM-Based Prompt Compression Framework Considering Task-Specific Styles
by: Pu, Xiao, et al.
Published: (2024)

Disentangling Language and Culture for Evaluating Multilingual Large Language Models
by: Ying, Jiahao, et al.
Published: (2025)

QRMeM: Unleash the Length Limitation through Question then Reflection Memory Mechanism
by: Wang, Bo, et al.
Published: (2024)

EvoWiki: Evaluating LLMs on Evolving Knowledge
by: Tang, Wei, et al.
Published: (2024)

A Survey of Test-Time Compute: From Intuitive Inference to Deliberate Reasoning
by: Ji, Yixin, et al.
Published: (2025)

Long or short CoT? Investigating Instance-level Switch of Large Reasoning Models
by: Zhang, Ruiqi, et al.
Published: (2025)

Harmful Prompt Laundering: Jailbreaking LLMs with Abductive Styles and Symbolic Encoding
by: Joo, Seongho, et al.
Published: (2025)

White Men Lead, Black Women Help? Benchmarking and Mitigating Language Agency Social Biases in LLMs
by: Wan, Yixin, et al.
Published: (2024)

Less Data Less Tokens: Multilingual Unification Learning for Efficient Test-Time Reasoning in LLMs
by: Chen, Kang, et al.
Published: (2025)

Leveraging Self-Attention for Input-Dependent Soft Prompting in LLMs
by: Muppidi, Ananth, et al.
Published: (2025)

Long Context vs. RAG for LLMs: An Evaluation and Revisits
by: Li, Xinze, et al.
Published: (2024)

LLMs Are Prone to Fallacies in Causal Inference
by: Joshi, Nitish, et al.
Published: (2024)

XFinBench: Benchmarking LLMs in Complex Financial Problem Solving and Reasoning
by: Zhang, Zhihan, et al.
Published: (2025)

The Order Effect: Investigating Prompt Sensitivity to Input Order in LLMs
by: Guan, Bryan, et al.
Published: (2025)

Investigating the Influence of Language on Sycophantic Behavior of Multilingual LLMs
by: Aldahlawi, Bayan Abdullah, et al.
Published: (2026)

Do Large Language Models Know Conflict? Investigating Parametric vs. Non-Parametric Knowledge of LLMs for Conflict Forecasting
by: Nemkova, Apollinaire Poli, et al.
Published: (2025)

SETTP: Style Extraction and Tunable Inference via Dual-level Transferable Prompt Learning
by: Jin, Chunzhen, et al.
Published: (2024)

Insight Over Sight: Exploring the Vision-Knowledge Conflicts in Multimodal LLMs
by: Liu, Xiaoyuan, et al.
Published: (2024)

Rethinking ChatGPT's Success: Usability and Cognitive Behaviors Enabled by Auto-regressive LLMs' Prompting
by: Li, Xinzhe, et al.
Published: (2024)

Style-Specific Neurons for Steering LLMs in Text Style Transfer
by: Lai, Wen, et al.
Published: (2024)

Behavior-Equivalent Token: Single-Token Replacement for Long Prompts in LLMs
by: Dong, Jiancheng, et al.
Published: (2025)

LLMs Can Achieve High-quality Simultaneous Machine Translation as Efficiently as Offline
by: Fu, Biao, et al.
Published: (2025)

StyleRec: A Benchmark Dataset for Prompt Recovery in Writing Style Transformation
by: Liu, Shenyang, et al.
Published: (2025)

Thinking Fast, Thinking Wrong: Intuitiveness Modulates LLM Counterfactual Reasoning in Policy Evaluation
by: He, Yanjie
Published: (2026)

Is Your Prompt Safe? Investigating Prompt Injection Attacks Against Open-Source LLMs
by: Wang, Jiawen, et al.
Published: (2025)

Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression
by: Li, Xinze, et al.
Published: (2024)

Meaningful Learning: Enhancing Abstract Reasoning in Large Language Models via Generic Fact Guidance
by: Xiong, Kai, et al.
Published: (2024)