Saved in:
| Main Authors: | Ying, Jiahao, Cao, Yixin, Xiong, Kai, He, Yidong, Cui, Long, Liu, Yongbin |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2309.17415 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
by: Xiong, Kai, et al.
Published: (2024)
by: Xiong, Kai, et al.
Published: (2024)
Bailicai: A Domain-Optimized Retrieval-Augmented Generation Framework for Medical Applications
by: Long, Cui, et al.
Published: (2024)
by: Long, Cui, et al.
Published: (2024)
Model Utility Law: Evaluating LLMs beyond Performance through Mechanism Interpretable Metric
by: Cao, Yixin, et al.
Published: (2025)
by: Cao, Yixin, et al.
Published: (2025)
A + B: A General Generator-Reader Framework for Optimizing LLMs to Unleash Synergy Potential
by: Tang, Wei, et al.
Published: (2024)
by: Tang, Wei, et al.
Published: (2024)
Beyond Semantic Similarity: Reducing Unnecessary API Calls via Behavior-Aligned Retriever
by: Chen, Yixin, et al.
Published: (2025)
by: Chen, Yixin, et al.
Published: (2025)
Beyond Benchmarks: Understanding Mixture-of-Experts Models through Internal Mechanisms
by: Ying, Jiahao, et al.
Published: (2025)
by: Ying, Jiahao, et al.
Published: (2025)
LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement
by: Ying, Jiahao, et al.
Published: (2024)
by: Ying, Jiahao, et al.
Published: (2024)
EffiEval: Efficient and Generalizable Model Evaluation via Capability Coverage Maximization
by: Wang, Yaoning, et al.
Published: (2025)
by: Wang, Yaoning, et al.
Published: (2025)
Towards LLMs Robustness to Changes in Prompt Format Styles
by: Ngweta, Lilian, et al.
Published: (2025)
by: Ngweta, Lilian, et al.
Published: (2025)
Do LLMs Signal When They're Right? Evidence from Neuron Agreement
by: Chen, Kang, et al.
Published: (2025)
by: Chen, Kang, et al.
Published: (2025)
From Latent Signals to Reflection Behavior: Tracing Meta-Cognitive Activation Trajectory in R1-Style LLMs
by: Du, Yanrui, et al.
Published: (2026)
by: Du, Yanrui, et al.
Published: (2026)
RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
by: Liu, Yantao, et al.
Published: (2024)
by: Liu, Yantao, et al.
Published: (2024)
Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via Debate
by: Xiong, Kai, et al.
Published: (2023)
by: Xiong, Kai, et al.
Published: (2023)
Style-Compress: An LLM-Based Prompt Compression Framework Considering Task-Specific Styles
by: Pu, Xiao, et al.
Published: (2024)
by: Pu, Xiao, et al.
Published: (2024)
Disentangling Language and Culture for Evaluating Multilingual Large Language Models
by: Ying, Jiahao, et al.
Published: (2025)
by: Ying, Jiahao, et al.
Published: (2025)
QRMeM: Unleash the Length Limitation through Question then Reflection Memory Mechanism
by: Wang, Bo, et al.
Published: (2024)
by: Wang, Bo, et al.
Published: (2024)
EvoWiki: Evaluating LLMs on Evolving Knowledge
by: Tang, Wei, et al.
Published: (2024)
by: Tang, Wei, et al.
Published: (2024)
A Survey of Test-Time Compute: From Intuitive Inference to Deliberate Reasoning
by: Ji, Yixin, et al.
Published: (2025)
by: Ji, Yixin, et al.
Published: (2025)
Long or short CoT? Investigating Instance-level Switch of Large Reasoning Models
by: Zhang, Ruiqi, et al.
Published: (2025)
by: Zhang, Ruiqi, et al.
Published: (2025)
Harmful Prompt Laundering: Jailbreaking LLMs with Abductive Styles and Symbolic Encoding
by: Joo, Seongho, et al.
Published: (2025)
by: Joo, Seongho, et al.
Published: (2025)
White Men Lead, Black Women Help? Benchmarking and Mitigating Language Agency Social Biases in LLMs
by: Wan, Yixin, et al.
Published: (2024)
by: Wan, Yixin, et al.
Published: (2024)
Less Data Less Tokens: Multilingual Unification Learning for Efficient Test-Time Reasoning in LLMs
by: Chen, Kang, et al.
Published: (2025)
by: Chen, Kang, et al.
Published: (2025)
Leveraging Self-Attention for Input-Dependent Soft Prompting in LLMs
by: Muppidi, Ananth, et al.
Published: (2025)
by: Muppidi, Ananth, et al.
Published: (2025)
Long Context vs. RAG for LLMs: An Evaluation and Revisits
by: Li, Xinze, et al.
Published: (2024)
by: Li, Xinze, et al.
Published: (2024)
LLMs Are Prone to Fallacies in Causal Inference
by: Joshi, Nitish, et al.
Published: (2024)
by: Joshi, Nitish, et al.
Published: (2024)
XFinBench: Benchmarking LLMs in Complex Financial Problem Solving and Reasoning
by: Zhang, Zhihan, et al.
Published: (2025)
by: Zhang, Zhihan, et al.
Published: (2025)
The Order Effect: Investigating Prompt Sensitivity to Input Order in LLMs
by: Guan, Bryan, et al.
Published: (2025)
by: Guan, Bryan, et al.
Published: (2025)
Investigating the Influence of Language on Sycophantic Behavior of Multilingual LLMs
by: Aldahlawi, Bayan Abdullah, et al.
Published: (2026)
by: Aldahlawi, Bayan Abdullah, et al.
Published: (2026)
Do Large Language Models Know Conflict? Investigating Parametric vs. Non-Parametric Knowledge of LLMs for Conflict Forecasting
by: Nemkova, Apollinaire Poli, et al.
Published: (2025)
by: Nemkova, Apollinaire Poli, et al.
Published: (2025)
SETTP: Style Extraction and Tunable Inference via Dual-level Transferable Prompt Learning
by: Jin, Chunzhen, et al.
Published: (2024)
by: Jin, Chunzhen, et al.
Published: (2024)
Insight Over Sight: Exploring the Vision-Knowledge Conflicts in Multimodal LLMs
by: Liu, Xiaoyuan, et al.
Published: (2024)
by: Liu, Xiaoyuan, et al.
Published: (2024)
Rethinking ChatGPT's Success: Usability and Cognitive Behaviors Enabled by Auto-regressive LLMs' Prompting
by: Li, Xinzhe, et al.
Published: (2024)
by: Li, Xinzhe, et al.
Published: (2024)
Style-Specific Neurons for Steering LLMs in Text Style Transfer
by: Lai, Wen, et al.
Published: (2024)
by: Lai, Wen, et al.
Published: (2024)
Behavior-Equivalent Token: Single-Token Replacement for Long Prompts in LLMs
by: Dong, Jiancheng, et al.
Published: (2025)
by: Dong, Jiancheng, et al.
Published: (2025)
LLMs Can Achieve High-quality Simultaneous Machine Translation as Efficiently as Offline
by: Fu, Biao, et al.
Published: (2025)
by: Fu, Biao, et al.
Published: (2025)
StyleRec: A Benchmark Dataset for Prompt Recovery in Writing Style Transformation
by: Liu, Shenyang, et al.
Published: (2025)
by: Liu, Shenyang, et al.
Published: (2025)
Thinking Fast, Thinking Wrong: Intuitiveness Modulates LLM Counterfactual Reasoning in Policy Evaluation
by: He, Yanjie
Published: (2026)
by: He, Yanjie
Published: (2026)
Is Your Prompt Safe? Investigating Prompt Injection Attacks Against Open-Source LLMs
by: Wang, Jiawen, et al.
Published: (2025)
by: Wang, Jiawen, et al.
Published: (2025)
Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression
by: Li, Xinze, et al.
Published: (2024)
by: Li, Xinze, et al.
Published: (2024)
Meaningful Learning: Enhancing Abstract Reasoning in Large Language Models via Generic Fact Guidance
by: Xiong, Kai, et al.
Published: (2024)
by: Xiong, Kai, et al.
Published: (2024)
Similar Items
-
Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
by: Xiong, Kai, et al.
Published: (2024) -
Bailicai: A Domain-Optimized Retrieval-Augmented Generation Framework for Medical Applications
by: Long, Cui, et al.
Published: (2024) -
Model Utility Law: Evaluating LLMs beyond Performance through Mechanism Interpretable Metric
by: Cao, Yixin, et al.
Published: (2025) -
A + B: A General Generator-Reader Framework for Optimizing LLMs to Unleash Synergy Potential
by: Tang, Wei, et al.
Published: (2024) -
Beyond Semantic Similarity: Reducing Unnecessary API Calls via Behavior-Aligned Retriever
by: Chen, Yixin, et al.
Published: (2025)