Saved in:
| Main Authors: | Tao, Shuchang, Yao, Liuyi, Ding, Hanxing, Xie, Yuexiang, Cao, Qi, Sun, Fei, Gao, Jinyang, Shen, Huawei, Ding, Bolin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.17287 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models
by: Ding, Hanxing, et al.
Published: (2025)
by: Ding, Hanxing, et al.
Published: (2025)
On the Diminishing Returns of Complex Robust RAG Training in the Era of Powerful LLMs
by: Ding, Hanxing, et al.
Published: (2025)
by: Ding, Hanxing, et al.
Published: (2025)
Rowen: Adaptive Retrieval-Augmented Generation for Hallucination Mitigation in LLMs
by: Ding, Hanxing, et al.
Published: (2024)
by: Ding, Hanxing, et al.
Published: (2024)
Enhancing Tool Learning in Large Language Models with Hierarchical Error Checklists
by: Cui, Yue, et al.
Published: (2025)
by: Cui, Yue, et al.
Published: (2025)
Incentivizing Strong Reasoning from Weak Supervision
by: Yuan, Yige, et al.
Published: (2025)
by: Yuan, Yige, et al.
Published: (2025)
MLaKE: Multilingual Knowledge Editing Benchmark for Large Language Models
by: Wei, Zihao, et al.
Published: (2024)
by: Wei, Zihao, et al.
Published: (2024)
Everything is Editable: Extend Knowledge Editing to Unstructured Data in Large Language Models
by: Deng, Jingcheng, et al.
Published: (2024)
by: Deng, Jingcheng, et al.
Published: (2024)
Stable Knowledge Editing in Large Language Models
by: Wei, Zihao, et al.
Published: (2024)
by: Wei, Zihao, et al.
Published: (2024)
Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
by: Wu, Junkang, et al.
Published: (2024)
by: Wu, Junkang, et al.
Published: (2024)
Know When You're Wrong: Aligning Confidence with Correctness for LLM Error Detection
by: Xiaohu, Xie, et al.
Published: (2026)
by: Xiaohu, Xie, et al.
Published: (2026)
Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for Large Language Model Agents
by: Yu, Yi, et al.
Published: (2026)
by: Yu, Yi, et al.
Published: (2026)
Supervised Optimism Correction: Be Confident When LLMs Are Sure
by: Zhang, Junjie, et al.
Published: (2025)
by: Zhang, Junjie, et al.
Published: (2025)
Accelerating the Surrogate Retraining for Poisoning Attacks against Recommender Systems
by: Wu, Yunfan, et al.
Published: (2024)
by: Wu, Yunfan, et al.
Published: (2024)
Towards Anthropomorphic Conversational AI Part I: A Practical Framework
by: Wei, Fei, et al.
Published: (2025)
by: Wei, Fei, et al.
Published: (2025)
Fact-Level Confidence Calibration and Self-Correction
by: Yuan, Yige, et al.
Published: (2024)
by: Yuan, Yige, et al.
Published: (2024)
Exploring Selective Layer Fine-Tuning in Federated Learning
by: Sun, Yuchang, et al.
Published: (2024)
by: Sun, Yuchang, et al.
Published: (2024)
$β$-DPO: Direct Preference Optimization with Dynamic $β$
by: Wu, Junkang, et al.
Published: (2024)
by: Wu, Junkang, et al.
Published: (2024)
Accurate Table Question Answering with Accessible LLMs
by: Jiang, Yangfan, et al.
Published: (2026)
by: Jiang, Yangfan, et al.
Published: (2026)
Reason-KE++: Aligning the Process, Not Just the Outcome, for Faithful LLM Knowledge Editing
by: Wu, Yuchen, et al.
Published: (2025)
by: Wu, Yuchen, et al.
Published: (2025)
Diversity as a Reward: Fine-Tuning LLMs on a Mixture of Domain-Undetermined Data
by: Ling, Zhenqing, et al.
Published: (2025)
by: Ling, Zhenqing, et al.
Published: (2025)
Robust Recommender System: A Survey and Future Directions
by: Zhang, Kaike, et al.
Published: (2023)
by: Zhang, Kaike, et al.
Published: (2023)
Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
by: Yao, Jihan, et al.
Published: (2024)
by: Yao, Jihan, et al.
Published: (2024)
What is Wrong with Perplexity for Long-context Language Modeling?
by: Fang, Lizhe, et al.
Published: (2024)
by: Fang, Lizhe, et al.
Published: (2024)
Too Consistent to Detect: A Study of Self-Consistent Errors in LLMs
by: Tan, Hexiang, et al.
Published: (2025)
by: Tan, Hexiang, et al.
Published: (2025)
Understanding the Collapse of LLMs in Model Editing
by: Yang, Wanli, et al.
Published: (2024)
by: Yang, Wanli, et al.
Published: (2024)
AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization
by: Wu, Junkang, et al.
Published: (2024)
by: Wu, Junkang, et al.
Published: (2024)
Inference-time Alignment in Continuous Space
by: Yuan, Yige, et al.
Published: (2025)
by: Yuan, Yige, et al.
Published: (2025)
CARD: Channel Aligned Robust Blend Transformer for Time Series Forecasting
by: Xue, Wang, et al.
Published: (2023)
by: Xue, Wang, et al.
Published: (2023)
The Mirage of Model Editing: Revisiting Evaluation in the Wild
by: Yang, Wanli, et al.
Published: (2025)
by: Yang, Wanli, et al.
Published: (2025)
LLMs Reading the Rhythms of Daily Life: Aligned Understanding for Behavior Prediction and Generation
by: Meng, Fanjin, et al.
Published: (2026)
by: Meng, Fanjin, et al.
Published: (2026)
When Can We Trust LLM Graders? Calibrating Confidence for Automated Assessment
by: Ferrer, Robinson, et al.
Published: (2026)
by: Ferrer, Robinson, et al.
Published: (2026)
Tree-based Models for Vertical Federated Learning: A Survey
by: Qian, Bingchen, et al.
Published: (2025)
by: Qian, Bingchen, et al.
Published: (2025)
Talk to Right Specialists: Iterative Routing in Multi-agent Systems for Question Answering
by: Wu, Feijie, et al.
Published: (2025)
by: Wu, Feijie, et al.
Published: (2025)
Enhancing Latent Computation in Transformers with Latent Tokens
by: Sun, Yuchang, et al.
Published: (2025)
by: Sun, Yuchang, et al.
Published: (2025)
On Verbalized Confidence Scores for LLMs
by: Yang, Daniel, et al.
Published: (2024)
by: Yang, Daniel, et al.
Published: (2024)
R$^2$PO: Decoupling Training Trajectories from Inference Responses for LLM Reasoning
by: Wang, Jingchu, et al.
Published: (2026)
by: Wang, Jingchu, et al.
Published: (2026)
Lowest Span Confidence: A Zero-Shot Metric for Efficient and Black-Box Hallucination Detection in LLMs
by: Qiao, Yitong, et al.
Published: (2026)
by: Qiao, Yitong, et al.
Published: (2026)
Larger or Smaller Reward Margins to Select Preferences for Alignment?
by: Huang, Kexin, et al.
Published: (2025)
by: Huang, Kexin, et al.
Published: (2025)
XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL
by: Liu, Yifu, et al.
Published: (2025)
by: Liu, Yifu, et al.
Published: (2025)
DB-LLM: Accurate Dual-Binarization for Efficient LLMs
by: Chen, Hong, et al.
Published: (2024)
by: Chen, Hong, et al.
Published: (2024)
Similar Items
-
ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models
by: Ding, Hanxing, et al.
Published: (2025) -
On the Diminishing Returns of Complex Robust RAG Training in the Era of Powerful LLMs
by: Ding, Hanxing, et al.
Published: (2025) -
Rowen: Adaptive Retrieval-Augmented Generation for Hallucination Mitigation in LLMs
by: Ding, Hanxing, et al.
Published: (2024) -
Enhancing Tool Learning in Large Language Models with Hierarchical Error Checklists
by: Cui, Yue, et al.
Published: (2025) -
Incentivizing Strong Reasoning from Weak Supervision
by: Yuan, Yige, et al.
Published: (2025)