| Main Authors: | Wu, Shujin, Qian, Cheng, Fung, Yi R., Liang, Paul Pu, Ji, Heng |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.07316 |
Similar Items
Aligning LLMs with Individual Preferences via Interaction
by: Wu, Shujin, et al.
Published: (2024)
MACAROON: Training Vision-Language Models To Be Your Engaged Partners
by: Wu, Shujin, et al.
Published: (2024)
Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
by: He, Qi, et al.
Published: (2025)
MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration
by: He, Zhitao, et al.
Published: (2025)
CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models
by: Qian, Cheng, et al.
Published: (2023)
Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher
by: Uzunoglu, Arda, et al.
Published: (2026)
VLM2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues
by: Zhang, Jianshu, et al.
Published: (2025)
LEMMA: Towards LVLM-Enhanced Multimodal Misinformation Detection with External Knowledge Augmentation
by: Xuan, Keyang, et al.
Published: (2024)
Self-Correction is More than Refinement: A Learning Framework for Visual and Language Reasoning Tasks
by: He, Jiayi, et al.
Published: (2024)
Improving Weak-to-Strong Generalization with Reliability-Aware Alignment
by: Guo, Yue, et al.
Published: (2024)
Group-Adaptive Threshold Optimization for Robust AI-Generated Text Detection
by: Jung, Minseok, et al.
Published: (2025)
Your Weak LLM is Secretly a Strong Teacher for Alignment
by: Tao, Leitian, et al.
Published: (2024)
Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization
by: Yang, Wenkai, et al.
Published: (2024)
MimeQA: Towards Socially-Intelligent Nonverbal Foundation Models
by: Li, Hengzhi, et al.
Published: (2025)
ADEPT: A DEbiasing PrompT Framework
by: Yang, Ke, et al.
Published: (2022)
Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking
by: Fung, Yi, et al.
Published: (2024)
Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning
by: Sang, Jitao, et al.
Published: (2024)
Selective Weak-to-Strong Generalization
by: Lang, Hao, et al.
Published: (2025)
NormSAGE: Multi-Lingual Multi-Cultural Norm Discovery from Conversations On-the-Fly
by: Fung, Yi R., et al.
Published: (2022)
Debate Helps Weak-to-Strong Generalization
by: Lang, Hao, et al.
Published: (2025)
Theoretical Analysis of Weak-to-Strong Generalization
by: Lang, Hunter, et al.
Published: (2024)
Demonstration Augmentation for Zero-shot In-context Learning
by: Su, Yi, et al.
Published: (2024)
UserHarness: Harnessing User Minds for Stronger Agent Theory-of-Mind
by: Qian, Cheng, et al.
Published: (2026)
The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination
by: Zhang, Yuji, et al.
Published: (2025)
Bayesian WeakS-to-Strong from Text Classification to Generation
by: Cui, Ziyun, et al.
Published: (2024)
SmartBook: AI-Assisted Situation Report Generation for Intelligence Analysts
by: Reddy, Revanth Gangi, et al.
Published: (2023)
CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering
by: Wang, Yumeng, et al.
Published: (2025)
Advancing Language Multi-Agent Learning with Credit Re-Assignment for Interactive Environment Generalization
by: He, Zhitao, et al.
Published: (2025)
CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets
by: Yuan, Lifan, et al.
Published: (2023)
Instruction Tuning for Story Understanding and Generation with Weak Supervision
by: Yuan, Yangshu, et al.
Published: (2025)
Knowledge Overshadowing Causes Amalgamated Hallucination in Large Language Models
by: Zhang, Yuji, et al.
Published: (2024)
Weak-to-Strong Reasoning
by: Yang, Yuqing, et al.
Published: (2024)
Weak-to-Strong GraphRAG: Aligning Weak Retrievers with Large Language Models for Graph-based Retrieval Augmented Generation
by: Zou, Deyu, et al.
Published: (2025)
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
by: Li, Ming, et al.
Published: (2024)
PEARL: Self-Evolving Assistant for Time Management with Reinforcement Learning
by: Li, Bingxuan, et al.
Published: (2026)
R-Tuning: Instructing Large Language Models to Say 'I Don't Know'
by: Zhang, Hanning, et al.
Published: (2023)
ConTrans: Weak-to-Strong Alignment Engineering via Concept Transplantation
by: Dong, Weilong, et al.
Published: (2024)
Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection
by: Park, Kwanyong, et al.
Published: (2024)
Improved Compositional Generalization by Generating Demonstrations for Meta-Learning
by: Spilsbury, Sam, et al.
Published: (2023)
Weak-to-Strong Jailbreaking on Large Language Models
by: Zhao, Xuandong, et al.
Published: (2024)