:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wu, Shujin, Qian, Cheng, Fung, Yi R., Liang, Paul Pu, Ji, Heng
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2504.07316
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Aligning LLMs with Individual Preferences via Interaction
by: Wu, Shujin, et al.
Published: (2024)

MACAROON: Training Vision-Language Models To Be Your Engaged Partners
by: Wu, Shujin, et al.
Published: (2024)

Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
by: He, Qi, et al.
Published: (2025)

MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration
by: He, Zhitao, et al.
Published: (2025)

CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models
by: Qian, Cheng, et al.
Published: (2023)

Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher
by: Uzunoglu, Arda, et al.
Published: (2026)

VLM2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues
by: Zhang, Jianshu, et al.
Published: (2025)

LEMMA: Towards LVLM-Enhanced Multimodal Misinformation Detection with External Knowledge Augmentation
by: Xuan, Keyang, et al.
Published: (2024)

Self-Correction is More than Refinement: A Learning Framework for Visual and Language Reasoning Tasks
by: He, Jiayi, et al.
Published: (2024)

Improving Weak-to-Strong Generalization with Reliability-Aware Alignment
by: Guo, Yue, et al.
Published: (2024)

Group-Adaptive Threshold Optimization for Robust AI-Generated Text Detection
by: Jung, Minseok, et al.
Published: (2025)

Your Weak LLM is Secretly a Strong Teacher for Alignment
by: Tao, Leitian, et al.
Published: (2024)

Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization
by: Yang, Wenkai, et al.
Published: (2024)

MimeQA: Towards Socially-Intelligent Nonverbal Foundation Models
by: Li, Hengzhi, et al.
Published: (2025)

ADEPT: A DEbiasing PrompT Framework
by: Yang, Ke, et al.
Published: (2022)

Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking
by: Fung, Yi, et al.
Published: (2024)

Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning
by: Sang, Jitao, et al.
Published: (2024)

Selective Weak-to-Strong Generalization
by: Lang, Hao, et al.
Published: (2025)

NormSAGE: Multi-Lingual Multi-Cultural Norm Discovery from Conversations On-the-Fly
by: Fung, Yi R., et al.
Published: (2022)

Debate Helps Weak-to-Strong Generalization
by: Lang, Hao, et al.
Published: (2025)

Theoretical Analysis of Weak-to-Strong Generalization
by: Lang, Hunter, et al.
Published: (2024)

Demonstration Augmentation for Zero-shot In-context Learning
by: Su, Yi, et al.
Published: (2024)

UserHarness: Harnessing User Minds for Stronger Agent Theory-of-Mind
by: Qian, Cheng, et al.
Published: (2026)

The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination
by: Zhang, Yuji, et al.
Published: (2025)

Bayesian WeakS-to-Strong from Text Classification to Generation
by: Cui, Ziyun, et al.
Published: (2024)

SmartBook: AI-Assisted Situation Report Generation for Intelligence Analysts
by: Reddy, Revanth Gangi, et al.
Published: (2023)

CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering
by: Wang, Yumeng, et al.
Published: (2025)

Advancing Language Multi-Agent Learning with Credit Re-Assignment for Interactive Environment Generalization
by: He, Zhitao, et al.
Published: (2025)

CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets
by: Yuan, Lifan, et al.
Published: (2023)

Instruction Tuning for Story Understanding and Generation with Weak Supervision
by: Yuan, Yangshu, et al.
Published: (2025)

Knowledge Overshadowing Causes Amalgamated Hallucination in Large Language Models
by: Zhang, Yuji, et al.
Published: (2024)

Weak-to-Strong Reasoning
by: Yang, Yuqing, et al.
Published: (2024)

Weak-to-Strong GraphRAG: Aligning Weak Retrievers with Large Language Models for Graph-based Retrieval Augmented Generation
by: Zou, Deyu, et al.
Published: (2025)

Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
by: Li, Ming, et al.
Published: (2024)

PEARL: Self-Evolving Assistant for Time Management with Reinforcement Learning
by: Li, Bingxuan, et al.
Published: (2026)

R-Tuning: Instructing Large Language Models to Say `I Don't Know'
by: Zhang, Hanning, et al.
Published: (2023)

ConTrans: Weak-to-Strong Alignment Engineering via Concept Transplantation
by: Dong, Weilong, et al.
Published: (2024)

Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection
by: Park, Kwanyong, et al.
Published: (2024)

Improved Compositional Generalization by Generating Demonstrations for Meta-Learning
by: Spilsbury, Sam, et al.
Published: (2023)

Weak-to-Strong Jailbreaking on Large Language Models
by: Zhao, Xuandong, et al.
Published: (2024)