:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yan, Yuming, Yang, Shuo, Tang, Kai, Chen, Sihong, Zhang, Yang, Xu, Ke, Hu, Dan, Yu, Qun, Hu, Pengfei, Ngai, Edith C. H.
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2602.10740
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

S-GRPO: Unified Post-Training for Large Vision-Language Models
by: Yan, Yuming, et al.
Published: (2026)

Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
by: Li, Shuo, et al.
Published: (2024)

OCR-Memory: Optical Context Retrieval for Long-Horizon Agent Memory
by: Li, Jinze, et al.
Published: (2026)

RAMA: Retrieval-Augmented Multi-Agent Framework for Misinformation Detection in Multimodal Fact-Checking
by: Yang, Shuo, et al.
Published: (2025)

LATTE: Forecasting Peer Anchored Preference Trajectories for Personalized LLM Generation
by: Li, Jinze, et al.
Published: (2026)

Beyond the Target: From Imitation to Collaboration in Speculative Decoding
by: Li, Jinze, et al.
Published: (2026)

Training-Free Loosely Speculative Decoding: Accepting Semantically Correct Drafts Beyond Exact Match
by: Li, Jinze, et al.
Published: (2025)

GAPrune: Gradient-Alignment Pruning for Domain-Aware Embeddings
by: Tang, Yixuan, et al.
Published: (2025)

Shadow-FT: Tuning Instruct Model via Training on Paired Base Model
by: Wu, Taiqiang, et al.
Published: (2025)

LLM-NEO: Parameter Efficient Knowledge Distillation for Large Language Models
by: Yang, Runming, et al.
Published: (2024)

TACL: Threshold-Adaptive Curriculum Learning Strategy for Enhancing Medical Text Understanding
by: Ren, Mucheng, et al.
Published: (2025)

Label Alignment and Reassignment with Generalist Large Language Model for Enhanced Cross-Domain Named Entity Recognition
by: Bao, Ke, et al.
Published: (2024)

RealFactBench: A Benchmark for Evaluating Large Language Models in Real-World Fact-Checking
by: Yang, Shuo, et al.
Published: (2025)

Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning
by: Lei, Xuanyu, et al.
Published: (2025)

Do VLMs Truly "Read" Candlesticks? A Multi-Scale Benchmark for Visual Stock Price Forecasting
by: Hu, Kaiqi, et al.
Published: (2026)

Long-Chain Reasoning Distillation via Adaptive Prefix Alignment
by: Liu, Zhenghao, et al.
Published: (2026)

DARE: Diffusion Large Language Models Alignment and Reinforcement Executor
by: Yang, Jingyi, et al.
Published: (2026)

Beyond Textual Context: Structural Graph Encoding with Adaptive Space Alignment to alleviate the hallucination of LLMs
by: Zhang, Yifang, et al.
Published: (2025)

Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs
by: Lu, Meng, et al.
Published: (2025)

No Loss, No Gain: Gated Refinement and Adaptive Compression for Prompt Optimization
by: Shi, Wenhang, et al.
Published: (2025)

Scaling LLM Pre-training with Vocabulary Curriculum
by: Yu, Fangyuan
Published: (2025)

How Useful is Continued Pre-Training for Generative Unsupervised Domain Adaptation?
by: Uppaal, Rheeya, et al.
Published: (2024)

VLMs May Not Globally Enhance Human Alignment over LLMs During Natural Reading
by: Wu, Jinzhou, et al.
Published: (2026)

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
by: Qi, Zehan, et al.
Published: (2024)

RLCD: Reinforcement Learning from Contrastive Distillation for Language Model Alignment
by: Yang, Kevin, et al.
Published: (2023)

Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
by: Shi, Taiwei, et al.
Published: (2025)

The Art of Practical Curriculum Making: HEP-Secondary Language Skills.
by: Kleinjans, Edith K.
Published: (1976)

SSFO: Self-Supervised Faithfulness Optimization for Retrieval-Augmented Generation
by: Tang, Xiaqiang, et al.
Published: (2025)

Deep Pre-Alignment for VLMs
by: Yu, Tianyu, et al.
Published: (2026)

Legal Mathematical Reasoning with LLMs: Procedural Alignment through Two-Stage Reinforcement Learning
by: Zhang, Kepu, et al.
Published: (2025)

Alignment for Honesty
by: Yang, Yuqing, et al.
Published: (2023)

Reinforcement Pre-Training
by: Dong, Qingxiu, et al.
Published: (2025)

Decompose, Look, and Reason: Reinforced Latent Reasoning for VLMs
by: Zhu, Mengdan, et al.
Published: (2026)

Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization
by: Yang, Shuo, et al.
Published: (2024)

Meta-RTL: Reinforcement-Based Meta-Transfer Learning for Low-Resource Commonsense Reasoning
by: Fu, Yu, et al.
Published: (2024)

A Comparative Evaluation of Structural Topic Models and BERTopic for Short, Open-Ended Survey Responses
by: Jiang, Yan, et al.
Published: (2026)

G-MAP: General Memory-Augmented Pre-trained Language Model for Domain Tasks
by: Wan, Zhongwei, et al.
Published: (2022)

Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models
by: Tang, Lei, et al.
Published: (2025)

Cat-DPO: Category-Adaptive Safety Alignment
by: Yang, Tiankai, et al.
Published: (2026)

Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs
by: Tang, Xiaqiang, et al.
Published: (2024)