Saved in:
| Main Authors: | Yan, Yuming, Yang, Shuo, Tang, Kai, Chen, Sihong, Zhang, Yang, Xu, Ke, Hu, Dan, Yu, Qun, Hu, Pengfei, Ngai, Edith C. H. |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.10740 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
S-GRPO: Unified Post-Training for Large Vision-Language Models
by: Yan, Yuming, et al.
Published: (2026)
by: Yan, Yuming, et al.
Published: (2026)
Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
by: Li, Shuo, et al.
Published: (2024)
by: Li, Shuo, et al.
Published: (2024)
OCR-Memory: Optical Context Retrieval for Long-Horizon Agent Memory
by: Li, Jinze, et al.
Published: (2026)
by: Li, Jinze, et al.
Published: (2026)
RAMA: Retrieval-Augmented Multi-Agent Framework for Misinformation Detection in Multimodal Fact-Checking
by: Yang, Shuo, et al.
Published: (2025)
by: Yang, Shuo, et al.
Published: (2025)
LATTE: Forecasting Peer Anchored Preference Trajectories for Personalized LLM Generation
by: Li, Jinze, et al.
Published: (2026)
by: Li, Jinze, et al.
Published: (2026)
Beyond the Target: From Imitation to Collaboration in Speculative Decoding
by: Li, Jinze, et al.
Published: (2026)
by: Li, Jinze, et al.
Published: (2026)
Training-Free Loosely Speculative Decoding: Accepting Semantically Correct Drafts Beyond Exact Match
by: Li, Jinze, et al.
Published: (2025)
by: Li, Jinze, et al.
Published: (2025)
GAPrune: Gradient-Alignment Pruning for Domain-Aware Embeddings
by: Tang, Yixuan, et al.
Published: (2025)
by: Tang, Yixuan, et al.
Published: (2025)
Shadow-FT: Tuning Instruct Model via Training on Paired Base Model
by: Wu, Taiqiang, et al.
Published: (2025)
by: Wu, Taiqiang, et al.
Published: (2025)
LLM-NEO: Parameter Efficient Knowledge Distillation for Large Language Models
by: Yang, Runming, et al.
Published: (2024)
by: Yang, Runming, et al.
Published: (2024)
TACL: Threshold-Adaptive Curriculum Learning Strategy for Enhancing Medical Text Understanding
by: Ren, Mucheng, et al.
Published: (2025)
by: Ren, Mucheng, et al.
Published: (2025)
Label Alignment and Reassignment with Generalist Large Language Model for Enhanced Cross-Domain Named Entity Recognition
by: Bao, Ke, et al.
Published: (2024)
by: Bao, Ke, et al.
Published: (2024)
RealFactBench: A Benchmark for Evaluating Large Language Models in Real-World Fact-Checking
by: Yang, Shuo, et al.
Published: (2025)
by: Yang, Shuo, et al.
Published: (2025)
Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning
by: Lei, Xuanyu, et al.
Published: (2025)
by: Lei, Xuanyu, et al.
Published: (2025)
Do VLMs Truly "Read" Candlesticks? A Multi-Scale Benchmark for Visual Stock Price Forecasting
by: Hu, Kaiqi, et al.
Published: (2026)
by: Hu, Kaiqi, et al.
Published: (2026)
Long-Chain Reasoning Distillation via Adaptive Prefix Alignment
by: Liu, Zhenghao, et al.
Published: (2026)
by: Liu, Zhenghao, et al.
Published: (2026)
DARE: Diffusion Large Language Models Alignment and Reinforcement Executor
by: Yang, Jingyi, et al.
Published: (2026)
by: Yang, Jingyi, et al.
Published: (2026)
Beyond Textual Context: Structural Graph Encoding with Adaptive Space Alignment to alleviate the hallucination of LLMs
by: Zhang, Yifang, et al.
Published: (2025)
by: Zhang, Yifang, et al.
Published: (2025)
Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs
by: Lu, Meng, et al.
Published: (2025)
by: Lu, Meng, et al.
Published: (2025)
No Loss, No Gain: Gated Refinement and Adaptive Compression for Prompt Optimization
by: Shi, Wenhang, et al.
Published: (2025)
by: Shi, Wenhang, et al.
Published: (2025)
Scaling LLM Pre-training with Vocabulary Curriculum
by: Yu, Fangyuan
Published: (2025)
by: Yu, Fangyuan
Published: (2025)
How Useful is Continued Pre-Training for Generative Unsupervised Domain Adaptation?
by: Uppaal, Rheeya, et al.
Published: (2024)
by: Uppaal, Rheeya, et al.
Published: (2024)
VLMs May Not Globally Enhance Human Alignment over LLMs During Natural Reading
by: Wu, Jinzhou, et al.
Published: (2026)
by: Wu, Jinzhou, et al.
Published: (2026)
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
by: Qi, Zehan, et al.
Published: (2024)
by: Qi, Zehan, et al.
Published: (2024)
RLCD: Reinforcement Learning from Contrastive Distillation for Language Model Alignment
by: Yang, Kevin, et al.
Published: (2023)
by: Yang, Kevin, et al.
Published: (2023)
Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
by: Shi, Taiwei, et al.
Published: (2025)
by: Shi, Taiwei, et al.
Published: (2025)
The Art of Practical Curriculum Making: HEP-Secondary Language Skills.
by: Kleinjans, Edith K.
Published: (1976)
by: Kleinjans, Edith K.
Published: (1976)
SSFO: Self-Supervised Faithfulness Optimization for Retrieval-Augmented Generation
by: Tang, Xiaqiang, et al.
Published: (2025)
by: Tang, Xiaqiang, et al.
Published: (2025)
Deep Pre-Alignment for VLMs
by: Yu, Tianyu, et al.
Published: (2026)
by: Yu, Tianyu, et al.
Published: (2026)
Legal Mathematical Reasoning with LLMs: Procedural Alignment through Two-Stage Reinforcement Learning
by: Zhang, Kepu, et al.
Published: (2025)
by: Zhang, Kepu, et al.
Published: (2025)
Alignment for Honesty
by: Yang, Yuqing, et al.
Published: (2023)
by: Yang, Yuqing, et al.
Published: (2023)
Reinforcement Pre-Training
by: Dong, Qingxiu, et al.
Published: (2025)
by: Dong, Qingxiu, et al.
Published: (2025)
Decompose, Look, and Reason: Reinforced Latent Reasoning for VLMs
by: Zhu, Mengdan, et al.
Published: (2026)
by: Zhu, Mengdan, et al.
Published: (2026)
Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization
by: Yang, Shuo, et al.
Published: (2024)
by: Yang, Shuo, et al.
Published: (2024)
Meta-RTL: Reinforcement-Based Meta-Transfer Learning for Low-Resource Commonsense Reasoning
by: Fu, Yu, et al.
Published: (2024)
by: Fu, Yu, et al.
Published: (2024)
A Comparative Evaluation of Structural Topic Models and BERTopic for Short, Open-Ended Survey Responses
by: Jiang, Yan, et al.
Published: (2026)
by: Jiang, Yan, et al.
Published: (2026)
G-MAP: General Memory-Augmented Pre-trained Language Model for Domain Tasks
by: Wan, Zhongwei, et al.
Published: (2022)
by: Wan, Zhongwei, et al.
Published: (2022)
Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models
by: Tang, Lei, et al.
Published: (2025)
by: Tang, Lei, et al.
Published: (2025)
Cat-DPO: Category-Adaptive Safety Alignment
by: Yang, Tiankai, et al.
Published: (2026)
by: Yang, Tiankai, et al.
Published: (2026)
Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs
by: Tang, Xiaqiang, et al.
Published: (2024)
by: Tang, Xiaqiang, et al.
Published: (2024)
Similar Items
-
S-GRPO: Unified Post-Training for Large Vision-Language Models
by: Yan, Yuming, et al.
Published: (2026) -
Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
by: Li, Shuo, et al.
Published: (2024) -
OCR-Memory: Optical Context Retrieval for Long-Horizon Agent Memory
by: Li, Jinze, et al.
Published: (2026) -
RAMA: Retrieval-Augmented Multi-Agent Framework for Misinformation Detection in Multimodal Fact-Checking
by: Yang, Shuo, et al.
Published: (2025) -
LATTE: Forecasting Peer Anchored Preference Trajectories for Personalized LLM Generation
by: Li, Jinze, et al.
Published: (2026)