Saved in:
| Main Authors: | Chen, Yicheng, Ma, Zerun, Xie, Xinchen, Li, Yining, Chen, Kai |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.11089 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space
by: Chen, Yicheng, et al.
Published: (2025)
by: Chen, Yicheng, et al.
Published: (2025)
TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration
by: Ma, Zerun, et al.
Published: (2026)
by: Ma, Zerun, et al.
Published: (2026)
GTA: A Benchmark for General Tool Agents
by: Wang, Jize, et al.
Published: (2024)
by: Wang, Jize, et al.
Published: (2024)
Losses that Cook: Topological Optimal Transport for Structured Recipe Generation
by: Ottoborgo, Mattia, et al.
Published: (2026)
by: Ottoborgo, Mattia, et al.
Published: (2026)
Cooking Up Creativity: Enhancing LLM Creativity through Structured Recombination
by: Mizrahi, Moran, et al.
Published: (2025)
by: Mizrahi, Moran, et al.
Published: (2025)
Domain-Specific Data Generation Framework for RAG Adaptation
by: Tian, Chris Xing, et al.
Published: (2025)
by: Tian, Chris Xing, et al.
Published: (2025)
Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation
by: Chen, Yifang, et al.
Published: (2024)
by: Chen, Yifang, et al.
Published: (2024)
Scaling Up, Speeding Up: A Benchmark of Speculative Decoding for Efficient LLM Test-Time Scaling
by: Sun, Shengyin, et al.
Published: (2025)
by: Sun, Shengyin, et al.
Published: (2025)
CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents
by: Sutawika, Lintang, et al.
Published: (2026)
by: Sutawika, Lintang, et al.
Published: (2026)
CGU-ILALab at FoodBench-QA 2026: Comparing Traditional and LLM-based Approaches for Recipe Nutrient Estimation
by: Chen, Wei-Chun, et al.
Published: (2026)
by: Chen, Wei-Chun, et al.
Published: (2026)
Ratchet: A Minimal Hygiene Recipe for Self-Evolving LLM Agents
by: Zhang, Xing, et al.
Published: (2026)
by: Zhang, Xing, et al.
Published: (2026)
FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination
by: Zhou, Pengfei, et al.
Published: (2024)
by: Zhou, Pengfei, et al.
Published: (2024)
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
by: Li, Yuan, et al.
Published: (2025)
by: Li, Yuan, et al.
Published: (2025)
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement
by: Cao, Maosong, et al.
Published: (2025)
by: Cao, Maosong, et al.
Published: (2025)
Learning from Response not Preference: A Stackelberg Approach for LLM Detoxification using Non-parallel Data
by: Xie, Xinhong, et al.
Published: (2024)
by: Xie, Xinhong, et al.
Published: (2024)
Enhancing Action and Ingredient Modeling for Semantically Grounded Recipe Generation
by: Liu, Guoshan, et al.
Published: (2026)
by: Liu, Guoshan, et al.
Published: (2026)
Ranking Unraveled: Recipes for LLM Rankings in Head-to-Head AI Combat
by: Daynauth, Roland, et al.
Published: (2024)
by: Daynauth, Roland, et al.
Published: (2024)
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
by: Xie, Tian, et al.
Published: (2025)
by: Xie, Tian, et al.
Published: (2025)
Boosting LLM via Learning from Data Iteratively and Selectively
by: Jia, Qi, et al.
Published: (2024)
by: Jia, Qi, et al.
Published: (2024)
Reinforcement Learning on Pre-Training Data
by: Li, Siheng, et al.
Published: (2025)
by: Li, Siheng, et al.
Published: (2025)
LESA: Learnable LLM Layer Scaling-Up
by: Yang, Yifei, et al.
Published: (2025)
by: Yang, Yifei, et al.
Published: (2025)
Teaching Language Models to Critique via Reinforcement Learning
by: Xie, Zhihui, et al.
Published: (2025)
by: Xie, Zhihui, et al.
Published: (2025)
GradAlign: Gradient-Aligned Data Selection for LLM Reinforcement Learning
by: Yang, Ningyuan, et al.
Published: (2026)
by: Yang, Ningyuan, et al.
Published: (2026)
MobileLLM-R1: Exploring the Limits of Sub-Billion Language Model Reasoners with Open Training Recipes
by: Zhao, Changsheng, et al.
Published: (2025)
by: Zhao, Changsheng, et al.
Published: (2025)
Improving Data Efficiency via Curating LLM-Driven Rating Systems
by: Pang, Jinlong, et al.
Published: (2024)
by: Pang, Jinlong, et al.
Published: (2024)
Optimizing Decomposition for Optimal Claim Verification
by: Lu, Yining, et al.
Published: (2025)
by: Lu, Yining, et al.
Published: (2025)
Data Compressibility Quantifies LLM Memorization
by: Huang, Yizhan, et al.
Published: (2025)
by: Huang, Yizhan, et al.
Published: (2025)
Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay
by: Sun, Yifan, et al.
Published: (2025)
by: Sun, Yifan, et al.
Published: (2025)
Metacognition as Reward: Reinforcing LLM Reasoning via Knowledge and Regulation Signals
by: Chen, Sirui, et al.
Published: (2026)
by: Chen, Sirui, et al.
Published: (2026)
Safer-Instruct: Aligning Language Models with Automated Preference Data
by: Shi, Taiwei, et al.
Published: (2023)
by: Shi, Taiwei, et al.
Published: (2023)
Learning Multi-Indicator Weights for Data Selection: A Joint Task-Model Adaptation Framework with Efficient Proxies
by: Song, Jingze, et al.
Published: (2026)
by: Song, Jingze, et al.
Published: (2026)
CAST: Achieving Stable LLM-based Text Analysis for Data Analytics
by: Xie, Jinxiang, et al.
Published: (2026)
by: Xie, Jinxiang, et al.
Published: (2026)
SoftDedup: an Efficient Data Reweighting Method for Speeding Up Language Model Pre-training
by: He, Nan, et al.
Published: (2024)
by: He, Nan, et al.
Published: (2024)
Pyramid-Driven Alignment: Pyramid Principle Guided Integration of Large Language Models and Knowledge Graphs
by: Sun, Lei, et al.
Published: (2024)
by: Sun, Lei, et al.
Published: (2024)
SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data
by: Fang, Wenkai, et al.
Published: (2025)
by: Fang, Wenkai, et al.
Published: (2025)
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
by: Chen, Mingyang, et al.
Published: (2025)
by: Chen, Mingyang, et al.
Published: (2025)
Domain Adaptation of LLMs for Process Data
by: Oyamada, Rafael Seidi, et al.
Published: (2025)
by: Oyamada, Rafael Seidi, et al.
Published: (2025)
SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection
by: Shen, Han, et al.
Published: (2024)
by: Shen, Han, et al.
Published: (2024)
AddrLLM: Address Rewriting via Large Language Model on Nationwide Logistics Data
by: Yang, Qinchen, et al.
Published: (2024)
by: Yang, Qinchen, et al.
Published: (2024)
ProMind-LLM: Proactive Mental Health Care via Causal Reasoning with Sensor Data
by: Zheng, Xinzhe, et al.
Published: (2025)
by: Zheng, Xinzhe, et al.
Published: (2025)
Similar Items
-
MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space
by: Chen, Yicheng, et al.
Published: (2025) -
TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration
by: Ma, Zerun, et al.
Published: (2026) -
GTA: A Benchmark for General Tool Agents
by: Wang, Jize, et al.
Published: (2024) -
Losses that Cook: Topological Optimal Transport for Structured Recipe Generation
by: Ottoborgo, Mattia, et al.
Published: (2026) -
Cooking Up Creativity: Enhancing LLM Creativity through Structured Recombination
by: Mizrahi, Moran, et al.
Published: (2025)