Guardado en:
| Autores principales: | Li, Junjie, Wang, Ziao, Ma, NingXuan, Ma, Jianghong, Zhang, Xiaofeng |
|---|---|
| Formato: | Preprint |
| Publicado: |
2026
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2605.13130 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Uncovering Intrinsic Capabilities: A Paradigm for Data Curation in Vision-Language Models
por: Li, Junjie, et al.
Publicado: (2025)
por: Li, Junjie, et al.
Publicado: (2025)
GiVE: Guiding Visual Encoder to Perceive Overlooked Information
por: Li, Junjie, et al.
Publicado: (2024)
por: Li, Junjie, et al.
Publicado: (2024)
MLGIB: Multi-Label Graph Information Bottleneck for Expressive and Robust Message Passing
por: Wu, Chaokai, et al.
Publicado: (2026)
por: Wu, Chaokai, et al.
Publicado: (2026)
Does Faithfulness Conflict with Plausibility? An Empirical Study in Explainable AI across NLP Tasks
por: Lu, Xiaolei, et al.
Publicado: (2024)
por: Lu, Xiaolei, et al.
Publicado: (2024)
GRACE: Discriminator-Guided Chain-of-Thought Reasoning
por: Khalifa, Muhammad, et al.
Publicado: (2023)
por: Khalifa, Muhammad, et al.
Publicado: (2023)
Dual-Phase Playtime-guided Recommendation: Interest Intensity Exploration and Multimodal Random Walks
por: Zhang, Jingmao, et al.
Publicado: (2025)
por: Zhang, Jingmao, et al.
Publicado: (2025)
Diversity Recommendation via Causal Deconfounding of Co-purchase Relations and Counterfactual Exposure
por: Zhang, Jingmao, et al.
Publicado: (2025)
por: Zhang, Jingmao, et al.
Publicado: (2025)
On Data Synthesis and Post-training for Visual Abstract Reasoning
por: Zhu, Ke, et al.
Publicado: (2025)
por: Zhu, Ke, et al.
Publicado: (2025)
IDVT: Interest-aware Denoising and View-guided Tuning for Social Recommendation
por: Yang, Dezhao, et al.
Publicado: (2023)
por: Yang, Dezhao, et al.
Publicado: (2023)
STARec: An Efficient Agent Framework for Recommender Systems via Autonomous Deliberate Reasoning
por: Wu, Chenghao, et al.
Publicado: (2025)
por: Wu, Chenghao, et al.
Publicado: (2025)
CircuitSeer: Mining High-Quality Data by Probing Mathematical Reasoning Circuits in LLMs
por: Wang, Shaobo, et al.
Publicado: (2025)
por: Wang, Shaobo, et al.
Publicado: (2025)
Layer-Aware Influence for Online Data Valuation Estimation
por: Yang, Ziao, et al.
Publicado: (2025)
por: Yang, Ziao, et al.
Publicado: (2025)
DCA-Bench: A Benchmark for Dataset Curation Agents
por: Huang, Benhao, et al.
Publicado: (2024)
por: Huang, Benhao, et al.
Publicado: (2024)
DaGRPO: Rectifying Gradient Conflict in Reasoning via Distinctiveness-Aware Group Relative Policy Optimization
por: Xie, Xuan, et al.
Publicado: (2025)
por: Xie, Xuan, et al.
Publicado: (2025)
RADAR: Accelerating Large Language Model Inference With RL-Based Dynamic Draft Trees
por: Ma, Junjie, et al.
Publicado: (2025)
por: Ma, Junjie, et al.
Publicado: (2025)
What Matters in Data Curation for Multimodal Reasoning? Insights from the DCVLR Challenge
por: Shin, Yosub, et al.
Publicado: (2026)
por: Shin, Yosub, et al.
Publicado: (2026)
Data to Defense: The Role of Curation in Customizing LLMs Against Jailbreaking Attacks
por: Liu, Xiaoqun, et al.
Publicado: (2024)
por: Liu, Xiaoqun, et al.
Publicado: (2024)
How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients
por: Li, Ming, et al.
Publicado: (2025)
por: Li, Ming, et al.
Publicado: (2025)
The Impact of Post-training on Data Contamination
por: Kocyigit, Muhammed Yusuf, et al.
Publicado: (2026)
por: Kocyigit, Muhammed Yusuf, et al.
Publicado: (2026)
Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks
por: Zhang, Yuxiang, et al.
Publicado: (2025)
por: Zhang, Yuxiang, et al.
Publicado: (2025)
Mitigating Overthinking in Large Reasoning Models via Difficulty-aware Reinforcement Learning
por: Wan, Qian, et al.
Publicado: (2026)
por: Wan, Qian, et al.
Publicado: (2026)
Efficient Post-training Quantization with FP8 Formats
por: Shen, Haihao, et al.
Publicado: (2023)
por: Shen, Haihao, et al.
Publicado: (2023)
Post-training for Efficient Communication via Convention Formation
por: Hua, Yilun, et al.
Publicado: (2025)
por: Hua, Yilun, et al.
Publicado: (2025)
Revisiting the Data Sampling in Multimodal Post-training from a Difficulty-Distinguish View
por: Qi, Jianyu, et al.
Publicado: (2025)
por: Qi, Jianyu, et al.
Publicado: (2025)
Post Reasoning: Improving the Performance of Non-Thinking Models at No Cost
por: Xuan, Richmond Sin Jing, et al.
Publicado: (2026)
por: Xuan, Richmond Sin Jing, et al.
Publicado: (2026)
CPGRec+: A Balance-oriented Framework for Personalized Video Game Recommendations
por: Li, Xiping, et al.
Publicado: (2026)
por: Li, Xiping, et al.
Publicado: (2026)
Efficient Reasoning Models: A Survey
por: Feng, Sicheng, et al.
Publicado: (2025)
por: Feng, Sicheng, et al.
Publicado: (2025)
Leash: Adaptive Length Penalty and Reward Shaping for Efficient Large Reasoning Model
por: Li, Yanhao, et al.
Publicado: (2025)
por: Li, Yanhao, et al.
Publicado: (2025)
Efficient Post-training of LLMs for Code Generation With Offline Reinforcement Learning
por: Wu, Mingze, et al.
Publicado: (2026)
por: Wu, Mingze, et al.
Publicado: (2026)
Efficient Reasoning with Balanced Thinking
por: Li, Yulin, et al.
Publicado: (2026)
por: Li, Yulin, et al.
Publicado: (2026)
Test-Time Steering for Lossless Text Compression via Weighted Product of Experts
por: Zhang, Qihang, et al.
Publicado: (2025)
por: Zhang, Qihang, et al.
Publicado: (2025)
DocMamba: Efficient Document Pre-training with State Space Model
por: Hu, Pengfei, et al.
Publicado: (2024)
por: Hu, Pengfei, et al.
Publicado: (2024)
Neuro-Symbolic Data Generation for Math Reasoning
por: Li, Zenan, et al.
Publicado: (2024)
por: Li, Zenan, et al.
Publicado: (2024)
Not All Preferences Are Created Equal: Stability-Aware and Gradient-Efficient Alignment for Reasoning Models
por: Wu, Hui, et al.
Publicado: (2026)
por: Wu, Hui, et al.
Publicado: (2026)
GRACE: A Dynamic Coreset Selection Framework for Large Language Model Optimization
por: Tang, Tianhao, et al.
Publicado: (2026)
por: Tang, Tianhao, et al.
Publicado: (2026)
GRACE: Loss-Resilient Real-Time Video through Neural Codecs
por: Cheng, Yihua, et al.
Publicado: (2023)
por: Cheng, Yihua, et al.
Publicado: (2023)
CuraLight: Debate-Guided Data Curation for LLM-Centered Traffic Signal Control
por: Guo, Qing, et al.
Publicado: (2026)
por: Guo, Qing, et al.
Publicado: (2026)
Detecting Data Contamination from Reinforcement Learning Post-training for Large Language Models
por: Tao, Yongding, et al.
Publicado: (2025)
por: Tao, Yongding, et al.
Publicado: (2025)
Efficient Reasoning Through Suppression of Self-Affirmation Reflections in Large Reasoning Models
por: Liu, Kaiyuan, et al.
Publicado: (2025)
por: Liu, Kaiyuan, et al.
Publicado: (2025)
Recontextualizing Famous Quotes for Brand Slogan Generation
por: Yang, Ziao, et al.
Publicado: (2026)
por: Yang, Ziao, et al.
Publicado: (2026)
Ejemplares similares
-
Uncovering Intrinsic Capabilities: A Paradigm for Data Curation in Vision-Language Models
por: Li, Junjie, et al.
Publicado: (2025) -
GiVE: Guiding Visual Encoder to Perceive Overlooked Information
por: Li, Junjie, et al.
Publicado: (2024) -
MLGIB: Multi-Label Graph Information Bottleneck for Expressive and Robust Message Passing
por: Wu, Chaokai, et al.
Publicado: (2026) -
Does Faithfulness Conflict with Plausibility? An Empirical Study in Explainable AI across NLP Tasks
por: Lu, Xiaolei, et al.
Publicado: (2024) -
GRACE: Discriminator-Guided Chain-of-Thought Reasoning
por: Khalifa, Muhammad, et al.
Publicado: (2023)