Saved in:
| Main Authors: | Qi, Jianyu, Zou, Ding, Yan, Wenrui, Ma, Rui, Li, Jiaxu, Zheng, Zhijie, Yang, Zhiguo, Zhao, Rongchang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.06722 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations
by: Cai, Wenrui, et al.
Published: (2025)
by: Cai, Wenrui, et al.
Published: (2025)
Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models
by: Zheng, Junhao, et al.
Published: (2023)
by: Zheng, Junhao, et al.
Published: (2023)
Revisiting Self-Play Preference Optimization: On the Role of Prompt Difficulty
by: Xiao, Yao, et al.
Published: (2025)
by: Xiao, Yao, et al.
Published: (2025)
On Predicting the Post-training Potential of Pre-trained LLMs
by: Li, Xiaoyuan, et al.
Published: (2026)
by: Li, Xiaoyuan, et al.
Published: (2026)
Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study
by: An, Keyu, et al.
Published: (2024)
by: An, Keyu, et al.
Published: (2024)
Revisiting Generalization Across Difficulty Levels: It's Not So Easy
by: Kordi, Yeganeh, et al.
Published: (2025)
by: Kordi, Yeganeh, et al.
Published: (2025)
Asymmetric Conflict and Synergy in Post-training for LLM-based Multilingual Machine Translation
by: Zheng, Tong, et al.
Published: (2025)
by: Zheng, Tong, et al.
Published: (2025)
AIR: Post-training Data Selection for Reasoning via Attention Head Influence
by: Liu, Jinrui, et al.
Published: (2025)
by: Liu, Jinrui, et al.
Published: (2025)
Unsupervised Cross-Lingual Part-of-Speech Tagging with Monolingual Corpora Only
by: Zheng, Jianyu
Published: (2026)
by: Zheng, Jianyu
Published: (2026)
Effective vocabulary expanding of multilingual language models for extremely low-resource languages
by: Zheng, Jianyu
Published: (2026)
by: Zheng, Jianyu
Published: (2026)
On the Impact of Calibration Data in Post-training Quantization and Pruning
by: Williams, Miles, et al.
Published: (2023)
by: Williams, Miles, et al.
Published: (2023)
Unveiling the Impact of Multimodal Features on Chinese Spelling Correction: From Analysis to Design
by: Zhang, Xiaowu, et al.
Published: (2025)
by: Zhang, Xiaowu, et al.
Published: (2025)
Pointer-CAD: Unifying B-Rep and Command Sequences via Pointer-based Edges & Faces Selection
by: Qi, Dacheng, et al.
Published: (2026)
by: Qi, Dacheng, et al.
Published: (2026)
Characterizing Memorization in Diffusion Language Models: Generalized Extraction and Sampling Effects
by: Luo, Xiaoyu, et al.
Published: (2026)
by: Luo, Xiaoyu, et al.
Published: (2026)
Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding
by: Xu, Hongshen, et al.
Published: (2024)
by: Xu, Hongshen, et al.
Published: (2024)
Enhancing EEG-to-Text Decoding through Transferable Representations from Pre-trained Contrastive EEG-Text Masked Autoencoder
by: Wang, Jiaqi, et al.
Published: (2024)
by: Wang, Jiaqi, et al.
Published: (2024)
DAST: Difficulty-Aware Self-Training on Large Language Models
by: Xue, Boyang, et al.
Published: (2025)
by: Xue, Boyang, et al.
Published: (2025)
GM-PRM: A Generative Multimodal Process Reward Model for Multimodal Mathematical Reasoning
by: Zhang, Jianghangfan, et al.
Published: (2025)
by: Zhang, Jianghangfan, et al.
Published: (2025)
On Data Synthesis and Post-training for Visual Abstract Reasoning
by: Zhu, Ke, et al.
Published: (2025)
by: Zhu, Ke, et al.
Published: (2025)
Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated Texts
by: Cha, Taehun, et al.
Published: (2024)
by: Cha, Taehun, et al.
Published: (2024)
Rethinking Expert Trajectory Utilization in LLM Post-training for Mathematical Reasoning
by: Ding, Bowen, et al.
Published: (2025)
by: Ding, Bowen, et al.
Published: (2025)
Difficulty-Based Preference Data Selection by DPO Implicit Reward Gap
by: Qi, Xuan, et al.
Published: (2025)
by: Qi, Xuan, et al.
Published: (2025)
RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation
by: Liu, Fanfan, et al.
Published: (2024)
by: Liu, Fanfan, et al.
Published: (2024)
Revisiting Classification Taxonomy for Grammatical Errors
by: Zou, Deqing, et al.
Published: (2025)
by: Zou, Deqing, et al.
Published: (2025)
Breaking the Pre-Sampling Barrier: Activation-Informed Difficulty-Aware Self-Consistency
by: Yoon, Taewoong, et al.
Published: (2026)
by: Yoon, Taewoong, et al.
Published: (2026)
Got Compute, but No Data: Lessons From Post-training a Finnish LLM
by: Zosa, Elaine, et al.
Published: (2025)
by: Zosa, Elaine, et al.
Published: (2025)
SampleMix: A Sample-wise Pre-training Data Mixing Strategey by Coordinating Data Quality and Diversity
by: Xi, Xiangyu, et al.
Published: (2025)
by: Xi, Xiangyu, et al.
Published: (2025)
Caption First, VQA Second: Knowledge Density, Not Task Format, Drives Multimodal Scaling
by: Zou, Hongjian, et al.
Published: (2026)
by: Zou, Hongjian, et al.
Published: (2026)
Improving Similar Case Retrieval Ranking Performance By Revisiting RankSVM
by: Liu, Yuqi, et al.
Published: (2025)
by: Liu, Yuqi, et al.
Published: (2025)
DARO: Difficulty-Aware Reweighting Policy Optimization
by: Zhou, Jingyu, et al.
Published: (2025)
by: Zhou, Jingyu, et al.
Published: (2025)
DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training
by: Tian, Xiaoyu, et al.
Published: (2025)
by: Tian, Xiaoyu, et al.
Published: (2025)
RWKV-X: A Linear Complexity Hybrid Language Model
by: Hou, Haowen, et al.
Published: (2025)
by: Hou, Haowen, et al.
Published: (2025)
ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping
by: Chen, Shuang, et al.
Published: (2025)
by: Chen, Shuang, et al.
Published: (2025)
Predictive Data Selection: The Data That Predicts Is the Data That Teaches
by: Shum, Kashun, et al.
Published: (2025)
by: Shum, Kashun, et al.
Published: (2025)
Anchoring Bias in Large Language Models: An Experimental Study
by: Lou, Jiaxu, et al.
Published: (2024)
by: Lou, Jiaxu, et al.
Published: (2024)
CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following
by: Zhang, Kaiyan, et al.
Published: (2024)
by: Zhang, Kaiyan, et al.
Published: (2024)
ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning
by: Liu, Hongwei, et al.
Published: (2025)
by: Liu, Hongwei, et al.
Published: (2025)
CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models
by: Gu, Jiawei, et al.
Published: (2024)
by: Gu, Jiawei, et al.
Published: (2024)
3DS: Medical Domain Adaptation of LLMs via Decomposed Difficulty-based Data Selection
by: Ding, Hongxin, et al.
Published: (2024)
by: Ding, Hongxin, et al.
Published: (2024)
TimeBill: Time-Budgeted Inference for Large Language Models
by: Fan, Qi, et al.
Published: (2025)
by: Fan, Qi, et al.
Published: (2025)
Similar Items
-
Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations
by: Cai, Wenrui, et al.
Published: (2025) -
Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models
by: Zheng, Junhao, et al.
Published: (2023) -
Revisiting Self-Play Preference Optimization: On the Role of Prompt Difficulty
by: Xiao, Yao, et al.
Published: (2025) -
On Predicting the Post-training Potential of Pre-trained LLMs
by: Li, Xiaoyuan, et al.
Published: (2026) -
Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study
by: An, Keyu, et al.
Published: (2024)