:: Library Catalog

Bibliographic Details
Main Authors:	Zhao, Zhengyang; Ma, Lu; Jiang, Yizhen; Ma, Xiaochen; Meng, Zimo; Shen, Chengyu; Tang, Lexiang; Sun, Haoze; Pei, Peng; Zhang, Wentao
Format:	Preprint
Published:	2026
Subjects:	Machine Learning; Artificial Intelligence; Computation and Language
Online Access:	https://arxiv.org/abs/2601.09233

Similar Items

Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions
by: Ma, Lu, et al.
Published: (2025)

Training with Harnesses: On-Policy Harness Self-Distillation for Complex Reasoning
by: Zhao, Zhengyang, et al.
Published: (2026)

Leash: Adaptive Length Penalty and Reward Shaping for Efficient Large Reasoning Model
by: Li, Yanhao, et al.
Published: (2025)

GIFT: Games as Informal Training for Generalizable LLMs
by: Lyu, Nuoyan, et al.
Published: (2026)

Let's Verify Math Questions Step by Step
by: Shen, Chengyu, et al.
Published: (2025)

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models
by: Liang, Hao, et al.
Published: (2026)

Towards Next-Generation LLM Training: From the Data-Centric Perspective
by: Liang, Hao, et al.
Published: (2026)

DARO: Difficulty-Aware Reweighting Policy Optimization
by: Zhou, Jingyu, et al.
Published: (2025)

Semantic Surgery: Zero-Shot Concept Erasure in Diffusion Models
by: Xiong, Lexiang, et al.
Published: (2025)

Are Bigger Encoders Always Better in Vision Large Models?
by: Li, Bozhou, et al.
Published: (2024)

Uncovering Cross-Objective Interference in Multi-Objective Alignment
by: Lu, Yining, et al.
Published: (2026)

SpanNorm: Reconciling Training Stability and Performance in Deep Transformers
by: Wang, Chao, et al.
Published: (2026)

Faster and Better 3D Splatting via Group Training
by: Wang, Chengbo, et al.
Published: (2024)

Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers
by: Chen, Lei, et al.
Published: (2024)

Thinking by Subtraction: Confidence-Driven Contrastive Decoding for LLM Reasoning
by: Tang, Lexiang, et al.
Published: (2026)

TraceAV-Bench: Benchmarking Multi-Hop Trajectory Reasoning over Long Audio-Visual Videos
by: Feng, Hengyi, et al.
Published: (2026)

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
by: Liang, Hao, et al.
Published: (2025)

GIFT: Guided Importance-Aware Fine-Tuning for Diffusion Language Models
by: Xu, Guowei, et al.
Published: (2025)

ANDES: Agent Native Data Evolving Synthesis Tool for Autonomous Instruction Alignment
by: Zhao, Zhengyang, et al.
Published: (2026)

Not All Tokens and Heads Are Equally Important: Dual-Level Attention Intervention for Hallucination Mitigation
by: Tang, Lexiang, et al.
Published: (2025)

High-Temperature Gibbs States are Unentangled and Efficiently Preparable
by: Bakshi, Ainesh, et al.
Published: (2024)

Objective Metrics for Evaluating Large Language Models Using External Data Sources
by: Du, Haoze, et al.
Published: (2025)

Dynamic Perturbation-Adaptive Adversarial Training on Medical Image Classification
by: Li, Shuai, et al.
Published: (2024)

Boosting Few-Shot Segmentation via Instance-Aware Data Augmentation and Local Consensus Guided Cross Attention
by: Guo, Li, et al.
Published: (2024)

MetaMolGen: A Neural Graph Motif Generation Model for De Novo Molecular Design
by: Yan, Zimo, et al.
Published: (2025)

K12-KGraph: A Curriculum-Aligned Knowledge Graph for Benchmarking and Training Educational LLMs
by: Liang, Hao, et al.
Published: (2026)

GIFT: Global Irreplaceability Frame Targeting for Efficient Video Understanding
by: Ma, Junpeng, et al.
Published: (2026)

Spatial-Spectral Binarized Neural Network for Panchromatic and Multi-spectral Images Fusion
by: Jiang, Yizhen, et al.
Published: (2025)

Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing
by: Su, Tongtong, et al.
Published: (2025)

High-Temperature Fermionic Gibbs States are Mixtures of Gaussian States
by: Ramkumar, Akshar, et al.
Published: (2025)

Extrinsic derivatives for SDEs and SPDEs with distribution dependent noise
by: Ma, Xiaochen, et al.
Published: (2026)

Coupling Methods and Applications on Path Dependent McKean-Vlasov SDEs
by: Huang, Xing, et al.
Published: (2024)

From Saying to Communicating: The Generic Development of Classroom Academic Presentations by Chinese First‐Year College Students
by: Junming Ma, et al.
Published: (2025)

P‐5.10: A Method of Reducing the Warpage of Medium Size AMOLED Modules in High Temperature and Humidity Environment
by: Jianbing Ou, et al.
Published: (2024)

Robust Training for Speaker Verification against Noisy Labels
by: Fang, Zhihua, et al.
Published: (2022)

HetSSNet: Spatial-Spectral Heterogeneous Graph Learning Network for Panchromatic and Multispectral Images Fusion
by: Ma, Mengting, et al.
Published: (2025)

GIFT: Unlocking Full Potential of Labels in Distilled Dataset at Near-zero Cost
by: Shang, Xinyi, et al.
Published: (2024)

Dr. Post-Training: A Data Regularization Perspective on LLM Post-Training
by: Hu, Pingbang, et al.
Published: (2026)

HWL-HIN: A Hypergraph-Level Hypergraph Isomorphism Network as Powerful as the Hypergraph Weisfeiler-Lehman Test with Application to Higher-Order Network Robustness
by: Tian, Chengyu, et al.
Published: (2025)

A Factuality and Diversity Reconciled Decoding Method for Knowledge-Grounded Dialogue Generation
by: Yang, Chenxu, et al.
Published: (2024)