:: Library Catalog

Bibliographic Details
Main Authors:	Li, Haoran; Dong, Qingxiu; Tang, Zhengyang; Wang, Chaojun; Zhang, Xingxing; Huang, Haoyang; Huang, Shaohan; Huang, Xiaolong; Huang, Zeqiang; Zhang, Dongdong; Gu, Yuxian; Cheng, Xin; Wang, Xun; Chen, Si-Qing; Dong, Li; Lu, Wei; Sui, Zhifang; Wang, Benyou; Lam, Wai; Wei, Furu
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2402.13064

Similar Items

Self-Boosting Large Language Models with Synthetic Preference Data
by: Dong, Qingxiu, et al.
Published: (2024)

Online Experiential Learning for Language Models
by: Ye, Tianzhu, et al.
Published: (2026)

Reward Reasoning Model
by: Guo, Jiaxin, et al.
Published: (2025)

The Era of Agentic Organization: Learning to Organize with Language Models
by: Chi, Zewen, et al.
Published: (2025)

Towards Optimal Learning of Language Models
by: Gu, Yuxian, et al.
Published: (2024)

MathScale: Scaling Instruction Tuning for Mathematical Reasoning
by: Tang, Zhengyang, et al.
Published: (2024)

Data Selection via Optimal Control for Language Models
by: Gu, Yuxian, et al.
Published: (2024)

Think Only When You Need with Large Hybrid-Reasoning Models
by: Jiang, Lingjie, et al.
Published: (2025)

Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts
by: Zhang, Di, et al.
Published: (2025)

Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation
by: Wu, Xun, et al.
Published: (2024)

Mixture of LoRA Experts
by: Wu, Xun, et al.
Published: (2024)

VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models
by: Jiang, Lingjie, et al.
Published: (2025)

Chain-of-Dictionary Prompting Elicits Translation in Large Language Models
by: Lu, Hongyuan, et al.
Published: (2023)

On-Policy Context Distillation for Language Models
by: Ye, Tianzhu, et al.
Published: (2026)

Multi-Head Mixture-of-Experts
by: Wu, Xun, et al.
Published: (2024)

Reinforcement Pre-Training
by: Dong, Qingxiu, et al.
Published: (2025)

MiniLLM: On-Policy Distillation of Large Language Models
by: Gu, Yuxian, et al.
Published: (2023)

Thinking Augmented Pre-training
by: Wang, Liang, et al.
Published: (2025)

WildLong: Synthesizing Realistic Long-Context Instruction Data at Scale
by: Li, Jiaxi, et al.
Published: (2025)

Instruction Pre-Training: Language Models are Supervised Multitask Learners
by: Cheng, Daixuan, et al.
Published: (2024)

Black-Box On-Policy Distillation of Large Language Models
by: Ye, Tianzhu, et al.
Published: (2025)

On-Policy RL with Optimal Reward Baseline
by: Hao, Yaru, et al.
Published: (2025)

Decoding in Geometry: Alleviating Embedding-Space Crowding for Complex Reasoning
by: Yang, Yixin, et al.
Published: (2026)

BitNet Distillation
by: Wu, Xun, et al.
Published: (2025)

Textual Aesthetics in Large Language Models
by: Jiang, Lingjie, et al.
Published: (2024)

MH-MoE: Multi-Head Mixture-of-Experts
by: Huang, Shaohan, et al.
Published: (2024)

Bootstrap Your Own Context Length
by: Wang, Liang, et al.
Published: (2024)

Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity
by: Zhang, Di, et al.
Published: (2026)

Universal YOCO for Efficient Depth Scaling
by: Sun, Yutao, et al.
Published: (2026)

Scaling Laws of Synthetic Data for Language Models
by: Qin, Zeyu, et al.
Published: (2025)

Adapting Large Language Models to Domains via Reading Comprehension
by: Cheng, Daixuan, et al.
Published: (2023)

Kosmos-G: Generating Images in Context with Multimodal Large Language Models
by: Pan, Xichen, et al.
Published: (2023)

Can Large Multimodal Models Uncover Deep Semantics Behind Images?
by: Yang, Yixin, et al.
Published: (2024)

BitNet b1.58 2B4T Technical Report
by: Ma, Shuming, et al.
Published: (2025)

SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning
by: Li, Zheng, et al.
Published: (2025)

Multimodal Latent Language Modeling with Next-Token Diffusion
by: Sun, Yutao, et al.
Published: (2024)

Computer Environments Elicit General Agentic Intelligence in LLMs
by: Cheng, Daixuan, et al.
Published: (2026)

You Only Cache Once: Decoder-Decoder Architectures for Language Models
by: Sun, Yutao, et al.
Published: (2024)

RefineRL: Advancing Competitive Programming with Self-Refinement Reinforcement Learning
by: Fu, Shaopeng, et al.
Published: (2026)

RICo: Refined In-Context Contribution for Automatic Instruction-Tuning Data Selection
by: Yang, Yixin, et al.
Published: (2025)