Saved in:
| Main Authors: | Zheng, Chen, Sun, Ke, Tang, Da, Ma, Yukun, Zhang, Yuyu, Xi, Chenguang, Zhou, Xun |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.02072 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Balancing Enhancement, Harmlessness, and General Capabilities: Enhancing Conversational LLMs with Direct RLHF
by: Zheng, Chen, et al.
Published: (2024)
by: Zheng, Chen, et al.
Published: (2024)
GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond
by: Zheng, Shen, et al.
Published: (2023)
by: Zheng, Shen, et al.
Published: (2023)
Mistral-C2F: Coarse to Fine Actor for Analytical and Reasoning Enhancement in RLHF and Effective-Merged LLMs
by: Zheng, Chen, et al.
Published: (2024)
by: Zheng, Chen, et al.
Published: (2024)
L0: Reinforcement Learning to Become General Agents
by: Zhang, Junjie, et al.
Published: (2025)
by: Zhang, Junjie, et al.
Published: (2025)
WildLong: Synthesizing Realistic Long-Context Instruction Data at Scale
by: Li, Jiaxi, et al.
Published: (2025)
by: Li, Jiaxi, et al.
Published: (2025)
TAG-INSTRUCT: Controlled Instruction Complexity Enhancement through Structure-based Augmentation
by: Zhu, He, et al.
Published: (2025)
by: Zhu, He, et al.
Published: (2025)
TRIMS: Trajectory-Ranked Instruction Masked Supervision for Diffusion Language Models
by: Chen, Lingjie, et al.
Published: (2026)
by: Chen, Lingjie, et al.
Published: (2026)
FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs
by: Gao, Zengyi, et al.
Published: (2025)
by: Gao, Zengyi, et al.
Published: (2025)
Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning
by: Li, Ming, et al.
Published: (2024)
by: Li, Ming, et al.
Published: (2024)
Complex Logical Instruction Generation
by: Zhang, Mian, et al.
Published: (2025)
by: Zhang, Mian, et al.
Published: (2025)
SketchFill: Sketch-Guided Code Generation for Imputing Derived Missing Values
by: Zhang, Yunfan, et al.
Published: (2024)
by: Zhang, Yunfan, et al.
Published: (2024)
Loss Masking Is Not Needed in Decoder-only Transformer for Discrete-token-based ASR
by: Chen, Qian, et al.
Published: (2023)
by: Chen, Qian, et al.
Published: (2023)
Skip-Layer Attention: Bridging Abstract and Detailed Dependencies in Transformers
by: Chen, Qian, et al.
Published: (2024)
by: Chen, Qian, et al.
Published: (2024)
$\mathcal{X}$-KD: General Experiential Knowledge Distillation for Large Language Models
by: Cai, Yuang, et al.
Published: (2026)
by: Cai, Yuang, et al.
Published: (2026)
In-Context Examples Matter: Improving Emotion Recognition in Conversation with Instruction Tuning
by: Ma, Hui, et al.
Published: (2025)
by: Ma, Hui, et al.
Published: (2025)
CRAM: Centroid-Routing and Adaptive MoE for Multimodal Continual Instruction Tuning
by: Tang, Jun-Tao, et al.
Published: (2026)
by: Tang, Jun-Tao, et al.
Published: (2026)
TransXSSM: A Hybrid Transformer State Space Model with Unified Rotary Position Embedding
by: Wu, Bingheng, et al.
Published: (2025)
by: Wu, Bingheng, et al.
Published: (2025)
SRAG: Structured Retrieval-Augmented Generation for Multi-Entity Question Answering over Wikipedia Graph
by: Lin, Teng, et al.
Published: (2025)
by: Lin, Teng, et al.
Published: (2025)
A High-Quality Text-Rich Image Instruction Tuning Dataset via Hybrid Instruction Generation
by: Zhou, Shijie, et al.
Published: (2024)
by: Zhou, Shijie, et al.
Published: (2024)
SkewRoute: Training-Free LLM Routing for Knowledge Graph Retrieval-Augmented Generation via Score Skewness of Retrieved Context
by: Wang, Hairu, et al.
Published: (2025)
by: Wang, Hairu, et al.
Published: (2025)
Beyond Single-shot Writing: Deep Research Agents are Unreliable at Multi-turn Report Revision
by: Chen, Bingsen, et al.
Published: (2026)
by: Chen, Bingsen, et al.
Published: (2026)
MAQInstruct: Instruction-based Unified Event Relation Extraction
by: Xu, Jun, et al.
Published: (2025)
by: Xu, Jun, et al.
Published: (2025)
Unraveling Text Generation in LLMs: A Stochastic Differential Equation Approach
by: Zhang, Yukun
Published: (2024)
by: Zhang, Yukun
Published: (2024)
Context-Driven Index Trimming: A Data Quality Perspective to Enhancing Precision of RALMs
by: Ma, Kexin, et al.
Published: (2024)
by: Ma, Kexin, et al.
Published: (2024)
Latent-Condensed Transformer for Efficient Long Context Modeling
by: You, Zeng, et al.
Published: (2026)
by: You, Zeng, et al.
Published: (2026)
A Reinforcement Learning-Driven Transformer GAN for Molecular Generation
by: Li, Chen, et al.
Published: (2025)
by: Li, Chen, et al.
Published: (2025)
FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference
by: Lai, Xunhao, et al.
Published: (2025)
by: Lai, Xunhao, et al.
Published: (2025)
RAISE: Reinforced Adaptive Instruction Selection For Large Language Models
by: Lv, Qingsong, et al.
Published: (2025)
by: Lv, Qingsong, et al.
Published: (2025)
Submodular Context Partitioning and Compression for In-Context Learning
by: Zheng, Shaoyi, et al.
Published: (2025)
by: Zheng, Shaoyi, et al.
Published: (2025)
To Trust or Not to Trust? Enhancing Large Language Models' Situated Faithfulness to External Contexts
by: Huang, Yukun, et al.
Published: (2024)
by: Huang, Yukun, et al.
Published: (2024)
Privacy-Preserving Instructions for Aligning Large Language Models
by: Yu, Da, et al.
Published: (2024)
by: Yu, Da, et al.
Published: (2024)
SEIF: Self-Evolving Reinforcement Learning for Instruction Following
by: Ren, Qingyu, et al.
Published: (2026)
by: Ren, Qingyu, et al.
Published: (2026)
Grounding Long-Context Reasoning with Contextual Normalization for Retrieval-Augmented Generation
by: Chen, Jiamin, et al.
Published: (2025)
by: Chen, Jiamin, et al.
Published: (2025)
Joint Enhancement of Relational Reasoning for Long-Context LLMs
by: Chen, Zhirui, et al.
Published: (2025)
by: Chen, Zhirui, et al.
Published: (2025)
Slang Context-based Inference Enhancement via Greedy Search-Guided Chain-of-Thought Prompting
by: Cao, Jinghan, et al.
Published: (2026)
by: Cao, Jinghan, et al.
Published: (2026)
On the Loss of Context-awareness in General Instruction Fine-tuning
by: Wang, Yihan, et al.
Published: (2024)
by: Wang, Yihan, et al.
Published: (2024)
UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation
by: Li, Zixuan, et al.
Published: (2024)
by: Li, Zixuan, et al.
Published: (2024)
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models
by: Li, Haoran, et al.
Published: (2024)
by: Li, Haoran, et al.
Published: (2024)
Prism: A Plug-in Reproducible Infrastructure for Scalable Multimodal Continual Instruction Tuning
by: Tang, Jun-Tao, et al.
Published: (2026)
by: Tang, Jun-Tao, et al.
Published: (2026)
ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context Information in Multi-Turn Multimodal Medical Dialogue
by: Li, Zhangpu, et al.
Published: (2024)
by: Li, Zhangpu, et al.
Published: (2024)
Similar Items
-
Balancing Enhancement, Harmlessness, and General Capabilities: Enhancing Conversational LLMs with Direct RLHF
by: Zheng, Chen, et al.
Published: (2024) -
GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond
by: Zheng, Shen, et al.
Published: (2023) -
Mistral-C2F: Coarse to Fine Actor for Analytical and Reasoning Enhancement in RLHF and Effective-Merged LLMs
by: Zheng, Chen, et al.
Published: (2024) -
L0: Reinforcement Learning to Become General Agents
by: Zhang, Junjie, et al.
Published: (2025) -
WildLong: Synthesizing Realistic Long-Context Instruction Data at Scale
by: Li, Jiaxi, et al.
Published: (2025)