Saved in:
| Main Authors: | Wan, Zhongwei, Yin, Yichun, Zhang, Wei, Shi, Jiaxin, Shang, Lifeng, Chen, Guangyong, Jiang, Xin, Liu, Qun |
|---|---|
| Format: | Preprint |
| Published: |
2022
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2212.03613 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Prompt-Based Length Controlled Generation with Multiple Control Types
by: Jie, Renlong, et al.
Published: (2024)
by: Jie, Renlong, et al.
Published: (2024)
Preparing Lessons for Progressive Training on Language Models
by: Pan, Yu, et al.
Published: (2024)
by: Pan, Yu, et al.
Published: (2024)
Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification
by: Liu, Chengwu, et al.
Published: (2025)
by: Liu, Chengwu, et al.
Published: (2025)
Visually Guided Generative Text-Layout Pre-training for Document Intelligence
by: Mao, Zhiming, et al.
Published: (2024)
by: Mao, Zhiming, et al.
Published: (2024)
Retrieval-based Disentangled Representation Learning with Natural Language Supervision
by: Zhou, Jiawei, et al.
Published: (2022)
by: Zhou, Jiawei, et al.
Published: (2022)
M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
by: Kwan, Wai-Chung, et al.
Published: (2023)
by: Kwan, Wai-Chung, et al.
Published: (2023)
Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving
by: Xu, Xin, et al.
Published: (2025)
by: Xu, Xin, et al.
Published: (2025)
FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
by: Jiang, Yuxin, et al.
Published: (2023)
by: Jiang, Yuxin, et al.
Published: (2023)
Thinking Augmented Pre-training
by: Wang, Liang, et al.
Published: (2025)
by: Wang, Liang, et al.
Published: (2025)
Memory Grafting: Scaling Language Model Pre-training via Offline Conditional Memory
by: Cheng, Runxi, et al.
Published: (2026)
by: Cheng, Runxi, et al.
Published: (2026)
MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models
by: Kwan, Wai-Chung, et al.
Published: (2024)
by: Kwan, Wai-Chung, et al.
Published: (2024)
Data Management For Training Large Language Models: A Survey
by: Wang, Zige, et al.
Published: (2023)
by: Wang, Zige, et al.
Published: (2023)
Switch Attention: Towards Dynamic and Fine-grained Hybrid Transformers
by: Zhao, Yusheng, et al.
Published: (2026)
by: Zhao, Yusheng, et al.
Published: (2026)
Gradually Excavating External Knowledge for Implicit Complex Question Answering
by: Liu, Chang, et al.
Published: (2026)
by: Liu, Chang, et al.
Published: (2026)
ARTIS: Agentic Risk-Aware Test-Time Scaling via Iterative Simulation
by: Zeng, Xingshan, et al.
Published: (2026)
by: Zeng, Xingshan, et al.
Published: (2026)
CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model
by: Chiang, Shang-Hsuan, et al.
Published: (2024)
by: Chiang, Shang-Hsuan, et al.
Published: (2024)
ToolACE-MT: Non-Autoregressive Generation for Agentic Multi-Turn Interaction
by: Zeng, Xingshan, et al.
Published: (2025)
by: Zeng, Xingshan, et al.
Published: (2025)
Domain Pre-training Impact on Representations
by: Gonzalez-Gutierrez, Cesar, et al.
Published: (2025)
by: Gonzalez-Gutierrez, Cesar, et al.
Published: (2025)
Effectiveness of Pre-training for Few-shot Intent Classification
by: Zhang, Haode, et al.
Published: (2021)
by: Zhang, Haode, et al.
Published: (2021)
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression
by: Wang, Xin, et al.
Published: (2024)
by: Wang, Xin, et al.
Published: (2024)
The Harder The Better: Maintaining Supervised Fine-tuning Generalization with Less but Harder Data
by: Shang, Zhaoyang, et al.
Published: (2025)
by: Shang, Zhaoyang, et al.
Published: (2025)
Improving Low-Resource Knowledge Tracing Tasks by Supervised Pre-training and Importance Mechanism Fine-tuning
by: Zhang, Hengyuan, et al.
Published: (2024)
by: Zhang, Hengyuan, et al.
Published: (2024)
PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models
by: Tan, Haochen, et al.
Published: (2024)
by: Tan, Haochen, et al.
Published: (2024)
Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory Graphs
by: Wang, Zheng, et al.
Published: (2024)
by: Wang, Zheng, et al.
Published: (2024)
Can Memory-Augmented Language Models Generalize on Reasoning-in-a-Haystack Tasks?
by: Das, Payel, et al.
Published: (2025)
by: Das, Payel, et al.
Published: (2025)
Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models
by: Li, Jiatao, et al.
Published: (2024)
by: Li, Jiatao, et al.
Published: (2024)
Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks
by: Yamaguchi, Atsuki, et al.
Published: (2026)
by: Yamaguchi, Atsuki, et al.
Published: (2026)
Towards Effective and Efficient Continual Pre-training of Large Language Models
by: Chen, Jie, et al.
Published: (2024)
by: Chen, Jie, et al.
Published: (2024)
Pre-trained Language Models for Keyphrase Generation: A Thorough Empirical Study
by: Wu, Di, et al.
Published: (2022)
by: Wu, Di, et al.
Published: (2022)
On Leveraging Encoder-only Pre-trained Language Models for Effective Keyphrase Generation
by: Wu, Di, et al.
Published: (2024)
by: Wu, Di, et al.
Published: (2024)
Enhancing Test-Time Scaling of Large Language Models with Hierarchical Retrieval-Augmented MCTS
by: Dou, Alex ZH, et al.
Published: (2025)
by: Dou, Alex ZH, et al.
Published: (2025)
Efficient Continual Pre-training for Building Domain Specific Large Language Models
by: Xie, Yong, et al.
Published: (2023)
by: Xie, Yong, et al.
Published: (2023)
SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression
by: Wang, Xin, et al.
Published: (2025)
by: Wang, Xin, et al.
Published: (2025)
Generative Pre-training for Speech with Flow Matching
by: Liu, Alexander H., et al.
Published: (2023)
by: Liu, Alexander H., et al.
Published: (2023)
Pre-training Limited Memory Language Models with Internal and External Knowledge
by: Zhao, Linxi, et al.
Published: (2025)
by: Zhao, Linxi, et al.
Published: (2025)
ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis
by: Wang, Zezhong, et al.
Published: (2024)
by: Wang, Zezhong, et al.
Published: (2024)
Superpixel Semantics Representation and Pre-training for Vision-Language Task
by: Zhang, Siyu, et al.
Published: (2023)
by: Zhang, Siyu, et al.
Published: (2023)
Learning to Align Multi-Faceted Evaluation: A Unified and Robust Framework
by: Xu, Kaishuai, et al.
Published: (2025)
by: Xu, Kaishuai, et al.
Published: (2025)
MaP: A Unified Framework for Reliable Evaluation of Pre-training Dynamics
by: Wang, Jiapeng, et al.
Published: (2025)
by: Wang, Jiapeng, et al.
Published: (2025)
Learning to Edit: Aligning LLMs with Knowledge Editing
by: Jiang, Yuxin, et al.
Published: (2024)
by: Jiang, Yuxin, et al.
Published: (2024)
Similar Items
-
Prompt-Based Length Controlled Generation with Multiple Control Types
by: Jie, Renlong, et al.
Published: (2024) -
Preparing Lessons for Progressive Training on Language Models
by: Pan, Yu, et al.
Published: (2024) -
Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification
by: Liu, Chengwu, et al.
Published: (2025) -
Visually Guided Generative Text-Layout Pre-training for Document Intelligence
by: Mao, Zhiming, et al.
Published: (2024) -
Retrieval-based Disentangled Representation Learning with Natural Language Supervision
by: Zhou, Jiawei, et al.
Published: (2022)