| Main Authors: | Lyu, Zhiheng, Yang, Kevin, Kong, Lingpeng, Klein, Daniel |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.16347 |
Similar Items
Generating Long-form Story Using Dynamic Hierarchical Outlining with Memory-Enhancement
by: Wang, Qianyue, et al.
Published: (2024)
PixelWorld: How Far Are We from Perceiving Everything as Pixels?
by: Lyu, Zhiheng, et al.
Published: (2025)
Haste Makes Waste: Evaluating Planning Abilities of LLMs for Efficient and Feasible Multitasking with Time Constraints Between Actions
by: Wu, Zirui, et al.
Published: (2025)
Reasoning Does Not Necessarily Improve Role-Playing Ability
by: Feng, Xiachong, et al.
Published: (2025)
Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language Modeling
by: Prange, Jakob, et al.
Published: (2021)
Self-Infilling Code Generation
by: Zheng, Lin, et al.
Published: (2023)
Exploring the Reliability of Large Language Models as Customized Evaluators for Diverse NLP Tasks
by: Li, Qintong, et al.
Published: (2023)
Scaling Reasoning without Attention
by: Zhao, Xueliang, et al.
Published: (2025)
A Reparameterized Discrete Diffusion Model for Text Generation
by: Zheng, Lin, et al.
Published: (2023)
GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
by: Li, Qintong, et al.
Published: (2024)
PromptCoT: Synthesizing Olympiad-level Problems for Mathematical Reasoning in Large Language Models
by: Zhao, Xueliang, et al.
Published: (2025)
Advancing Precise Outline-Conditioned Text Generation with Task Duality and Explicit Outline Control
by: Li, Yunzhe, et al.
Published: (2023)
Non-myopic Generation of Language Models for Reasoning and Planning
by: Ma, Chang, et al.
Published: (2024)
PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning
by: Zhao, Xueliang, et al.
Published: (2025)
DynaAct: Large Language Model Reasoning with Dynamic Action Spaces
by: Zhao, Xueliang, et al.
Published: (2025)
Proxy Compression for Language Modeling
by: Zheng, Lin, et al.
Published: (2026)
DoPE: Denoising Rotary Position Embedding
by: Xiong, Jing, et al.
Published: (2025)
World Models for Math Story Problems
by: Opedal, Andreas, et al.
Published: (2023)
StoryLens: Preference-Aligned Story Rewriting via Context-Aware Narrative Enrichment
by: Cui, Hanwen, et al.
Published: (2026)
Data Augmentation of Multi-turn Psychological Dialogue via Knowledge-driven Progressive Thought Prompting
by: Jiang, Jiyue, et al.
Published: (2024)
SciRAG: Adaptive, Citation-Aware, and Outline-Guided Retrieval and Synthesis for Scientific Literature
by: Ding, Hang, et al.
Published: (2025)
DLM-Scope: Mechanistic Interpretability of Diffusion Language Models via Sparse Autoencoders
by: Wang, Xu, et al.
Published: (2026)
Training-Free Long-Context Scaling of Large Language Models
by: An, Chenxin, et al.
Published: (2024)
Reasoning Path Divergence: A New Metric and Curation Strategy to Unlock LLM Diverse Thinking
by: Ju, Feng, et al.
Published: (2025)
Topic Detection and Tracking with Time-Aware Document Embeddings
by: Jiang, Hang, et al.
Published: (2021)
Voices of Her: Analyzing Gender Differences in the AI Publication World
by: Ding, Yiwen, et al.
Published: (2023)
LoRA Meets Dropout under a Unified Framework
by: Wang, Sheng, et al.
Published: (2024)
Jailbreaking as a Reward Misspecification Problem
by: Xie, Zhihui, et al.
Published: (2024)
The Granularity Axis: A Micro-to-Macro Latent Direction for Social Roles in Language Models
by: Qin, Chonghan, et al.
Published: (2026)
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration
by: Sun, Qiushi, et al.
Published: (2023)
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
by: Gong, Shansan, et al.
Published: (2025)
BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation
by: Ran, Yiting, et al.
Published: (2025)
Self-Distilled Trajectory-Aware Boltzmann Modeling: Bridging the Training-Inference Discrepancy in Diffusion Language Models
by: Chen, Kecheng, et al.
Published: (2026)
UNComp: Can Matrix Entropy Uncover Sparsity? -- A Compressor Design from an Uncertainty-Aware Perspective
by: Xiong, Jing, et al.
Published: (2024)
StorySparkQA: Expert-Annotated QA Pairs with Real-World Knowledge for Children's Story-Based Learning
by: Chen, Jiaju, et al.
Published: (2023)
Why Does the Effective Context Length of LLMs Fall Short?
by: An, Chenxin, et al.
Published: (2024)
SubgoalXL: Subgoal-based Expert Learning for Theorem Proving
by: Zhao, Xueliang, et al.
Published: (2024)
Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
by: Li, Lei, et al.
Published: (2024)
ParallelComp: Parallel Long-Context Compressor for Length Extrapolation
by: Xiong, Jing, et al.
Published: (2025)
Teaching Language Models to Critique via Reinforcement Learning
by: Xie, Zhihui, et al.
Published: (2025)