:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhu, Dawei, Wu, Wenhao, Song, Yifan, Zhu, Fangwei, Cao, Ziqiang, Li, Sujian
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2404.00681
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
by: Zhu, Dawei, et al.
Published: (2023)

LongEmbed: Extending Embedding Models for Long Context Retrieval
by: Zhu, Dawei, et al.
Published: (2024)

AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories
by: Song, Yifan, et al.
Published: (2024)

Long Context Alignment with Short Instructions and Synthesized Positions
by: Wu, Wenhao, et al.
Published: (2024)

More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression
by: Zhang, Jiebin, et al.
Published: (2024)

DocLens : A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding
by: Zhu, Dawei, et al.
Published: (2025)

CoLT: Reasoning with Chain of Latent Tool Calls
by: Zhu, Fangwei, et al.
Published: (2026)

LongAttn: Selecting Long-context Training Data via Token-level Attention
by: Wu, Longyun, et al.
Published: (2025)

EERPD: Leveraging Emotion and Emotion Regulation for Improving Personality Detection
by: Li, Zheng, et al.
Published: (2024)

The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
by: Song, Yifan, et al.
Published: (2024)

PaperBanana: Automating Academic Illustration for AI Scientists
by: Zhu, Dawei, et al.
Published: (2026)

Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
by: Zhu, Dawei, et al.
Published: (2025)

Hierarchical Memory Organization for Wikipedia Generation
by: Yu, Eugene J., et al.
Published: (2025)

DFlare: Scaling Up Draft Capacity for Block Diffusion Speculative Decoding
by: Zhang, Jiebin, et al.
Published: (2026)

Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement
by: Xiong, Weimin, et al.
Published: (2024)

Learning to Draft: Adaptive Speculative Decoding with Reinforcement Learning
by: Zhang, Jiebin, et al.
Published: (2026)

UniICL: An Efficient Unified Framework Unifying Compression, Selection, and Generation
by: Gao, Jun, et al.
Published: (2024)

Router Upcycling: Leveraging Mixture-of-Routers in Mixture-of-Experts Upcycling
by: Ran, Junfeng, et al.
Published: (2025)

Reducing Hallucinations in Entity Abstract Summarization with Facts-Template Decomposition
by: Zhu, Fangwei, et al.
Published: (2024)

Language Models Encode the Value of Numbers Linearly
by: Zhu, Fangwei, et al.
Published: (2024)

NeuReasoner: Towards Explainable, Controllable, and Unified Reasoning via Mixture-of-Neurons
by: Dong, Haonan, et al.
Published: (2026)

RICo: Refined In-Context Contribution for Automatic Instruction-Tuning Data Selection
by: Yang, Yixin, et al.
Published: (2025)

CoMet: Metaphor-Driven Covert Communication for Multi-Agent Language Games
by: Xu, Shuhang, et al.
Published: (2025)

Chain-of-Thought Tokens are Computer Program Variables
by: Zhu, Fangwei, et al.
Published: (2025)

LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking
by: Xin, Amy, et al.
Published: (2024)

WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World Scenario
by: Zhang, Jiebin, et al.
Published: (2024)

MPO: Boosting LLM Agents with Meta Plan Optimization
by: Xiong, Weimin, et al.
Published: (2025)

More Vulnerable than You Think: On the Stability of Tool-Integrated LLM Agents
by: Xiong, Weimin, et al.
Published: (2025)

SelfCP: Compressing Over-Limit Prompt via the Frozen Large Language Model Itself
by: Gao, Jun, et al.
Published: (2024)

Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents
by: Song, Yifan, et al.
Published: (2024)

Discourse Coherence and Response-Guided Context Rewriting for Multi-Party Dialogue Generation
by: Cao, Zhiyu, et al.
Published: (2026)

EAVIT: Efficient and Accurate Human Value Identification from Text data via LLMs
by: Zhu, Wenhao, et al.
Published: (2025)

KNN-SSD: Enabling Dynamic Self-Speculative Decoding via Nearest Neighbor Layer Set Optimization
by: Song, Mingbo, et al.
Published: (2025)

Coherent Multimodal Reasoning with Iterative Self-Evaluation for Vision-Language Models
by: Luo, Wenjie, et al.
Published: (2025)

FinRAGBench-V: A Benchmark for Multimodal RAG with Visual Citation in the Financial Domain
by: Zhao, Suifeng, et al.
Published: (2025)

Unified Active Retrieval for Retrieval Augmented Generation
by: Cheng, Qinyuan, et al.
Published: (2024)

ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator
by: Zhu, Junda, et al.
Published: (2024)

Reformulation for Pretraining Data Augmentation
by: Hao, Xintong, et al.
Published: (2025)

Improving Grammatical Error Correction via Contextual Data Augmentation
by: Wang, Yixuan, et al.
Published: (2024)

Same evaluation, more tokens: On the effect of input length for machine translation evaluation using Large Language Models
by: Domhan, Tobias, et al.
Published: (2025)