Saved in:
| Main Authors: | Zhu, Dawei, Wu, Wenhao, Song, Yifan, Zhu, Fangwei, Cao, Ziqiang, Li, Sujian |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.00681 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
by: Zhu, Dawei, et al.
Published: (2023)
by: Zhu, Dawei, et al.
Published: (2023)
LongEmbed: Extending Embedding Models for Long Context Retrieval
by: Zhu, Dawei, et al.
Published: (2024)
by: Zhu, Dawei, et al.
Published: (2024)
AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories
by: Song, Yifan, et al.
Published: (2024)
by: Song, Yifan, et al.
Published: (2024)
Long Context Alignment with Short Instructions and Synthesized Positions
by: Wu, Wenhao, et al.
Published: (2024)
by: Wu, Wenhao, et al.
Published: (2024)
More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression
by: Zhang, Jiebin, et al.
Published: (2024)
by: Zhang, Jiebin, et al.
Published: (2024)
DocLens : A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding
by: Zhu, Dawei, et al.
Published: (2025)
by: Zhu, Dawei, et al.
Published: (2025)
CoLT: Reasoning with Chain of Latent Tool Calls
by: Zhu, Fangwei, et al.
Published: (2026)
by: Zhu, Fangwei, et al.
Published: (2026)
LongAttn: Selecting Long-context Training Data via Token-level Attention
by: Wu, Longyun, et al.
Published: (2025)
by: Wu, Longyun, et al.
Published: (2025)
EERPD: Leveraging Emotion and Emotion Regulation for Improving Personality Detection
by: Li, Zheng, et al.
Published: (2024)
by: Li, Zheng, et al.
Published: (2024)
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
by: Song, Yifan, et al.
Published: (2024)
by: Song, Yifan, et al.
Published: (2024)
PaperBanana: Automating Academic Illustration for AI Scientists
by: Zhu, Dawei, et al.
Published: (2026)
by: Zhu, Dawei, et al.
Published: (2026)
Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
by: Zhu, Dawei, et al.
Published: (2025)
by: Zhu, Dawei, et al.
Published: (2025)
Hierarchical Memory Organization for Wikipedia Generation
by: Yu, Eugene J., et al.
Published: (2025)
by: Yu, Eugene J., et al.
Published: (2025)
DFlare: Scaling Up Draft Capacity for Block Diffusion Speculative Decoding
by: Zhang, Jiebin, et al.
Published: (2026)
by: Zhang, Jiebin, et al.
Published: (2026)
Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement
by: Xiong, Weimin, et al.
Published: (2024)
by: Xiong, Weimin, et al.
Published: (2024)
Learning to Draft: Adaptive Speculative Decoding with Reinforcement Learning
by: Zhang, Jiebin, et al.
Published: (2026)
by: Zhang, Jiebin, et al.
Published: (2026)
UniICL: An Efficient Unified Framework Unifying Compression, Selection, and Generation
by: Gao, Jun, et al.
Published: (2024)
by: Gao, Jun, et al.
Published: (2024)
Router Upcycling: Leveraging Mixture-of-Routers in Mixture-of-Experts Upcycling
by: Ran, Junfeng, et al.
Published: (2025)
by: Ran, Junfeng, et al.
Published: (2025)
Reducing Hallucinations in Entity Abstract Summarization with Facts-Template Decomposition
by: Zhu, Fangwei, et al.
Published: (2024)
by: Zhu, Fangwei, et al.
Published: (2024)
Language Models Encode the Value of Numbers Linearly
by: Zhu, Fangwei, et al.
Published: (2024)
by: Zhu, Fangwei, et al.
Published: (2024)
NeuReasoner: Towards Explainable, Controllable, and Unified Reasoning via Mixture-of-Neurons
by: Dong, Haonan, et al.
Published: (2026)
by: Dong, Haonan, et al.
Published: (2026)
RICo: Refined In-Context Contribution for Automatic Instruction-Tuning Data Selection
by: Yang, Yixin, et al.
Published: (2025)
by: Yang, Yixin, et al.
Published: (2025)
CoMet: Metaphor-Driven Covert Communication for Multi-Agent Language Games
by: Xu, Shuhang, et al.
Published: (2025)
by: Xu, Shuhang, et al.
Published: (2025)
Chain-of-Thought Tokens are Computer Program Variables
by: Zhu, Fangwei, et al.
Published: (2025)
by: Zhu, Fangwei, et al.
Published: (2025)
LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking
by: Xin, Amy, et al.
Published: (2024)
by: Xin, Amy, et al.
Published: (2024)
WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World Scenario
by: Zhang, Jiebin, et al.
Published: (2024)
by: Zhang, Jiebin, et al.
Published: (2024)
MPO: Boosting LLM Agents with Meta Plan Optimization
by: Xiong, Weimin, et al.
Published: (2025)
by: Xiong, Weimin, et al.
Published: (2025)
More Vulnerable than You Think: On the Stability of Tool-Integrated LLM Agents
by: Xiong, Weimin, et al.
Published: (2025)
by: Xiong, Weimin, et al.
Published: (2025)
SelfCP: Compressing Over-Limit Prompt via the Frozen Large Language Model Itself
by: Gao, Jun, et al.
Published: (2024)
by: Gao, Jun, et al.
Published: (2024)
Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents
by: Song, Yifan, et al.
Published: (2024)
by: Song, Yifan, et al.
Published: (2024)
Discourse Coherence and Response-Guided Context Rewriting for Multi-Party Dialogue Generation
by: Cao, Zhiyu, et al.
Published: (2026)
by: Cao, Zhiyu, et al.
Published: (2026)
EAVIT: Efficient and Accurate Human Value Identification from Text data via LLMs
by: Zhu, Wenhao, et al.
Published: (2025)
by: Zhu, Wenhao, et al.
Published: (2025)
KNN-SSD: Enabling Dynamic Self-Speculative Decoding via Nearest Neighbor Layer Set Optimization
by: Song, Mingbo, et al.
Published: (2025)
by: Song, Mingbo, et al.
Published: (2025)
Coherent Multimodal Reasoning with Iterative Self-Evaluation for Vision-Language Models
by: Luo, Wenjie, et al.
Published: (2025)
by: Luo, Wenjie, et al.
Published: (2025)
FinRAGBench-V: A Benchmark for Multimodal RAG with Visual Citation in the Financial Domain
by: Zhao, Suifeng, et al.
Published: (2025)
by: Zhao, Suifeng, et al.
Published: (2025)
Unified Active Retrieval for Retrieval Augmented Generation
by: Cheng, Qinyuan, et al.
Published: (2024)
by: Cheng, Qinyuan, et al.
Published: (2024)
ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator
by: Zhu, Junda, et al.
Published: (2024)
by: Zhu, Junda, et al.
Published: (2024)
Reformulation for Pretraining Data Augmentation
by: Hao, Xintong, et al.
Published: (2025)
by: Hao, Xintong, et al.
Published: (2025)
Improving Grammatical Error Correction via Contextual Data Augmentation
by: Wang, Yixuan, et al.
Published: (2024)
by: Wang, Yixuan, et al.
Published: (2024)
Same evaluation, more tokens: On the effect of input length for machine translation evaluation using Large Language Models
by: Domhan, Tobias, et al.
Published: (2025)
by: Domhan, Tobias, et al.
Published: (2025)
Similar Items
-
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
by: Zhu, Dawei, et al.
Published: (2023) -
LongEmbed: Extending Embedding Models for Long Context Retrieval
by: Zhu, Dawei, et al.
Published: (2024) -
AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories
by: Song, Yifan, et al.
Published: (2024) -
Long Context Alignment with Short Instructions and Synthesized Positions
by: Wu, Wenhao, et al.
Published: (2024) -
More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression
by: Zhang, Jiebin, et al.
Published: (2024)