| Main Authors: | Tian, Zihang; Li, Rui; Zhang, Jingsen; Bo, Xiaohe; Huo, Wei; Chen, Xu |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.05903 |
Similar Items
Prompt and Parameter Co-Optimization for Large Language Models
by: Bo, Xiaohe, et al.
Published: (2025)
CAM: A Constructivist View of Agentic Memory for LLM-Based Reading Comprehension
by: Li, Rui, et al.
Published: (2025)
Explainable Recommendation with Simulated Human Feedback
by: Tang, Jiakai, et al.
Published: (2025)
Towards Adaptive, Scalable, and Robust Coordination of LLM Agents: A Dynamic Ad-Hoc Networking Perspective
by: Li, Rui, et al.
Published: (2026)
Learn to Memorize: Optimizing LLM-based Agents with Adaptive Memory Framework
by: Zhang, Zeyu, et al.
Published: (2025)
MTRouter: Cost-Aware Multi-Turn LLM Routing with History-Model Joint Embeddings
by: Zhang, Yiqun, et al.
Published: (2026)
CharacterEval: A Chinese Benchmark for Role-Playing Conversational Agent Evaluation
by: Tu, Quan, et al.
Published: (2024)
Instructing the Architecture Search for Spatial-temporal Sequence Forecasting with LLM
by: Xue, Xin, et al.
Published: (2025)
A Status Quo Investigation of Large Language Models towards Cost-Effective CFD Automation with OpenFOAMGPT: ChatGPT vs. Qwen vs. Deepseek
by: Wang, Wenkang, et al.
Published: (2025)
Mnemis: Dual-Route Retrieval on Hierarchical Graphs for Long-Term LLM Memory
by: Tang, Zihao, et al.
Published: (2026)
Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing
by: Lai, Kunfeng, et al.
Published: (2025)
AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search
by: Li, Yu, et al.
Published: (2025)
Select-then-Solve: Paradigm Routing as Inference-Time Optimization for LLM Agents
by: Zhou, Heng, et al.
Published: (2026)
Expectation Confirmation Preference Optimization for Multi-Turn Conversational Recommendation Agent
by: Feng, Xueyang, et al.
Published: (2025)
LLM Performance Predictors are good initializers for Architecture Search
by: Jawahar, Ganesh, et al.
Published: (2023)
Recognizing Limits: Investigating Infeasibility in Large Language Models
by: Zhang, Wenbo, et al.
Published: (2024)
Enhancing Recommendation Explanations through User-Centric Refinement
by: Zhang, Jingsen, et al.
Published: (2025)
TweakLLM: A Routing Architecture for Dynamic Tailoring of Cached Responses
by: Cheema, Muhammad Taha, et al.
Published: (2025)
Route to Reason: Adaptive Routing for LLM and Reasoning Strategy Selection
by: Pan, Zhihong, et al.
Published: (2025)
TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision
by: Zhang, Yunyi, et al.
Published: (2024)
DiSRouter: Distributed Self-Routing for LLM Selections
by: Zheng, Hang, et al.
Published: (2025)
DecoupleSearch: Decouple Planning and Search via Hierarchical Reward Modeling
by: Sun, Hao, et al.
Published: (2025)
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters
by: Song, Yixin, et al.
Published: (2024)
Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference
by: Zhang, Libo, et al.
Published: (2024)
RouteProfile: Graph-Based Profiling for Cold-Start LLM Routing
by: Xu, Jingjun, et al.
Published: (2026)
Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques
by: Behera, Adarsh Prasad, et al.
Published: (2025)
EvoRoute: Experience-Driven Self-Routing LLM Agent Systems
by: Zhang, Guibin, et al.
Published: (2026)
RouteLMT: Learned Sample Routing for Hybrid LLM Translation Deployment
by: Luo, Yingfeng, et al.
Published: (2026)
OmniRouter: Budget and Performance Controllable Multi-LLM Routing
by: Mei, Kai, et al.
Published: (2025)
From LLM to Conversational Agent: A Memory Enhanced Architecture with Fine-Tuning of Large Language Models
by: Liu, Na, et al.
Published: (2024)
Evaluating and Calibrating LLM Confidence on Questions with Multiple Correct Answers
by: Wang, Yuhan, et al.
Published: (2026)
Modeling Uncertainty Trends for Timely Retrieval in Dynamic RAG
by: Li, Bo, et al.
Published: (2025)
ZeroLM: Data-Free Transformer Architecture Search for Language Models
by: Chen, Zhen-Song, et al.
Published: (2025)
iFairy: the First 2-bit Complex LLM with All Parameters in $\{\pm1, \pm i\}$
by: Wang, Feiyu, et al.
Published: (2025)
RouteLLM: Learning to Route LLMs with Preference Data
by: Ong, Isaac, et al.
Published: (2024)
LLM-JEPA: Large Language Models Meet Joint Embedding Predictive Architectures
by: Huang, Hai, et al.
Published: (2025)
D2LLM: Decomposed and Distilled Large Language Models for Semantic Search
by: Liao, Zihan, et al.
Published: (2024)
Arch-Router: Aligning LLM Routing with Human Preferences
by: Tran, Co, et al.
Published: (2025)
Inner-Probe: Discovering Copyright-related Data Generation in LLM Architecture
by: Ma, Qichao, et al.
Published: (2024)
Subgraph Retrieval Enhanced by Graph-Text Alignment for Commonsense Question Answering
by: Peng, Boci, et al.
Published: (2024)