Saved in:
| Main Authors: | Wang, Tao, Zhu, Lipeng, Li, Jiayong, Gao, Feng, Liang, Siwen |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.28822 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
'No' Matters: Out-of-Distribution Detection in Multimodality Long Dialogue
by: Gao, Rena, et al.
Published: (2024)
by: Gao, Rena, et al.
Published: (2024)
InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference
by: Pan, Xiurui, et al.
Published: (2024)
by: Pan, Xiurui, et al.
Published: (2024)
Multimodal Commonsense Knowledge Distillation for Visual Question Answering
by: Yang, Shuo, et al.
Published: (2024)
by: Yang, Shuo, et al.
Published: (2024)
Xuanwu: Evolving General Multimodal Models into an Industrial-Grade Foundation for Content Ecosystems
by: Zhang, Zhiqian, et al.
Published: (2026)
by: Zhang, Zhiqian, et al.
Published: (2026)
LCO: LLM-based Constraint Optimization for Safer Agentic LLMs in Real-world Tasks
by: Wan, Jiayong, et al.
Published: (2026)
by: Wan, Jiayong, et al.
Published: (2026)
On Cost-Effective LLM-as-a-Judge Improvement Techniques
by: Lail, Ryan, et al.
Published: (2026)
by: Lail, Ryan, et al.
Published: (2026)
MAGIC-VQA: Multimodal And Grounded Inference with Commonsense Knowledge for Visual Question Answering
by: Yang, Shuo, et al.
Published: (2025)
by: Yang, Shuo, et al.
Published: (2025)
A LLM-Powered Automatic Grading Framework with Human-Level Guidelines Optimization
by: Chu, Yucheng, et al.
Published: (2024)
by: Chu, Yucheng, et al.
Published: (2024)
3M-Health: Multimodal Multi-Teacher Knowledge Distillation for Mental Health Detection
by: Cabral, Rina Carines, et al.
Published: (2024)
by: Cabral, Rina Carines, et al.
Published: (2024)
Automatic Transmission for LLM Tiers: Optimizing Cost and Accuracy in Large Language Models
by: Na, Injae, et al.
Published: (2025)
by: Na, Injae, et al.
Published: (2025)
Enabling Real-Time Conversations with Minimal Training Costs
by: Xu, Wang, et al.
Published: (2024)
by: Xu, Wang, et al.
Published: (2024)
LLM Cache Bandit Revisited: Addressing Query Heterogeneity for Cost-Effective LLM Inference
by: Yang, Hantao, et al.
Published: (2025)
by: Yang, Hantao, et al.
Published: (2025)
FinLMM-R1: Enhancing Financial Reasoning in LMM through Scalable Data and Reward Design
by: Lan, Kai, et al.
Published: (2025)
by: Lan, Kai, et al.
Published: (2025)
PowerAttention: Exponentially Scaling of Receptive Fields for Effective Sparse Attention
by: Chen, Lida, et al.
Published: (2025)
by: Chen, Lida, et al.
Published: (2025)
Grading Scale Impact on LLM-as-a-Judge: Human-LLM Alignment Is Highest on 0-5 Grading Scale
by: Li, Weiyue, et al.
Published: (2026)
by: Li, Weiyue, et al.
Published: (2026)
Speculative Reward Model Boosts Decision Making Ability of LLMs Cost-Effectively
by: Gu, Jiawei, et al.
Published: (2025)
by: Gu, Jiawei, et al.
Published: (2025)
LLM-based Automated Grading with Human-in-the-Loop
by: Chu, Yucheng, et al.
Published: (2025)
by: Chu, Yucheng, et al.
Published: (2025)
PDF-MVQA: A Dataset for Multimodal Information Retrieval in PDF-based Visual Question Answering
by: Ding, Yihao, et al.
Published: (2024)
by: Ding, Yihao, et al.
Published: (2024)
VisualPRM: An Effective Process Reward Model for Multimodal Reasoning
by: Wang, Weiyun, et al.
Published: (2025)
by: Wang, Weiyun, et al.
Published: (2025)
ChunkLLM: A Lightweight Pluggable Framework for Accelerating LLMs Inference
by: Ouyang, Haojie, et al.
Published: (2025)
by: Ouyang, Haojie, et al.
Published: (2025)
Infant Agent: A Tool-Integrated, Logic-Driven Agent with Cost-Effective API Usage
by: Lei, Bin, et al.
Published: (2024)
by: Lei, Bin, et al.
Published: (2024)
Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers
by: He, Shwai, et al.
Published: (2024)
by: He, Shwai, et al.
Published: (2024)
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment
by: Zhang, Yi-Fan, et al.
Published: (2025)
by: Zhang, Yi-Fan, et al.
Published: (2025)
ImCoref-CeS: An Improved Lightweight Pipeline for Coreference Resolution with LLM-based Checker-Splitter Refinement
by: Luo, Kangyang, et al.
Published: (2025)
by: Luo, Kangyang, et al.
Published: (2025)
A Framework for Cost-Effective and Self-Adaptive LLM Shaking and Recovery Mechanism
by: Chen, Zhiyu, et al.
Published: (2024)
by: Chen, Zhiyu, et al.
Published: (2024)
TransCompressor: LLM-Powered Multimodal Data Compression for Smart Transportation
by: Yang, Huanqi, et al.
Published: (2024)
by: Yang, Huanqi, et al.
Published: (2024)
Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation
by: Dekoninck, Jasper, et al.
Published: (2024)
by: Dekoninck, Jasper, et al.
Published: (2024)
GradingAttack: Exposing Security Vulnerabilities in LLM Based Educational Grading Agents
by: Li, Xueyi, et al.
Published: (2026)
by: Li, Xueyi, et al.
Published: (2026)
Don't Start Over: A Cost-Effective Framework for Migrating Personalized Prompts Between LLMs
by: Zhao, Ziyi, et al.
Published: (2026)
by: Zhao, Ziyi, et al.
Published: (2026)
Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation
by: Chu, Yucheng, et al.
Published: (2025)
by: Chu, Yucheng, et al.
Published: (2025)
EmbeddingGemma: Powerful and Lightweight Text Representations
by: Vera, Henrique Schechter, et al.
Published: (2025)
by: Vera, Henrique Schechter, et al.
Published: (2025)
A Taxonomy of Prompt Defects in LLM Systems
by: Tian, Haoye, et al.
Published: (2025)
by: Tian, Haoye, et al.
Published: (2025)
Cascaded Self-Evaluation Augmented Training for Lightweight Multimodal LLMs
by: Lv, Zheqi, et al.
Published: (2025)
by: Lv, Zheqi, et al.
Published: (2025)
LLMs are Also Effective Embedding Models: An In-depth Overview
by: Tao, Chongyang, et al.
Published: (2024)
by: Tao, Chongyang, et al.
Published: (2024)
DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization
by: She, Shuaijie, et al.
Published: (2025)
by: She, Shuaijie, et al.
Published: (2025)
When Helpers Become Hazards: A Benchmark for Analyzing Multimodal LLM-Powered Safety in Daily Life
by: Lou, Xinyue, et al.
Published: (2026)
by: Lou, Xinyue, et al.
Published: (2026)
LLM-Powered Fully Automated Chaos Engineering: Towards Enabling Anyone to Build Resilient Software Systems at Low Cost
by: Kikuta, Daisuke, et al.
Published: (2025)
by: Kikuta, Daisuke, et al.
Published: (2025)
Forget NLI, Use a Dictionary: Zero-Shot Topic Classification for Low-Resource Languages with Application to Luxembourgish
by: Philippy, Fred, et al.
Published: (2024)
by: Philippy, Fred, et al.
Published: (2024)
Apollo: A Lightweight Multilingual Medical LLM towards Democratizing Medical AI to 6B People
by: Wang, Xidong, et al.
Published: (2024)
by: Wang, Xidong, et al.
Published: (2024)
Optimizing In-Context Demonstrations for LLM-based Automated Grading
by: Chu, Yucheng, et al.
Published: (2026)
by: Chu, Yucheng, et al.
Published: (2026)
Similar Items
-
'No' Matters: Out-of-Distribution Detection in Multimodality Long Dialogue
by: Gao, Rena, et al.
Published: (2024) -
InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference
by: Pan, Xiurui, et al.
Published: (2024) -
Multimodal Commonsense Knowledge Distillation for Visual Question Answering
by: Yang, Shuo, et al.
Published: (2024) -
Xuanwu: Evolving General Multimodal Models into an Industrial-Grade Foundation for Content Ecosystems
by: Zhang, Zhiqian, et al.
Published: (2026) -
LCO: LLM-based Constraint Optimization for Safer Agentic LLMs in Real-world Tasks
by: Wan, Jiayong, et al.
Published: (2026)