Saved in:
| Main Authors: | Tian, Jiayi, Su, Yupeng, Solgi, Ryan, Kundu, Souvik, Zhang, Zheng |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.16694 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Activation-Informed Pareto-Guided Low-Rank Compression for Efficient LLM/VLM
by: Solgi, Ryan, et al.
Published: (2025)
by: Solgi, Ryan, et al.
Published: (2025)
AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models
by: Liu, Zeyu, et al.
Published: (2024)
by: Liu, Zeyu, et al.
Published: (2024)
SkipKV: Selective Skipping of KV Generation and Storage for Efficient Inference with Large Reasoning Models
by: Tian, Jiayi, et al.
Published: (2025)
by: Tian, Jiayi, et al.
Published: (2025)
CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization
by: Yang, Zi, et al.
Published: (2024)
by: Yang, Zi, et al.
Published: (2024)
Let the Agent Steer: Closed-Loop Ranking Optimization via Influence Exchange
by: Cheng, Yin, et al.
Published: (2026)
by: Cheng, Yin, et al.
Published: (2026)
RGAlign-Rec: Ranking-Guided Alignment for Latent Query Reasoning in Recommendation Systems
by: Liu, Junhua, et al.
Published: (2026)
by: Liu, Junhua, et al.
Published: (2026)
OG-Rank: Learning to Rank Fast and Slow with Uncertainty and Reward-Trend Guided Adaptive Exploration
by: Singh, Praphul, et al.
Published: (2025)
by: Singh, Praphul, et al.
Published: (2025)
LGR: LLM-Guided Ranking of Frontiers for Object Goal Navigation
by: Uno, Mitsuaki, et al.
Published: (2025)
by: Uno, Mitsuaki, et al.
Published: (2025)
Joint Tensor-Train Parameterization for Efficient and Expressive Low-Rank Adaptation
by: Qi, Jun, et al.
Published: (2025)
by: Qi, Jun, et al.
Published: (2025)
Scientific Paper Retrieval with LLM-Guided Semantic-Based Ranking
by: Zhang, Yunyi, et al.
Published: (2025)
by: Zhang, Yunyi, et al.
Published: (2025)
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing
by: Zheng, Wenhao, et al.
Published: (2025)
by: Zheng, Wenhao, et al.
Published: (2025)
Policy-Guided Stepwise Model Routing for Cost-Effective Reasoning
by: Si, Wenwen, et al.
Published: (2026)
by: Si, Wenwen, et al.
Published: (2026)
Do Tensorized Large-Scale Spatiotemporal Dynamic Atmospheric Data Exhibit Low-Rank Properties?
by: Solgi, Ryan, et al.
Published: (2025)
by: Solgi, Ryan, et al.
Published: (2025)
SEAL: Steerable Reasoning Calibration of Large Language Models for Free
by: Chen, Runjin, et al.
Published: (2025)
by: Chen, Runjin, et al.
Published: (2025)
OGLS-SD: On-Policy Self-Distillation with Outcome-Guided Logit Steering for LLM Reasoning
by: Yang, Yuxiao, et al.
Published: (2026)
by: Yang, Yuxiao, et al.
Published: (2026)
ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability
by: Liu, Wenhan, et al.
Published: (2025)
by: Liu, Wenhan, et al.
Published: (2025)
Towards LLM-guided Efficient and Interpretable Multi-linear Tensor Network Rank Selection
by: Iacovides, Giorgos, et al.
Published: (2024)
by: Iacovides, Giorgos, et al.
Published: (2024)
Low-Rank Tensor Decompositions for the Theory of Neural Networks
by: Borsoi, Ricardo, et al.
Published: (2025)
by: Borsoi, Ricardo, et al.
Published: (2025)
Low Tensor-Rank Adaptation of Kolmogorov--Arnold Networks
by: Gao, Yihang, et al.
Published: (2025)
by: Gao, Yihang, et al.
Published: (2025)
Linearizing Models for Efficient yet Robust Private Inference
by: Sarkar, Sreetama, et al.
Published: (2024)
by: Sarkar, Sreetama, et al.
Published: (2024)
P-Guide: Parameter-Efficient Prior Steering for Single-Pass CFG Inference
by: Peng, Xin, et al.
Published: (2026)
by: Peng, Xin, et al.
Published: (2026)
Comprehensive Design Space Exploration for Tensorized Neural Network Hardware Accelerators
by: Zhang, Jinsong, et al.
Published: (2025)
by: Zhang, Jinsong, et al.
Published: (2025)
When Is Rank-1 Steering Cheap? Geometry, Granularity, and Budgeted Search
by: Robertson, John T., et al.
Published: (2026)
by: Robertson, John T., et al.
Published: (2026)
Less is More: Resource-Efficient Low-Rank Adaptation
by: Tian, Chunlin, et al.
Published: (2025)
by: Tian, Chunlin, et al.
Published: (2025)
Low-Rank Robust Subspace Tensor Clustering for Metro Passenger Flow Modeling
by: Hu, Jiuyun, et al.
Published: (2024)
by: Hu, Jiuyun, et al.
Published: (2024)
LoTR: Low Tensor Rank Weight Adaptation
by: Bershatsky, Daniel, et al.
Published: (2024)
by: Bershatsky, Daniel, et al.
Published: (2024)
LLM Evaluation as Tensor Completion: Low Rank Structure and Semiparametric Efficiency
by: Li, Jiachun, et al.
Published: (2026)
by: Li, Jiachun, et al.
Published: (2026)
Tensor-Compressed and Fully-Quantized Training of Neural PDE Solvers
by: Lu, Jinming, et al.
Published: (2025)
by: Lu, Jinming, et al.
Published: (2025)
LoRTA: Low Rank Tensor Adaptation of Large Language Models
by: Hounie, Ignacio, et al.
Published: (2024)
by: Hounie, Ignacio, et al.
Published: (2024)
Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement Learning
by: Rozada, Sergio, et al.
Published: (2022)
by: Rozada, Sergio, et al.
Published: (2022)
PiCSAR: Probabilistic Confidence Selection And Ranking for Reasoning Chains
by: Leang, Joshua Ong Jun, et al.
Published: (2025)
by: Leang, Joshua Ong Jun, et al.
Published: (2025)
TEON: Tensorized Orthonormalization Beyond Layer-Wise Muon for Large Language Model Pre-Training
by: Zhang, Ruijie, et al.
Published: (2026)
by: Zhang, Ruijie, et al.
Published: (2026)
CTR-LoRA: Curvature-Aware and Trust-Region Guided Low-Rank Adaptation for Large Language Models
by: Wang, Zhuxuanzi, et al.
Published: (2025)
by: Wang, Zhuxuanzi, et al.
Published: (2025)
Potential Outcome Rankings for Counterfactual Decision Making
by: Kawakami, Yuta, et al.
Published: (2025)
by: Kawakami, Yuta, et al.
Published: (2025)
RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty
by: Zhang, Ziqian, et al.
Published: (2026)
by: Zhang, Ziqian, et al.
Published: (2026)
HUMORCHAIN: Theory-Guided Multi-Stage Reasoning for Interpretable Multimodal Humor Generation
by: Zhang, Jiajun, et al.
Published: (2025)
by: Zhang, Jiajun, et al.
Published: (2025)
MetaLoRA: Tensor-Enhanced Adaptive Low-Rank Fine-tuning
by: Wang, Maolin, et al.
Published: (2025)
by: Wang, Maolin, et al.
Published: (2025)
Searching Meta Reasoning Skeleton to Guide LLM Reasoning
by: Zhang, Ziying, et al.
Published: (2025)
by: Zhang, Ziying, et al.
Published: (2025)
TVR-Ranking: A Dataset for Ranked Video Moment Retrieval with Imprecise Queries
by: Liang, Renjie, et al.
Published: (2024)
by: Liang, Renjie, et al.
Published: (2024)
LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models
by: Yang, Yifan, et al.
Published: (2024)
by: Yang, Yifan, et al.
Published: (2024)
Similar Items
-
Activation-Informed Pareto-Guided Low-Rank Compression for Efficient LLM/VLM
by: Solgi, Ryan, et al.
Published: (2025) -
AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models
by: Liu, Zeyu, et al.
Published: (2024) -
SkipKV: Selective Skipping of KV Generation and Storage for Efficient Inference with Large Reasoning Models
by: Tian, Jiayi, et al.
Published: (2025) -
CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization
by: Yang, Zi, et al.
Published: (2024) -
Let the Agent Steer: Closed-Loop Ranking Optimization via Influence Exchange
by: Cheng, Yin, et al.
Published: (2026)