Saved in:
| Main Authors: | Miles, Roy, Toker, Aysim, Oncescu, Andreea-Maria, Xu, Songcen, Deng, Jiankang, Elezi, Ismail |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.22871 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SATGround: A Spatially-Aware Approach for Visual Grounding in Remote Sensing
by: Toker, Aysim, et al.
Published: (2025)
by: Toker, Aysim, et al.
Published: (2025)
$V_kD:$ Improving Knowledge Distillation using Orthogonal Projections
by: Miles, Roy, et al.
Published: (2024)
by: Miles, Roy, et al.
Published: (2024)
VeLoRA: Memory Efficient Training using Rank-1 Sub-Token Projections
by: Miles, Roy, et al.
Published: (2024)
by: Miles, Roy, et al.
Published: (2024)
A Benchmark for Deep Information Synthesis
by: Paul, Debjit, et al.
Published: (2026)
by: Paul, Debjit, et al.
Published: (2026)
From Attention to Activation: Unravelling the Enigmas of Large Language Models
by: Kaul, Prannay, et al.
Published: (2024)
by: Kaul, Prannay, et al.
Published: (2024)
Top 10 Open Challenges Steering the Future of Diffusion Language Model and Its Variants
by: Wang, Yunhe, et al.
Published: (2026)
by: Wang, Yunhe, et al.
Published: (2026)
Deep Active Learning: A Reality Check
by: Gashi, Edrina, et al.
Published: (2024)
by: Gashi, Edrina, et al.
Published: (2024)
CASteer: Cross-Attention Steering for Controllable Concept Erasure
by: Gaintseva, Tatiana, et al.
Published: (2025)
by: Gaintseva, Tatiana, et al.
Published: (2025)
RetouchLLM: Training-free Code-based Image Retouching with Vision Language Models
by: Ye-Bin, Moon, et al.
Published: (2025)
by: Ye-Bin, Moon, et al.
Published: (2025)
G3DR: Generative 3D Reconstruction in ImageNet
by: Reddy, Pradyumna, et al.
Published: (2024)
by: Reddy, Pradyumna, et al.
Published: (2024)
Logical Reasoning with Outcome Reward Models for Test-Time Scaling
by: Thatikonda, Ramya Keerthy, et al.
Published: (2025)
by: Thatikonda, Ramya Keerthy, et al.
Published: (2025)
Do You See What I Am Pointing At? Gesture-Based Egocentric Video Question Answering
by: Choi, Yura, et al.
Published: (2026)
by: Choi, Yura, et al.
Published: (2026)
MidSteer: Optimal Affine Framework for Steering Generative Models
by: Gaintseva, Tatiana, et al.
Published: (2026)
by: Gaintseva, Tatiana, et al.
Published: (2026)
A Stitch in Time Saves Nine: Proactive Self-Refinement for Language Models
by: Han, Jinyi, et al.
Published: (2025)
by: Han, Jinyi, et al.
Published: (2025)
Adaptive Test-Time Reasoning via Reward-Guided Dual-Phase Search
by: Cui, Yingqian, et al.
Published: (2025)
by: Cui, Yingqian, et al.
Published: (2025)
Entropy Centroids as Intrinsic Rewards for Test-Time Scaling
by: Zhao, Wenshuo, et al.
Published: (2026)
by: Zhao, Wenshuo, et al.
Published: (2026)
SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation
by: Toker, Aysim, et al.
Published: (2024)
by: Toker, Aysim, et al.
Published: (2024)
Inference-Time Scaling for Generalist Reward Modeling
by: Liu, Zijun, et al.
Published: (2025)
by: Liu, Zijun, et al.
Published: (2025)
DreamCAD: Scaling Multi-modal CAD Generation using Differentiable Parametric Surfaces
by: Khan, Mohammad Sadil, et al.
Published: (2026)
by: Khan, Mohammad Sadil, et al.
Published: (2026)
Duel-Evolve: Reward-Free Test-Time Scaling via LLM Self-Preferences
by: Karlekar, Sweta, et al.
Published: (2026)
by: Karlekar, Sweta, et al.
Published: (2026)
RTTC: Reward-Guided Collaborative Test-Time Compute
by: Muñoz, J. Pablo, et al.
Published: (2025)
by: Muñoz, J. Pablo, et al.
Published: (2025)
"Principal Components" Enable A New Language of Images
by: Wen, Xin, et al.
Published: (2025)
by: Wen, Xin, et al.
Published: (2025)
Leveraging Large Language Models for Rare Disease Named Entity Recognition
by: Xi, Nan Miles, et al.
Published: (2025)
by: Xi, Nan Miles, et al.
Published: (2025)
RFG: Test-Time Scaling for Diffusion Large Language Model Reasoning with Reward-Free Guidance
by: Chen, Tianlang, et al.
Published: (2025)
by: Chen, Tianlang, et al.
Published: (2025)
Bridging the Reasoning Gap in Vietnamese with Small Language Models via Test-Time Scaling
by: Trung, Bui The, et al.
Published: (2026)
by: Trung, Bui The, et al.
Published: (2026)
Three Heads Are Better Than One: Complementary Experts for Long-Tailed Semi-supervised Learning
by: Ma, Chengcheng, et al.
Published: (2023)
by: Ma, Chengcheng, et al.
Published: (2023)
UniScale: Adaptive Unified Inference Scaling via Online Joint Optimization of Model Routing and Test-Time Scaling
by: Huang, Kaiyu, et al.
Published: (2026)
by: Huang, Kaiyu, et al.
Published: (2026)
Teaching Models to Verbalize Reward Hacking in Chain-of-Thought Reasoning
by: Turpin, Miles, et al.
Published: (2025)
by: Turpin, Miles, et al.
Published: (2025)
R-Stitch: Dynamic Trajectory Stitching for Efficient Reasoning
by: Chen, Zhuokun, et al.
Published: (2025)
by: Chen, Zhuokun, et al.
Published: (2025)
DeLeaker: Dynamic Inference-Time Reweighting For Semantic Leakage Mitigation in Text-to-Image Models
by: Ventura, Mor, et al.
Published: (2025)
by: Ventura, Mor, et al.
Published: (2025)
Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence
by: Ghasemabadi, Amirhosein, et al.
Published: (2025)
by: Ghasemabadi, Amirhosein, et al.
Published: (2025)
BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents
by: Ou, Litu, et al.
Published: (2025)
by: Ou, Litu, et al.
Published: (2025)
Self-Rewarding Language Models
by: Yuan, Weizhe, et al.
Published: (2024)
by: Yuan, Weizhe, et al.
Published: (2024)
Finish First, Perfect Later: Test-Time Token-Level Cross-Validation for Diffusion Large Language Models
by: Tian, Runchu, et al.
Published: (2025)
by: Tian, Runchu, et al.
Published: (2025)
GradeSQL: Test-Time Inference with Outcome Reward Models for Text-to-SQL Generation from Large Language Models
by: Tritto, Mattia, et al.
Published: (2025)
by: Tritto, Mattia, et al.
Published: (2025)
Provable Scaling Laws for the Test-Time Compute of Large Language Models
by: Chen, Yanxi, et al.
Published: (2024)
by: Chen, Yanxi, et al.
Published: (2024)
Value-Aware Numerical Representations for Transformer Language Models
by: Dutulescu, Andreea, et al.
Published: (2026)
by: Dutulescu, Andreea, et al.
Published: (2026)
ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs
by: Xie, Yin, et al.
Published: (2024)
by: Xie, Yin, et al.
Published: (2024)
Color When It Counts: Grayscale-Guided Online Triggering for Always-On Streaming Video Sensing
by: Cai, Weitong, et al.
Published: (2026)
by: Cai, Weitong, et al.
Published: (2026)
Entropy Aware Reward Guidance for Diffusion Language Model Alignment
by: Tejaswi, Atula, et al.
Published: (2026)
by: Tejaswi, Atula, et al.
Published: (2026)
Similar Items
-
SATGround: A Spatially-Aware Approach for Visual Grounding in Remote Sensing
by: Toker, Aysim, et al.
Published: (2025) -
$V_kD:$ Improving Knowledge Distillation using Orthogonal Projections
by: Miles, Roy, et al.
Published: (2024) -
VeLoRA: Memory Efficient Training using Rank-1 Sub-Token Projections
by: Miles, Roy, et al.
Published: (2024) -
A Benchmark for Deep Information Synthesis
by: Paul, Debjit, et al.
Published: (2026) -
From Attention to Activation: Unravelling the Enigmas of Large Language Models
by: Kaul, Prannay, et al.
Published: (2024)