Saved in:
| Main Authors: | Chen, Hao Mark, Lu, Guanxi, Okoshi, Yasuyuki, Mo, Zhiwen, Motomura, Masato, Fan, Hongxiang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.11730 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Context Memorization for Efficient Long Context Generation
by: Okoshi, Yasuyuki, et al.
Published: (2026)
by: Okoshi, Yasuyuki, et al.
Published: (2026)
AQPIM: Breaking the PIM Capacity Wall for LLMs with In-Memory Activation Quantization
by: Matsushima, Kosuke, et al.
Published: (2026)
by: Matsushima, Kosuke, et al.
Published: (2026)
The Strong Lottery Ticket Hypothesis for Multi-Head Attention Mechanisms
by: Otsuka, Hikari, et al.
Published: (2025)
by: Otsuka, Hikari, et al.
Published: (2025)
FastTTS: Accelerating Test-Time Scaling for Edge LLM Reasoning
by: Chen, Hao Mark, et al.
Published: (2025)
by: Chen, Hao Mark, et al.
Published: (2025)
Binary Quadratic Quantization: Beyond First-Order Quantization for Real-Valued Matrix Compression
by: Kuroki, Kyo, et al.
Published: (2025)
by: Kuroki, Kyo, et al.
Published: (2025)
AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size
by: Lu, Guanxi, et al.
Published: (2025)
by: Lu, Guanxi, et al.
Published: (2025)
Partially Frozen Random Networks Contain Compact Strong Lottery Tickets
by: Otsuka, Hikari, et al.
Published: (2024)
by: Otsuka, Hikari, et al.
Published: (2024)
FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization
by: Chen, Hao Mark, et al.
Published: (2025)
by: Chen, Hao Mark, et al.
Published: (2025)
Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning
by: Chen, Feng, et al.
Published: (2025)
by: Chen, Feng, et al.
Published: (2025)
Enhancing Trustworthiness with Mixed Precision: Benchmarks, Opportunities, and Challenges
by: Lu, Guanxi, et al.
Published: (2025)
by: Lu, Guanxi, et al.
Published: (2025)
Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling
by: Wang, Xinglin, et al.
Published: (2025)
by: Wang, Xinglin, et al.
Published: (2025)
Advancing AI-assisted Hardware Design with Hierarchical Decentralized Training and Personalized Inference-Time Optimization
by: Chen, Hao Mark, et al.
Published: (2025)
by: Chen, Hao Mark, et al.
Published: (2025)
GLIMPSE: Holistic Cross-Modal Explainability for Large Vision-Language Models
by: Shen, Guanxi
Published: (2025)
by: Shen, Guanxi
Published: (2025)
Detecting and Mitigating the Correct-Answer Extinction Window in Test-Time Reinforcement Learning with Majority Voting
by: Lin, Hongxiang, et al.
Published: (2026)
by: Lin, Hongxiang, et al.
Published: (2026)
Rethinking the Unsolvable: When In-Context Search Meets Test-Time Scaling
by: Xia, Fanzeng, et al.
Published: (2025)
by: Xia, Fanzeng, et al.
Published: (2025)
Labels Matter More Than Models: Rethinking the Unsupervised Paradigm in Time Series Anomaly Detection
by: Zhong, Zhijie, et al.
Published: (2025)
by: Zhong, Zhijie, et al.
Published: (2025)
Iterative Deepening Sampling as Efficient Test-Time Scaling
by: Chen, Weizhe, et al.
Published: (2025)
by: Chen, Weizhe, et al.
Published: (2025)
Code Generation by Differential Test Time Scaling
by: He, Yifeng, et al.
Published: (2026)
by: He, Yifeng, et al.
Published: (2026)
Scaling Test-Time Compute for Agentic Coding
by: Kim, Joongwon, et al.
Published: (2026)
by: Kim, Joongwon, et al.
Published: (2026)
Provable Scaling Laws for the Test-Time Compute of Large Language Models
by: Chen, Yanxi, et al.
Published: (2024)
by: Chen, Yanxi, et al.
Published: (2024)
Beyond the Frontier: Stochastic Backtracking for Efficient Test-Time Scaling
by: Tran, Dao, et al.
Published: (2026)
by: Tran, Dao, et al.
Published: (2026)
Budget-aware Test-time Scaling via Discriminative Verification
by: Montgomery, Kyle, et al.
Published: (2025)
by: Montgomery, Kyle, et al.
Published: (2025)
Self-Trained Verification for Training- and Test-Time Self-Improvement
by: Wu, Chen Henry, et al.
Published: (2026)
by: Wu, Chen Henry, et al.
Published: (2026)
Log-Augmented Generation: Scaling Test-Time Reasoning with Reusable Computation
by: Chen, Peter Baile, et al.
Published: (2025)
by: Chen, Peter Baile, et al.
Published: (2025)
Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification
by: Zhao, Eric, et al.
Published: (2025)
by: Zhao, Eric, et al.
Published: (2025)
IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL
by: Cheng, Zhoujun, et al.
Published: (2026)
by: Cheng, Zhoujun, et al.
Published: (2026)
Towards Theoretical Understanding of Transformer Test-Time Computing: Investigation on In-Context Linear Regression
by: Chen, Xingwu, et al.
Published: (2025)
by: Chen, Xingwu, et al.
Published: (2025)
Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers
by: Lifshitz, Shalev, et al.
Published: (2025)
by: Lifshitz, Shalev, et al.
Published: (2025)
Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Probability Theory
by: Liu, Yexiang, et al.
Published: (2025)
by: Liu, Yexiang, et al.
Published: (2025)
Efficient Test-Time Scaling via Self-Calibration
by: Huang, Chengsong, et al.
Published: (2025)
by: Huang, Chengsong, et al.
Published: (2025)
Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification
by: Liang, Zhenwen, et al.
Published: (2024)
by: Liang, Zhenwen, et al.
Published: (2024)
PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency
by: Liu, Zhangyi, et al.
Published: (2026)
by: Liu, Zhangyi, et al.
Published: (2026)
Surprisal-Guided Selection: Compute-Optimal Test-Time Strategies for Execution-Grounded Code Generation
by: Barnes, Jarrod
Published: (2026)
by: Barnes, Jarrod
Published: (2026)
Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling
by: Rodkin, Ivan, et al.
Published: (2025)
by: Rodkin, Ivan, et al.
Published: (2025)
When More Thinking Hurts: Overthinking in LLM Test-Time Compute Scaling
by: Zhou, Shu, et al.
Published: (2026)
by: Zhou, Shu, et al.
Published: (2026)
Strategic Scaling of Test-Time Compute: A Bandit Learning Approach
by: Zuo, Bowen, et al.
Published: (2025)
by: Zuo, Bowen, et al.
Published: (2025)
BEST-Route: Adaptive LLM Routing with Test-Time Optimal Compute
by: Ding, Dujian, et al.
Published: (2025)
by: Ding, Dujian, et al.
Published: (2025)
Scales++: Compute Efficient Evaluation Subset Selection with Cognitive Scales Embeddings
by: Bean, Andrew M., et al.
Published: (2025)
by: Bean, Andrew M., et al.
Published: (2025)
Efficient Quantum Approximate $k$NN Algorithm via Granular-Ball Computing
by: Xia, Shuyin, et al.
Published: (2025)
by: Xia, Shuyin, et al.
Published: (2025)
MG-TSD: Multi-Granularity Time Series Diffusion Models with Guided Learning Process
by: Fan, Xinyao, et al.
Published: (2024)
by: Fan, Xinyao, et al.
Published: (2024)
Similar Items
-
Context Memorization for Efficient Long Context Generation
by: Okoshi, Yasuyuki, et al.
Published: (2026) -
AQPIM: Breaking the PIM Capacity Wall for LLMs with In-Memory Activation Quantization
by: Matsushima, Kosuke, et al.
Published: (2026) -
The Strong Lottery Ticket Hypothesis for Multi-Head Attention Mechanisms
by: Otsuka, Hikari, et al.
Published: (2025) -
FastTTS: Accelerating Test-Time Scaling for Edge LLM Reasoning
by: Chen, Hao Mark, et al.
Published: (2025) -
Binary Quadratic Quantization: Beyond First-Order Quantization for Real-Valued Matrix Compression
by: Kuroki, Kyo, et al.
Published: (2025)