Saved in:
| Main Authors: | Liu, Jia, He, ChangYi, Lin, YingQiao, Yang, MingMin, Shen, FeiYang, Liu, ShaoGuo |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.11356 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Mechanical Properties and Deformation Behavior of a Novel 3D Printed Tubular TPMS Structure
by: ShaoGuo Zhang, et al.
Published: (2026)
by: ShaoGuo Zhang, et al.
Published: (2026)
Chain of Time: In-Context Physical Simulation with Image Generation Models
by: Wang, YingQiao, et al.
Published: (2025)
by: Wang, YingQiao, et al.
Published: (2025)
Learn Faster and Remember More: Balancing Exploration and Exploitation for Continual Test-time Adaptation
by: Yang, Pinci, et al.
Published: (2025)
by: Yang, Pinci, et al.
Published: (2025)
$ϕ$-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation
by: Xu, Fangzhi, et al.
Published: (2025)
by: Xu, Fangzhi, et al.
Published: (2025)
Query Decomposition for RAG: Balancing Exploration-Exploitation
by: Petcu, Roxana, et al.
Published: (2025)
by: Petcu, Roxana, et al.
Published: (2025)
The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective
by: Yan, Renye, et al.
Published: (2024)
by: Yan, Renye, et al.
Published: (2024)
Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding
by: Nguyen, Ha-Thanh, et al.
Published: (2024)
by: Nguyen, Ha-Thanh, et al.
Published: (2024)
Enhancing Adversarial Transferability by Balancing Exploration and Exploitation with Gradient-Guided Sampling
by: Niu, Zenghao, et al.
Published: (2025)
by: Niu, Zenghao, et al.
Published: (2025)
In-context Exploration-Exploitation for Reinforcement Learning
by: Dai, Zhenwen, et al.
Published: (2024)
by: Dai, Zhenwen, et al.
Published: (2024)
Scale-Adaptive Balancing of Exploration and Exploitation in Classical Planning
by: Wissow, Stephen, et al.
Published: (2023)
by: Wissow, Stephen, et al.
Published: (2023)
Exploration, Exploitation, and Organizational Coordination Mechanisms
by: Silvio Popadiuk
Published: (2016)
by: Silvio Popadiuk
Published: (2016)
WESE: Weak Exploration to Strong Exploitation for LLM Agents
by: Huang, Xu, et al.
Published: (2024)
by: Huang, Xu, et al.
Published: (2024)
SELF-REDRAFT: Eliciting Intrinsic Exploration-Exploitation Balance in Test-Time Scaling for Code Generation
by: Chen, Yixiang, et al.
Published: (2025)
by: Chen, Yixiang, et al.
Published: (2025)
Landmark Guided Active Exploration with State-specific Balance Coefficient
by: Cui, Fei, et al.
Published: (2023)
by: Cui, Fei, et al.
Published: (2023)
MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation
by: Yang, Lu, et al.
Published: (2026)
by: Yang, Lu, et al.
Published: (2026)
Semantic-Space Exploration and Exploitation in RLVR for LLM Reasoning
by: Huang, Fanding, et al.
Published: (2025)
by: Huang, Fanding, et al.
Published: (2025)
ORBIT: On-policy Exploration-Exploitation for Controllable Multi-Budget Reasoning
by: Liang, Kun, et al.
Published: (2026)
by: Liang, Kun, et al.
Published: (2026)
ExpLang: Improved Exploration and Exploitation in LLM Reasoning with On-Policy Thinking Language Selection
by: Gao, Changjiang, et al.
Published: (2026)
by: Gao, Changjiang, et al.
Published: (2026)
Energy Exploration & Exploitation
Published: (2020)
Published: (2020)
A Goal-Oriented Approach for Active Object Detection with Exploration-Exploitation Balance
by: Yu, Yalei, et al.
Published: (2025)
by: Yu, Yalei, et al.
Published: (2025)
Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models
by: Chen, Zhipeng, et al.
Published: (2025)
by: Chen, Zhipeng, et al.
Published: (2025)
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
by: Zeng, Weihao, et al.
Published: (2024)
by: Zeng, Weihao, et al.
Published: (2024)
A Balanced Approach of Rapid Genetic Exploration and Surrogate Exploitation for Hyperparameter Optimization
by: Kim, Chul, et al.
Published: (2025)
by: Kim, Chul, et al.
Published: (2025)
Belief Dynamics Reveal the Dual Nature of In-Context Learning and Activation Steering
by: Bigelow, Eric, et al.
Published: (2025)
by: Bigelow, Eric, et al.
Published: (2025)
Exploration vs Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward
by: Chen, Peter, et al.
Published: (2025)
by: Chen, Peter, et al.
Published: (2025)
Engineered Protein Fibers with Reinforced Mechanical Properties Via β‐Sheet High‐Order Assembly
by: Ming Li, et al.
Published: (2024)
by: Ming Li, et al.
Published: (2024)
Improving Policy Exploitation in Online Reinforcement Learning with Instant Retrospect Action
by: Gao, Gong, et al.
Published: (2026)
by: Gao, Gong, et al.
Published: (2026)
DGRO: Enhancing LLM Reasoning via Exploration-Exploitation Control and Reward Variance Management
by: Su, Xuerui, et al.
Published: (2025)
by: Su, Xuerui, et al.
Published: (2025)
From Exploration to Exploitation: A Two-Stage Entropy RLVR Approach for Noise-Tolerant MLLM Training
by: Xu, Donglai, et al.
Published: (2025)
by: Xu, Donglai, et al.
Published: (2025)
HarnessLLM: Automatic Testing Harness Generation via Reinforcement Learning
by: Liu, Yujian, et al.
Published: (2025)
by: Liu, Yujian, et al.
Published: (2025)
EEA: Exploration-Exploitation Agent for Long Video Understanding
by: Yang, Te, et al.
Published: (2025)
by: Yang, Te, et al.
Published: (2025)
MRSO: Balancing Exploration and Exploitation through Modified Rat Swarm Optimization for Global Optimization
by: Abdulla, Hemin Sardar, et al.
Published: (2024)
by: Abdulla, Hemin Sardar, et al.
Published: (2024)
ECHO: Entropy-Confidence Hybrid Optimization for Test-Time Reinforcement Learning
by: Zhao, Chu, et al.
Published: (2026)
by: Zhao, Chu, et al.
Published: (2026)
LLM-Empowered State Representation for Reinforcement Learning
by: Wang, Boyuan, et al.
Published: (2024)
by: Wang, Boyuan, et al.
Published: (2024)
Disentangling Exploration from Exploitation
by: Lizzeri, Alessandro, et al.
Published: (2024)
by: Lizzeri, Alessandro, et al.
Published: (2024)
Marine Exploration and Exploitation of Hydrocarbons
by: Radovich, Violeta S.
Published: (2025)
by: Radovich, Violeta S.
Published: (2025)
Organizational Factors for Exploration and Exploitation
by: Sharadindu Pandey
Published: (2009)
by: Sharadindu Pandey
Published: (2009)
Real-Time Auto-Optimization in Unknown Environments via Structure-Exploiting Dual Control for Exploration and Exploitation
by: Dong, Shiying, et al.
Published: (2026)
by: Dong, Shiying, et al.
Published: (2026)
Policy Split: Incentivizing Dual-Mode Exploration in LLM Reinforcement with Dual-Mode Entropy Regularization
by: Yao, Jiashu, et al.
Published: (2026)
by: Yao, Jiashu, et al.
Published: (2026)
Maximizing Local Entropy Where It Matters: Prefix-Aware Localized LLM Unlearning
by: Zhai, Naixin, et al.
Published: (2026)
by: Zhai, Naixin, et al.
Published: (2026)
Similar Items
-
Mechanical Properties and Deformation Behavior of a Novel 3D Printed Tubular TPMS Structure
by: ShaoGuo Zhang, et al.
Published: (2026) -
Chain of Time: In-Context Physical Simulation with Image Generation Models
by: Wang, YingQiao, et al.
Published: (2025) -
Learn Faster and Remember More: Balancing Exploration and Exploitation for Continual Test-time Adaptation
by: Yang, Pinci, et al.
Published: (2025) -
$ϕ$-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation
by: Xu, Fangzhi, et al.
Published: (2025) -
Query Decomposition for RAG: Balancing Exploration-Exploitation
by: Petcu, Roxana, et al.
Published: (2025)