:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Liu, Jia, He, ChangYi, Lin, YingQiao, Yang, MingMin, Shen, FeiYang, Liu, ShaoGuo
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2508.11356
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Mechanical Properties and Deformation Behavior of a Novel 3D Printed Tubular TPMS Structure
by: ShaoGuo Zhang, et al.
Published: (2026)

Chain of Time: In-Context Physical Simulation with Image Generation Models
by: Wang, YingQiao, et al.
Published: (2025)

Learn Faster and Remember More: Balancing Exploration and Exploitation for Continual Test-time Adaptation
by: Yang, Pinci, et al.
Published: (2025)

$ϕ$-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation
by: Xu, Fangzhi, et al.
Published: (2025)

Query Decomposition for RAG: Balancing Exploration-Exploitation
by: Petcu, Roxana, et al.
Published: (2025)

The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective
by: Yan, Renye, et al.
Published: (2024)

Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding
by: Nguyen, Ha-Thanh, et al.
Published: (2024)

Enhancing Adversarial Transferability by Balancing Exploration and Exploitation with Gradient-Guided Sampling
by: Niu, Zenghao, et al.
Published: (2025)

In-context Exploration-Exploitation for Reinforcement Learning
by: Dai, Zhenwen, et al.
Published: (2024)

Scale-Adaptive Balancing of Exploration and Exploitation in Classical Planning
by: Wissow, Stephen, et al.
Published: (2023)

Exploration, Exploitation, and Organizational Coordination Mechanisms
by: Silvio Popadiuk
Published: (2016)

WESE: Weak Exploration to Strong Exploitation for LLM Agents
by: Huang, Xu, et al.
Published: (2024)

SELF-REDRAFT: Eliciting Intrinsic Exploration-Exploitation Balance in Test-Time Scaling for Code Generation
by: Chen, Yixiang, et al.
Published: (2025)

Landmark Guided Active Exploration with State-specific Balance Coefficient
by: Cui, Fei, et al.
Published: (2023)

MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation
by: Yang, Lu, et al.
Published: (2026)

Semantic-Space Exploration and Exploitation in RLVR for LLM Reasoning
by: Huang, Fanding, et al.
Published: (2025)

ORBIT: On-policy Exploration-Exploitation for Controllable Multi-Budget Reasoning
by: Liang, Kun, et al.
Published: (2026)

ExpLang: Improved Exploration and Exploitation in LLM Reasoning with On-Policy Thinking Language Selection
by: Gao, Changjiang, et al.
Published: (2026)

Energy Exploration & Exploitation
Published: (2020)

A Goal-Oriented Approach for Active Object Detection with Exploration-Exploitation Balance
by: Yu, Yalei, et al.
Published: (2025)

Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models
by: Chen, Zhipeng, et al.
Published: (2025)

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
by: Zeng, Weihao, et al.
Published: (2024)

A Balanced Approach of Rapid Genetic Exploration and Surrogate Exploitation for Hyperparameter Optimization
by: Kim, Chul, et al.
Published: (2025)

Belief Dynamics Reveal the Dual Nature of In-Context Learning and Activation Steering
by: Bigelow, Eric, et al.
Published: (2025)

Exploration vs Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward
by: Chen, Peter, et al.
Published: (2025)

Engineered Protein Fibers with Reinforced Mechanical Properties Via β‐Sheet High‐Order Assembly
by: Ming Li, et al.
Published: (2024)

Improving Policy Exploitation in Online Reinforcement Learning with Instant Retrospect Action
by: Gao, Gong, et al.
Published: (2026)

DGRO: Enhancing LLM Reasoning via Exploration-Exploitation Control and Reward Variance Management
by: Su, Xuerui, et al.
Published: (2025)

From Exploration to Exploitation: A Two-Stage Entropy RLVR Approach for Noise-Tolerant MLLM Training
by: Xu, Donglai, et al.
Published: (2025)

HarnessLLM: Automatic Testing Harness Generation via Reinforcement Learning
by: Liu, Yujian, et al.
Published: (2025)

EEA: Exploration-Exploitation Agent for Long Video Understanding
by: Yang, Te, et al.
Published: (2025)

MRSO: Balancing Exploration and Exploitation through Modified Rat Swarm Optimization for Global Optimization
by: Abdulla, Hemin Sardar, et al.
Published: (2024)

ECHO: Entropy-Confidence Hybrid Optimization for Test-Time Reinforcement Learning
by: Zhao, Chu, et al.
Published: (2026)

LLM-Empowered State Representation for Reinforcement Learning
by: Wang, Boyuan, et al.
Published: (2024)

Disentangling Exploration from Exploitation
by: Lizzeri, Alessandro, et al.
Published: (2024)

Marine Exploration and Exploitation of Hydrocarbons
by: Radovich, Violeta S.
Published: (2025)

Organizational Factors for Exploration and Exploitation
by: Sharadindu Pandey
Published: (2009)

Real-Time Auto-Optimization in Unknown Environments via Structure-Exploiting Dual Control for Exploration and Exploitation
by: Dong, Shiying, et al.
Published: (2026)

Policy Split: Incentivizing Dual-Mode Exploration in LLM Reinforcement with Dual-Mode Entropy Regularization
by: Yao, Jiashu, et al.
Published: (2026)

Maximizing Local Entropy Where It Matters: Prefix-Aware Localized LLM Unlearning
by: Zhai, Naixin, et al.
Published: (2026)