Guardado en:
| Autores principales: | Guo, Haotian, Liu, Hui |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2509.21010 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Agentic reinforcement learning empowers next-generation chemical language models for molecular design and synthesis
por: Li, Hao, et al.
Publicado: (2026)
por: Li, Hao, et al.
Publicado: (2026)
Adaptive traffic signal safety and efficiency improvement by multi objective deep reinforcement learning approach
por: Mirbakhsh, Shahin, et al.
Publicado: (2024)
por: Mirbakhsh, Shahin, et al.
Publicado: (2024)
Data-driven simulator of multi-animal behavior with unknown dynamics via offline and online reinforcement learning
por: Fujii, Keisuke, et al.
Publicado: (2025)
por: Fujii, Keisuke, et al.
Publicado: (2025)
The challenge of hidden gifts in multi-agent reinforcement learning
por: Malenfant, Dane, et al.
Publicado: (2025)
por: Malenfant, Dane, et al.
Publicado: (2025)
The impact of behavioral diversity in multi-agent reinforcement learning
por: Bettini, Matteo, et al.
Publicado: (2024)
por: Bettini, Matteo, et al.
Publicado: (2024)
Curriculum reinforcement learning with measurable task representation learning
por: Wen, Yongyan, et al.
Publicado: (2026)
por: Wen, Yongyan, et al.
Publicado: (2026)
TemporalPaD: a reinforcement-learning framework for temporal feature representation and dimension reduction
por: Mu, Xuechen, et al.
Publicado: (2024)
por: Mu, Xuechen, et al.
Publicado: (2024)
Normalization and effective learning rates in reinforcement learning
por: Lyle, Clare, et al.
Publicado: (2024)
por: Lyle, Clare, et al.
Publicado: (2024)
EnerBridge-DPO: Energy-Guided Protein Inverse Folding with Markov Bridges and Direct Preference Optimization
por: Rong, Dingyi, et al.
Publicado: (2025)
por: Rong, Dingyi, et al.
Publicado: (2025)
CAREL: Instruction-guided reinforcement learning with cross-modal auxiliary objectives
por: Saghafian, Armin, et al.
Publicado: (2024)
por: Saghafian, Armin, et al.
Publicado: (2024)
A modular framework for automated evaluation of procedural content generation in serious games with deep reinforcement learning agents
por: Kalafatis, Eleftherios, et al.
Publicado: (2025)
por: Kalafatis, Eleftherios, et al.
Publicado: (2025)
Designing an efficient and equitable humanitarian supply chain dynamically via reinforcement learning
por: Jin, Weijia
Publicado: (2025)
por: Jin, Weijia
Publicado: (2025)
Ensemble Elastic DQN: A novel multi-step ensemble approach to address overestimation in deep value-based reinforcement learning
por: Ly, Adrian, et al.
Publicado: (2025)
por: Ly, Adrian, et al.
Publicado: (2025)
Soft $Q(λ)$: A multi-step off-policy method for entropy regularised reinforcement learning using eligibility traces
por: Mahajan, Pranav, et al.
Publicado: (2026)
por: Mahajan, Pranav, et al.
Publicado: (2026)
Bridging Dynamics Gaps via Diffusion Schrödinger Bridge for Cross-Domain Reinforcement Learning
por: Zhang, Hanping, et al.
Publicado: (2026)
por: Zhang, Hanping, et al.
Publicado: (2026)
Counterfactual experience augmented off-policy reinforcement learning
por: Lee, Sunbowen, et al.
Publicado: (2025)
por: Lee, Sunbowen, et al.
Publicado: (2025)
Bellman operator convergence enhancements in reinforcement learning algorithms
por: Kadurha, David Krame, et al.
Publicado: (2025)
por: Kadurha, David Krame, et al.
Publicado: (2025)
Causal prompting model-based offline reinforcement learning
por: Yu, Xuehui, et al.
Publicado: (2024)
por: Yu, Xuehui, et al.
Publicado: (2024)
Deep reinforcement learning with time-scale invariant memory
por: Kabir, Md Rysul, et al.
Publicado: (2024)
por: Kabir, Md Rysul, et al.
Publicado: (2024)
Delayed homomorphic reinforcement learning for environments with delayed feedback
por: Lee, Jongsoo, et al.
Publicado: (2026)
por: Lee, Jongsoo, et al.
Publicado: (2026)
Offline reinforcement learning for job-shop scheduling problems
por: Echeverria, Imanol, et al.
Publicado: (2024)
por: Echeverria, Imanol, et al.
Publicado: (2024)
Learning to summarize user information for personalized reinforcement learning from human feedback
por: Nam, Hyunji, et al.
Publicado: (2025)
por: Nam, Hyunji, et al.
Publicado: (2025)
Carbon-aware decentralized dynamic task offloading in MIMO-MEC networks via multi-agent reinforcement learning
por: Zulfiqar, Mubshra, et al.
Publicado: (2026)
por: Zulfiqar, Mubshra, et al.
Publicado: (2026)
Leveraging weights signals -- Predicting and improving generalizability in reinforcement learning
por: Moulin, Olivier, et al.
Publicado: (2025)
por: Moulin, Olivier, et al.
Publicado: (2025)
Dynamic feature selection in medical predictive monitoring by reinforcement learning
por: Chen, Yutong, et al.
Publicado: (2024)
por: Chen, Yutong, et al.
Publicado: (2024)
Economic span selection of bridge based on deep reinforcement learning
por: Zhang, Leye, et al.
Publicado: (2024)
por: Zhang, Leye, et al.
Publicado: (2024)
Not all tokens are needed(NAT): token efficient reinforcement learning
por: Sang, Hejian, et al.
Publicado: (2026)
por: Sang, Hejian, et al.
Publicado: (2026)
Survey on reinforcement learning for language processing
por: Uc-Cetina, Victor, et al.
Publicado: (2021)
por: Uc-Cetina, Victor, et al.
Publicado: (2021)
MOMA-AC: A preference-driven actor-critic framework for continuous multi-objective multi-agent reinforcement learning
por: Callaghan, Adam, et al.
Publicado: (2025)
por: Callaghan, Adam, et al.
Publicado: (2025)
Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning
por: Su, Xuerui, et al.
Publicado: (2025)
por: Su, Xuerui, et al.
Publicado: (2025)
Maximum diffusion reinforcement learning
por: Berrueta, Thomas A., et al.
Publicado: (2023)
por: Berrueta, Thomas A., et al.
Publicado: (2023)
Overcoming label shift with target-aware federated learning
por: Zec, Edvin Listo, et al.
Publicado: (2024)
por: Zec, Edvin Listo, et al.
Publicado: (2024)
An efficient deep reinforcement learning environment for flexible job-shop scheduling
por: Wu, Xinquan, et al.
Publicado: (2025)
por: Wu, Xinquan, et al.
Publicado: (2025)
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
por: Kobayashi, Seijin, et al.
Publicado: (2025)
por: Kobayashi, Seijin, et al.
Publicado: (2025)
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
por: Obando-Ceron, Johan, et al.
Publicado: (2024)
por: Obando-Ceron, Johan, et al.
Publicado: (2024)
Task diversity produces systematic transfer but inhibits continual reinforcement learning
por: Seth, Purab, et al.
Publicado: (2026)
por: Seth, Purab, et al.
Publicado: (2026)
Policy-shaped prediction: avoiding distractions in model-based reinforcement learning
por: Hutson, Miles, et al.
Publicado: (2024)
por: Hutson, Miles, et al.
Publicado: (2024)
Found-RL: foundation model-enhanced reinforcement learning for autonomous driving
por: Qu, Yansong, et al.
Publicado: (2026)
por: Qu, Yansong, et al.
Publicado: (2026)
An advantage based policy transfer algorithm for reinforcement learning with measures of transferability
por: Alam, Md Ferdous, et al.
Publicado: (2023)
por: Alam, Md Ferdous, et al.
Publicado: (2023)
AOAD-MAT: Transformer-based multi-agent deep reinforcement learning model considering agents' order of action decisions
por: Takayama, Shota, et al.
Publicado: (2025)
por: Takayama, Shota, et al.
Publicado: (2025)
Ejemplares similares
-
Agentic reinforcement learning empowers next-generation chemical language models for molecular design and synthesis
por: Li, Hao, et al.
Publicado: (2026) -
Adaptive traffic signal safety and efficiency improvement by multi objective deep reinforcement learning approach
por: Mirbakhsh, Shahin, et al.
Publicado: (2024) -
Data-driven simulator of multi-animal behavior with unknown dynamics via offline and online reinforcement learning
por: Fujii, Keisuke, et al.
Publicado: (2025) -
The challenge of hidden gifts in multi-agent reinforcement learning
por: Malenfant, Dane, et al.
Publicado: (2025) -
The impact of behavioral diversity in multi-agent reinforcement learning
por: Bettini, Matteo, et al.
Publicado: (2024)