Saved in:
| Main Authors: | Xu, Zhongwen, Wang, Xianliang, Li, Siyi, Yu, Tao, Wang, Liang, Fu, Qiang, Yang, Wei |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.13356 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Cogito, Ergo Ludo: An Agent that Learns to Play by Reasoning and Planning
by: Wang, Sai, et al.
Published: (2025)
by: Wang, Sai, et al.
Published: (2025)
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
by: Xu, Zelai, et al.
Published: (2023)
by: Xu, Zelai, et al.
Published: (2023)
Mean-Field Diffuser: Scaling Offline MARL to Thousands of Agents
by: Li, Wenhao, et al.
Published: (2026)
by: Li, Wenhao, et al.
Published: (2026)
Learning to play: A Multimodal Agent for 3D Game-Play
by: Yue, Yuguang, et al.
Published: (2025)
by: Yue, Yuguang, et al.
Published: (2025)
Playing Non-Embedded Card-Based Games with Reinforcement Learning
by: Wu, Tianyang, et al.
Published: (2025)
by: Wu, Tianyang, et al.
Published: (2025)
Maximum Entropy Heterogeneous-Agent Reinforcement Learning
by: Liu, Jiarong, et al.
Published: (2023)
by: Liu, Jiarong, et al.
Published: (2023)
Learning Game-Playing Agents with Generative Code Optimization
by: Kuang, Zhiyi, et al.
Published: (2025)
by: Kuang, Zhiyi, et al.
Published: (2025)
TextAtari: 100K Frames Game Playing with Language Agents
by: Li, Wenhao, et al.
Published: (2025)
by: Li, Wenhao, et al.
Published: (2025)
Do We Need Transformers to Play FPS Video Games?
by: Batth, Karmanbir, et al.
Published: (2025)
by: Batth, Karmanbir, et al.
Published: (2025)
Single-stream Policy Optimization
by: Xu, Zhongwen, et al.
Published: (2025)
by: Xu, Zhongwen, et al.
Published: (2025)
Understanding Tool-Integrated Reasoning
by: Lin, Heng, et al.
Published: (2025)
by: Lin, Heng, et al.
Published: (2025)
Convergence analysis of wide shallow neural operators within the framework of Neural Tangent Kernel
by: Xu, Xianliang, et al.
Published: (2024)
by: Xu, Xianliang, et al.
Published: (2024)
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
by: Mei, Zhiyu, et al.
Published: (2023)
by: Mei, Zhiyu, et al.
Published: (2023)
Convergence Analysis of Natural Gradient Descent for Over-parameterized Physics-Informed Neural Networks
by: Xu, Xianliang, et al.
Published: (2024)
by: Xu, Xianliang, et al.
Published: (2024)
Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction
by: Jin, Yonggang, et al.
Published: (2024)
by: Jin, Yonggang, et al.
Published: (2024)
Best Agent Identification for General Game Playing
by: Stephenson, Matthew, et al.
Published: (2025)
by: Stephenson, Matthew, et al.
Published: (2025)
Learning to Play Video Games with Intuitive Physics Priors
by: Jaiswal, Abhishek, et al.
Published: (2024)
by: Jaiswal, Abhishek, et al.
Published: (2024)
Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment
by: Zhang, Chen, et al.
Published: (2024)
by: Zhang, Chen, et al.
Published: (2024)
Reinforcement Learning from Diverse Human Preferences
by: Xue, Wanqi, et al.
Published: (2023)
by: Xue, Wanqi, et al.
Published: (2023)
Convergence of Implicit Gradient Descent for Training Two-Layer Physics-Informed Neural Networks
by: Xu, Xianliang, et al.
Published: (2024)
by: Xu, Xianliang, et al.
Published: (2024)
SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning
by: Chen, Jiaqi, et al.
Published: (2025)
by: Chen, Jiaqi, et al.
Published: (2025)
Deploying Ten Thousand Robots: Scalable Imitation Learning for Lifelong Multi-Agent Path Finding
by: Jiang, He, et al.
Published: (2024)
by: Jiang, He, et al.
Published: (2024)
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
by: Xu, Zelai, et al.
Published: (2025)
by: Xu, Zelai, et al.
Published: (2025)
A Comprehensive Review of Multi-Agent Reinforcement Learning in Video Games
by: Li, Zhengyang, et al.
Published: (2025)
by: Li, Zhengyang, et al.
Published: (2025)
PSI3D: Plug-and-Play 3D Stochastic Inference with Slice-wise Latent Diffusion Prior
by: Guo, Wenhan, et al.
Published: (2025)
by: Guo, Wenhan, et al.
Published: (2025)
Learning to Play Multi-Follower Bayesian Stackelberg Games
by: Personnat, Gerson, et al.
Published: (2025)
by: Personnat, Gerson, et al.
Published: (2025)
Large Language Model Agent for Hyper-Parameter Optimization
by: Liu, Siyi, et al.
Published: (2024)
by: Liu, Siyi, et al.
Published: (2024)
On the Performance Analysis of Momentum Method: A Frequency Domain Perspective
by: Li, Xianliang, et al.
Published: (2024)
by: Li, Xianliang, et al.
Published: (2024)
Rethinking Graph Masked Autoencoders through Alignment and Uniformity
by: Wang, Liang, et al.
Published: (2024)
by: Wang, Liang, et al.
Published: (2024)
VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos
by: Lu, Dunjie, et al.
Published: (2025)
by: Lu, Dunjie, et al.
Published: (2025)
$π$-Play: Multi-Agent Self-Play via Privileged Self-Distillation without External Data
by: Zhang, Yaocheng, et al.
Published: (2026)
by: Zhang, Yaocheng, et al.
Published: (2026)
Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
by: Wang, Liangzhou, et al.
Published: (2024)
by: Wang, Liangzhou, et al.
Published: (2024)
Beyond Closed-Pool Video Retrieval: A Benchmark and Agent Framework for Real-World Video Search and Moment Localization
by: Yu, Tao, et al.
Published: (2026)
by: Yu, Tao, et al.
Published: (2026)
Plug-and-Play Controllable Generation for Discrete Masked Models
by: Guo, Wei, et al.
Published: (2024)
by: Guo, Wei, et al.
Published: (2024)
Pareto-guided Pipeline for Distilling Featherweight AI Agents in Mobile MOBA Games
by: Yang, Xionghui, et al.
Published: (2026)
by: Yang, Xionghui, et al.
Published: (2026)
Discriminative Entropy Clustering and its Relation to K-means and SVM
by: Zhang, Zhongwen, et al.
Published: (2023)
by: Zhang, Zhongwen, et al.
Published: (2023)
ChronoSteer: Bridging Large Language Model and Time Series Foundation Model via Synthetic Data
by: Wang, Chengsen, et al.
Published: (2025)
by: Wang, Chengsen, et al.
Published: (2025)
D2IP: Deep Dynamic Image Prior for 3D Time-sequence Pulmonary Impedance Imaging
by: Fang, Hao, et al.
Published: (2025)
by: Fang, Hao, et al.
Published: (2025)
More Agents Is All You Need
by: Li, Junyou, et al.
Published: (2024)
by: Li, Junyou, et al.
Published: (2024)
Character Beyond Speech: Leveraging Role-Playing Evaluation in Audio Large Language Models via Reinforcement Learning
by: Fu, Dongjie, et al.
Published: (2026)
by: Fu, Dongjie, et al.
Published: (2026)
Similar Items
-
Cogito, Ergo Ludo: An Agent that Learns to Play by Reasoning and Planning
by: Wang, Sai, et al.
Published: (2025) -
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
by: Xu, Zelai, et al.
Published: (2023) -
Mean-Field Diffuser: Scaling Offline MARL to Thousands of Agents
by: Li, Wenhao, et al.
Published: (2026) -
Learning to play: A Multimodal Agent for 3D Game-Play
by: Yue, Yuguang, et al.
Published: (2025) -
Playing Non-Embedded Card-Based Games with Reinforcement Learning
by: Wu, Tianyang, et al.
Published: (2025)