Saved in:
| Main Authors: | Sheng, Junjie, Wu, Jiehao, Cui, Haochuan, Hu, Yiqiu, Zhou, Wenli, Zhu, Lei, Peng, Qian, Li, Wenhao, Wang, Xiangfeng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.00537 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Learning Virtual Machine Scheduling in Cloud Computing through Language Agents
by: Wu, JieHao, et al.
Published: (2025)
by: Wu, JieHao, et al.
Published: (2025)
AscendOptimizer: Episodic Agent for Ascend NPU Operator Optimization
by: Wu, Jiehao, et al.
Published: (2026)
by: Wu, Jiehao, et al.
Published: (2026)
Mean-Field Diffuser: Scaling Offline MARL to Thousands of Agents
by: Li, Wenhao, et al.
Published: (2026)
by: Li, Wenhao, et al.
Published: (2026)
Scalable Density-based Clustering with Random Projections
by: Xu, Haochuan, et al.
Published: (2024)
by: Xu, Haochuan, et al.
Published: (2024)
GraphThought: Graph Combinatorial Optimization with Thought Generation
by: Huang, Zixiao, et al.
Published: (2025)
by: Huang, Zixiao, et al.
Published: (2025)
A Survey of Automatic Prompt Engineering: An Optimization Perspective
by: Li, Wenwu, et al.
Published: (2025)
by: Li, Wenwu, et al.
Published: (2025)
Reinforcement Learning for Scalable Train Timetable Rescheduling with Graph Representation
by: Yue, Peng, et al.
Published: (2024)
by: Yue, Peng, et al.
Published: (2024)
Ranking-Aware Calibration for Reliable Multimodal Reinforcement Learning
by: Cui, Peng, et al.
Published: (2026)
by: Cui, Peng, et al.
Published: (2026)
TextAtari: 100K Frames Game Playing with Language Agents
by: Li, Wenhao, et al.
Published: (2025)
by: Li, Wenhao, et al.
Published: (2025)
Exploring Microstructural Dynamics in Cryptocurrency Limit Order Books: Better Inputs Matter More Than Stacking Another Hidden Layer
by: Wang, Haochuan
Published: (2025)
by: Wang, Haochuan
Published: (2025)
Dynamic Feature-based Deep Reinforcement Learning for Flow Control of Circular Cylinder with Sparse Surface Pressure Sensing
by: Wang, Qiulei, et al.
Published: (2023)
by: Wang, Qiulei, et al.
Published: (2023)
Optimizing Electric Bus Charging Scheduling with Uncertainties Using Hierarchical Deep Reinforcement Learning
by: Qi, Jiaju, et al.
Published: (2025)
by: Qi, Jiaju, et al.
Published: (2025)
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
by: Li, Yuhan, et al.
Published: (2025)
by: Li, Yuhan, et al.
Published: (2025)
Estimation and Inference in Distributional Reinforcement Learning
by: Zhang, Liangyu, et al.
Published: (2023)
by: Zhang, Liangyu, et al.
Published: (2023)
Machine Learning for Scalable and Optimal Load Shedding Under Power System Contingency
by: Zhou, Yuqi, et al.
Published: (2024)
by: Zhou, Yuqi, et al.
Published: (2024)
Mixture-of-Experts Meets In-Context Reinforcement Learning
by: Wu, Wenhao, et al.
Published: (2025)
by: Wu, Wenhao, et al.
Published: (2025)
Enhanced Pre-training of Graph Neural Networks for Million-Scale Heterogeneous Graphs
by: Sun, Shengyin, et al.
Published: (2025)
by: Sun, Shengyin, et al.
Published: (2025)
A Task-Centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula
by: Liu, Chenruo, et al.
Published: (2026)
by: Liu, Chenruo, et al.
Published: (2026)
Unveiling Markov Heads in Pretrained Language Models for Offline Reinforcement Learning
by: Zhao, Wenhao, et al.
Published: (2024)
by: Zhao, Wenhao, et al.
Published: (2024)
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
by: Grigsby, Jake, et al.
Published: (2023)
by: Grigsby, Jake, et al.
Published: (2023)
Differentiable Adaptive Kalman Filtering via Optimal Transport
by: He, Yangguang, et al.
Published: (2025)
by: He, Yangguang, et al.
Published: (2025)
TrackDiffuser: Nearly Model-Free Bayesian Filtering with Diffusion Model
by: He, Yangguang, et al.
Published: (2025)
by: He, Yangguang, et al.
Published: (2025)
Scheduling That Speaks: An Interpretable Programmatic Reinforcement Learning Framework
by: Hu, Chengpeng, et al.
Published: (2026)
by: Hu, Chengpeng, et al.
Published: (2026)
Electric Bus Charging Schedules Relying on Real Data-Driven Targets Based on Hierarchical Deep Reinforcement Learning
by: Qi, Jiaju, et al.
Published: (2025)
by: Qi, Jiaju, et al.
Published: (2025)
Scalable Multi-Agent Reinforcement Learning for Residential Load Scheduling under Data Governance
by: Qin, Zhaoming, et al.
Published: (2021)
by: Qin, Zhaoming, et al.
Published: (2021)
SkyRover: A Modular Simulator for Cross-Domain Pathfinding
by: Ma, Wenhui, et al.
Published: (2025)
by: Ma, Wenhui, et al.
Published: (2025)
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
by: Wu, Yongliang, et al.
Published: (2025)
by: Wu, Yongliang, et al.
Published: (2025)
Counterfactually Safe Reinforcement Learning
by: Li, Jingyi, et al.
Published: (2026)
by: Li, Jingyi, et al.
Published: (2026)
Rethinking the Role of Dynamic Sparse Training for Scalable Deep Reinforcement Learning
by: Ma, Guozheng, et al.
Published: (2025)
by: Ma, Guozheng, et al.
Published: (2025)
Heterogeneous Multi-Agent Reinforcement Learning for Zero-Shot Scalable Collaboration
by: Guo, Xudong, et al.
Published: (2024)
by: Guo, Xudong, et al.
Published: (2024)
Learning to Factorize and Adapt: A Versatile Approach Toward Universal Spatio-Temporal Foundation Models
by: Zhong, Siru, et al.
Published: (2026)
by: Zhong, Siru, et al.
Published: (2026)
Data Heterogeneity Modeling for Trustworthy Machine Learning
by: Liu, Jiashuo, et al.
Published: (2025)
by: Liu, Jiashuo, et al.
Published: (2025)
AnchorGT: Efficient and Flexible Attention Architecture for Scalable Graph Transformers
by: Zhu, Wenhao, et al.
Published: (2024)
by: Zhu, Wenhao, et al.
Published: (2024)
Generative Multi-Agent Collaboration in Embodied AI: A Systematic Review
by: Wu, Di, et al.
Published: (2025)
by: Wu, Di, et al.
Published: (2025)
Functional Scaling Laws in Kernel Regression: Loss Dynamics and Learning Rate Schedules
by: Li, Binghui, et al.
Published: (2025)
by: Li, Binghui, et al.
Published: (2025)
Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving
by: Gao, Shihong, et al.
Published: (2025)
by: Gao, Shihong, et al.
Published: (2025)
Dynamic Inhomogeneous Quantum Resource Scheduling with Reinforcement Learning
by: Li, Linsen, et al.
Published: (2024)
by: Li, Linsen, et al.
Published: (2024)
Anomaly Detection for Scalable Task Grouping in Reinforcement Learning-based RAN Optimization
by: Li, Jimmy, et al.
Published: (2023)
by: Li, Jimmy, et al.
Published: (2023)
Where Paths Collide: A Comprehensive Survey of Classic and Learning-Based Multi-Agent Pathfinding
by: Wang, Shiyue, et al.
Published: (2025)
by: Wang, Shiyue, et al.
Published: (2025)
Sparsity via Hyperpriors: A Theoretical and Algorithmic Study under Empirical Bayes Framework
by: Li, Zhitao, et al.
Published: (2025)
by: Li, Zhitao, et al.
Published: (2025)
Similar Items
-
Learning Virtual Machine Scheduling in Cloud Computing through Language Agents
by: Wu, JieHao, et al.
Published: (2025) -
AscendOptimizer: Episodic Agent for Ascend NPU Operator Optimization
by: Wu, Jiehao, et al.
Published: (2026) -
Mean-Field Diffuser: Scaling Offline MARL to Thousands of Agents
by: Li, Wenhao, et al.
Published: (2026) -
Scalable Density-based Clustering with Random Projections
by: Xu, Haochuan, et al.
Published: (2024) -
GraphThought: Graph Combinatorial Optimization with Thought Generation
by: Huang, Zixiao, et al.
Published: (2025)