:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Sheng, Junjie, Wu, Jiehao, Cui, Haochuan, Hu, Yiqiu, Zhou, Wenli, Zhu, Lei, Peng, Qian, Li, Wenhao, Wang, Xiangfeng
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2503.00537
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Learning Virtual Machine Scheduling in Cloud Computing through Language Agents
by: Wu, JieHao, et al.
Published: (2025)

AscendOptimizer: Episodic Agent for Ascend NPU Operator Optimization
by: Wu, Jiehao, et al.
Published: (2026)

Mean-Field Diffuser: Scaling Offline MARL to Thousands of Agents
by: Li, Wenhao, et al.
Published: (2026)

Scalable Density-based Clustering with Random Projections
by: Xu, Haochuan, et al.
Published: (2024)

GraphThought: Graph Combinatorial Optimization with Thought Generation
by: Huang, Zixiao, et al.
Published: (2025)

A Survey of Automatic Prompt Engineering: An Optimization Perspective
by: Li, Wenwu, et al.
Published: (2025)

Reinforcement Learning for Scalable Train Timetable Rescheduling with Graph Representation
by: Yue, Peng, et al.
Published: (2024)

Ranking-Aware Calibration for Reliable Multimodal Reinforcement Learning
by: Cui, Peng, et al.
Published: (2026)

TextAtari: 100K Frames Game Playing with Language Agents
by: Li, Wenhao, et al.
Published: (2025)

Exploring Microstructural Dynamics in Cryptocurrency Limit Order Books: Better Inputs Matter More Than Stacking Another Hidden Layer
by: Wang, Haochuan
Published: (2025)

Dynamic Feature-based Deep Reinforcement Learning for Flow Control of Circular Cylinder with Sparse Surface Pressure Sensing
by: Wang, Qiulei, et al.
Published: (2023)

Optimizing Electric Bus Charging Scheduling with Uncertainties Using Hierarchical Deep Reinforcement Learning
by: Qi, Jiaju, et al.
Published: (2025)

Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
by: Li, Yuhan, et al.
Published: (2025)

Estimation and Inference in Distributional Reinforcement Learning
by: Zhang, Liangyu, et al.
Published: (2023)

Machine Learning for Scalable and Optimal Load Shedding Under Power System Contingency
by: Zhou, Yuqi, et al.
Published: (2024)

Mixture-of-Experts Meets In-Context Reinforcement Learning
by: Wu, Wenhao, et al.
Published: (2025)

Enhanced Pre-training of Graph Neural Networks for Million-Scale Heterogeneous Graphs
by: Sun, Shengyin, et al.
Published: (2025)

A Task-Centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula
by: Liu, Chenruo, et al.
Published: (2026)

Unveiling Markov Heads in Pretrained Language Models for Offline Reinforcement Learning
by: Zhao, Wenhao, et al.
Published: (2024)

AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
by: Grigsby, Jake, et al.
Published: (2023)

Differentiable Adaptive Kalman Filtering via Optimal Transport
by: He, Yangguang, et al.
Published: (2025)

TrackDiffuser: Nearly Model-Free Bayesian Filtering with Diffusion Model
by: He, Yangguang, et al.
Published: (2025)

Scheduling That Speaks: An Interpretable Programmatic Reinforcement Learning Framework
by: Hu, Chengpeng, et al.
Published: (2026)

Electric Bus Charging Schedules Relying on Real Data-Driven Targets Based on Hierarchical Deep Reinforcement Learning
by: Qi, Jiaju, et al.
Published: (2025)

Scalable Multi-Agent Reinforcement Learning for Residential Load Scheduling under Data Governance
by: Qin, Zhaoming, et al.
Published: (2021)

SkyRover: A Modular Simulator for Cross-Domain Pathfinding
by: Ma, Wenhui, et al.
Published: (2025)

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
by: Wu, Yongliang, et al.
Published: (2025)

Counterfactually Safe Reinforcement Learning
by: Li, Jingyi, et al.
Published: (2026)

Rethinking the Role of Dynamic Sparse Training for Scalable Deep Reinforcement Learning
by: Ma, Guozheng, et al.
Published: (2025)

Heterogeneous Multi-Agent Reinforcement Learning for Zero-Shot Scalable Collaboration
by: Guo, Xudong, et al.
Published: (2024)

Learning to Factorize and Adapt: A Versatile Approach Toward Universal Spatio-Temporal Foundation Models
by: Zhong, Siru, et al.
Published: (2026)

Data Heterogeneity Modeling for Trustworthy Machine Learning
by: Liu, Jiashuo, et al.
Published: (2025)

AnchorGT: Efficient and Flexible Attention Architecture for Scalable Graph Transformers
by: Zhu, Wenhao, et al.
Published: (2024)

Generative Multi-Agent Collaboration in Embodied AI: A Systematic Review
by: Wu, Di, et al.
Published: (2025)

Functional Scaling Laws in Kernel Regression: Loss Dynamics and Learning Rate Schedules
by: Li, Binghui, et al.
Published: (2025)

Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving
by: Gao, Shihong, et al.
Published: (2025)

Dynamic Inhomogeneous Quantum Resource Scheduling with Reinforcement Learning
by: Li, Linsen, et al.
Published: (2024)

Anomaly Detection for Scalable Task Grouping in Reinforcement Learning-based RAN Optimization
by: Li, Jimmy, et al.
Published: (2023)

Where Paths Collide: A Comprehensive Survey of Classic and Learning-Based Multi-Agent Pathfinding
by: Wang, Shiyue, et al.
Published: (2025)

Sparsity via Hyperpriors: A Theoretical and Algorithmic Study under Empirical Bayes Framework
by: Li, Zhitao, et al.
Published: (2025)