Saved in:
| Main Authors: | Xiao, Jie, Fan, Changyuan, Ren, Qingnan, Long, Alfred, Zhang, Yuchen, Yu, Rymon, Yang, Eric, Ai, Lynn, Gan, Shaoduo |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.05387 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SqueezeAttention: 2D Management of KV-Cache in LLM Inference via Layer-wise Optimal Budget
by: Wang, Zihao, et al.
Published: (2024)
by: Wang, Zihao, et al.
Published: (2024)
ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning
by: Song, Jingwei, et al.
Published: (2026)
by: Song, Jingwei, et al.
Published: (2026)
Societal Adaptation to AI Human-Labor Automation
by: Rymon, Yuval
Published: (2024)
by: Rymon, Yuval
Published: (2024)
Of the People, By the Algorithm: How AI Transforms Democratic Representation
by: Rymon, Yuval
Published: (2025)
by: Rymon, Yuval
Published: (2025)
Implicit Strategic Optimization: Rethinking Long-Horizon Decision-Making in Adversarial Poker Environments
by: Xia, Boyang, et al.
Published: (2026)
by: Xia, Boyang, et al.
Published: (2026)
EchoRL: Reinforcement Learning via Rollout Echoing
by: Bi, Jinhe, et al.
Published: (2026)
by: Bi, Jinhe, et al.
Published: (2026)
ETS: Energy-Guided Test-Time Scaling for Training-Free RL Alignment
by: Li, Xiuyu, et al.
Published: (2026)
by: Li, Xiuyu, et al.
Published: (2026)
Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization
by: Dong, Siyan, et al.
Published: (2024)
by: Dong, Siyan, et al.
Published: (2024)
Lattica: A Decentralized Cross-NAT Communication Framework for Scalable AI Inference and Training
by: Yang, Ween, et al.
Published: (2025)
by: Yang, Ween, et al.
Published: (2025)
Echo: Simulating Distributed Training At Scale
by: Feng, Yicheng, et al.
Published: (2024)
by: Feng, Yicheng, et al.
Published: (2024)
AOI: Turning Failed Trajectories into Training Signals for Autonomous Cloud Diagnosis
by: Yang, Pei, et al.
Published: (2026)
by: Yang, Pei, et al.
Published: (2026)
Event‐Triggered Fault‐Tolerant Formation Tracking for Quadrotor Swarms Under Fully Distributed Communication
by: Qingnan Huang, et al.
Published: (2026)
by: Qingnan Huang, et al.
Published: (2026)
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
by: Li, Haozhan, et al.
Published: (2025)
by: Li, Haozhan, et al.
Published: (2025)
Position: Assistive Agents Need Accessibility Alignment
by: Hu, Jie, et al.
Published: (2026)
by: Hu, Jie, et al.
Published: (2026)
EchoDistill:Alignment Noisy-to-Clean Self-Distillation for Robust Audio LLMs
by: Lin, Liang, et al.
Published: (2026)
by: Lin, Liang, et al.
Published: (2026)
VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoning
by: Xiao, Wenyi, et al.
Published: (2026)
by: Xiao, Wenyi, et al.
Published: (2026)
Decoupled Prioritized Resampling for Offline RL
by: Yue, Yang, et al.
Published: (2023)
by: Yue, Yang, et al.
Published: (2023)
Echo-N1: Affective RL Frontier
by: Zhang, Naifan, et al.
Published: (2025)
by: Zhang, Naifan, et al.
Published: (2025)
Custom code for LSGI manuscript
by: Liang, Qingnan
Published: (2025)
by: Liang, Qingnan
Published: (2025)
Primitive-Swarm: An Ultra-lightweight and Scalable Planner for Large-scale Aerial Swarms
by: Hou, Jialiang, et al.
Published: (2025)
by: Hou, Jialiang, et al.
Published: (2025)
VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL
by: Feng, Yichen, et al.
Published: (2025)
by: Feng, Yichen, et al.
Published: (2025)
SafeSwarm: Decentralized Safe RL for the Swarm of Drones Landing in Dense Crowds
by: Tadevosyan, Grik, et al.
Published: (2025)
by: Tadevosyan, Grik, et al.
Published: (2025)
Heterogeneity-Aware Dataset Scheduling for Efficient Audio Large Language Model Training
by: Wu, Yanru, et al.
Published: (2026)
by: Wu, Yanru, et al.
Published: (2026)
Is Meta-Path Attention an Explanation? Evidence of Alignment and Decoupling in Heterogeneous GNNs
by: Jiang, Maiqi, et al.
Published: (2026)
by: Jiang, Maiqi, et al.
Published: (2026)
Empowering Heterogeneous Graph Foundation Models via Decoupled Relation Alignment
by: Zheng, Ziyu, et al.
Published: (2026)
by: Zheng, Ziyu, et al.
Published: (2026)
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
by: Xie, Tian, et al.
Published: (2025)
by: Xie, Tian, et al.
Published: (2025)
AutoTool: Automatic Scaling of Tool-Use Capabilities in RL via Decoupled Entropy Constraints
by: Zeng, Yirong, et al.
Published: (2026)
by: Zeng, Yirong, et al.
Published: (2026)
Bockstein operations and extensions with trivial boundary maps
by: An, Qingnan, et al.
Published: (2024)
by: An, Qingnan, et al.
Published: (2024)
Total Cuntz semigroup, Extension and Elliott Conjecture with Real rank zero
by: An, Qingnan, et al.
Published: (2023)
by: An, Qingnan, et al.
Published: (2023)
Total Cuntz semigroup, extension, and Elliott Conjecture with real rank zero
by: Qingnan An, et al.
Published: (2024)
by: Qingnan An, et al.
Published: (2024)
Modular Diffusion Policy Training: Decoupling and Recombining Guidance and Diffusion for Offline RL
by: Chen, Zhaoyang, et al.
Published: (2025)
by: Chen, Zhaoyang, et al.
Published: (2025)
DiffusionRL: Efficient Training of Diffusion Policies for Robotic Grasping Using RL-Adapted Large-Scale Datasets
by: Makarova, Maria, et al.
Published: (2025)
by: Makarova, Maria, et al.
Published: (2025)
Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy
by: Zhu, Yu, et al.
Published: (2024)
by: Zhu, Yu, et al.
Published: (2024)
Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models
by: Wang, Ya, et al.
Published: (2025)
by: Wang, Ya, et al.
Published: (2025)
How to Compress KV Cache in RL Post-Training? Shadow Mask Distillation for Memory-Efficient Alignment
by: Zhu, Rui, et al.
Published: (2026)
by: Zhu, Rui, et al.
Published: (2026)
ADORA: Training Reasoning Models with Dynamic Advantage Estimation on Reinforcement Learning
by: Ren, Qingnan, et al.
Published: (2026)
by: Ren, Qingnan, et al.
Published: (2026)
Swarm-STL: A Framework for Motion Planning in Large-Scale, Multi-Swarm Systems
by: Cheng, Shiyu, et al.
Published: (2025)
by: Cheng, Shiyu, et al.
Published: (2025)
SwarmPRM: Probabilistic Roadmap Motion Planning for Large-Scale Swarm Robotic Systems
by: Hu, Yunze, et al.
Published: (2024)
by: Hu, Yunze, et al.
Published: (2024)
QaRL: Rollout-Aligned Quantization-Aware RL for Fast and Stable Training under Training--Inference Mismatch
by: Gu, Hao, et al.
Published: (2026)
by: Gu, Hao, et al.
Published: (2026)
Distributed and Decentralized Control and Task Allocation for Flexible Swarms
by: Koifman, Yigal, et al.
Published: (2024)
by: Koifman, Yigal, et al.
Published: (2024)
Similar Items
-
SqueezeAttention: 2D Management of KV-Cache in LLM Inference via Layer-wise Optimal Budget
by: Wang, Zihao, et al.
Published: (2024) -
ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning
by: Song, Jingwei, et al.
Published: (2026) -
Societal Adaptation to AI Human-Labor Automation
by: Rymon, Yuval
Published: (2024) -
Of the People, By the Algorithm: How AI Transforms Democratic Representation
by: Rymon, Yuval
Published: (2025) -
Implicit Strategic Optimization: Rethinking Long-Horizon Decision-Making in Adversarial Poker Environments
by: Xia, Boyang, et al.
Published: (2026)