:: Library Catalog

Bibliographic Details
Main Authors:	Xiao, Jie; Fan, Changyuan; Ren, Qingnan; Long, Alfred; Zhang, Yuchen; Yu, Rymon; Yang, Eric; Ai, Lynn; Gan, Shaoduo
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2508.05387

Similar Items

SqueezeAttention: 2D Management of KV-Cache in LLM Inference via Layer-wise Optimal Budget
by: Wang, Zihao, et al.
Published: (2024)

ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning
by: Song, Jingwei, et al.
Published: (2026)

Societal Adaptation to AI Human-Labor Automation
by: Rymon, Yuval
Published: (2024)

Of the People, By the Algorithm: How AI Transforms Democratic Representation
by: Rymon, Yuval
Published: (2025)

Implicit Strategic Optimization: Rethinking Long-Horizon Decision-Making in Adversarial Poker Environments
by: Xia, Boyang, et al.
Published: (2026)

EchoRL: Reinforcement Learning via Rollout Echoing
by: Bi, Jinhe, et al.
Published: (2026)

ETS: Energy-Guided Test-Time Scaling for Training-Free RL Alignment
by: Li, Xiuyu, et al.
Published: (2026)

Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization
by: Dong, Siyan, et al.
Published: (2024)

Lattica: A Decentralized Cross-NAT Communication Framework for Scalable AI Inference and Training
by: Yang, Ween, et al.
Published: (2025)

Echo: Simulating Distributed Training At Scale
by: Feng, Yicheng, et al.
Published: (2024)

AOI: Turning Failed Trajectories into Training Signals for Autonomous Cloud Diagnosis
by: Yang, Pei, et al.
Published: (2026)

Event‐Triggered Fault‐Tolerant Formation Tracking for Quadrotor Swarms Under Fully Distributed Communication
by: Huang, Qingnan, et al.
Published: (2026)

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
by: Li, Haozhan, et al.
Published: (2025)

Position: Assistive Agents Need Accessibility Alignment
by: Hu, Jie, et al.
Published: (2026)

EchoDistill: Alignment Noisy-to-Clean Self-Distillation for Robust Audio LLMs
by: Lin, Liang, et al.
Published: (2026)

VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoning
by: Xiao, Wenyi, et al.
Published: (2026)

Decoupled Prioritized Resampling for Offline RL
by: Yue, Yang, et al.
Published: (2023)

Echo-N1: Affective RL Frontier
by: Zhang, Naifan, et al.
Published: (2025)

Custom code for LSGI manuscript
by: Liang, Qingnan
Published: (2025)

Primitive-Swarm: An Ultra-lightweight and Scalable Planner for Large-scale Aerial Swarms
by: Hou, Jialiang, et al.
Published: (2025)

VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL
by: Feng, Yichen, et al.
Published: (2025)

SafeSwarm: Decentralized Safe RL for the Swarm of Drones Landing in Dense Crowds
by: Tadevosyan, Grik, et al.
Published: (2025)

Heterogeneity-Aware Dataset Scheduling for Efficient Audio Large Language Model Training
by: Wu, Yanru, et al.
Published: (2026)

Is Meta-Path Attention an Explanation? Evidence of Alignment and Decoupling in Heterogeneous GNNs
by: Jiang, Maiqi, et al.
Published: (2026)

Empowering Heterogeneous Graph Foundation Models via Decoupled Relation Alignment
by: Zheng, Ziyu, et al.
Published: (2026)

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
by: Xie, Tian, et al.
Published: (2025)

AutoTool: Automatic Scaling of Tool-Use Capabilities in RL via Decoupled Entropy Constraints
by: Zeng, Yirong, et al.
Published: (2026)

Bockstein operations and extensions with trivial boundary maps
by: An, Qingnan, et al.
Published: (2024)

Total Cuntz semigroup, Extension and Elliott Conjecture with Real rank zero
by: An, Qingnan, et al.
Published: (2023)

Total Cuntz semigroup, extension, and Elliott Conjecture with real rank zero
by: An, Qingnan, et al.
Published: (2024)

Modular Diffusion Policy Training: Decoupling and Recombining Guidance and Diffusion for Offline RL
by: Chen, Zhaoyang, et al.
Published: (2025)

DiffusionRL: Efficient Training of Diffusion Policies for Robotic Grasping Using RL-Adapted Large-Scale Datasets
by: Makarova, Maria, et al.
Published: (2025)

Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy
by: Zhu, Yu, et al.
Published: (2024)

Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models
by: Wang, Ya, et al.
Published: (2025)

How to Compress KV Cache in RL Post-Training? Shadow Mask Distillation for Memory-Efficient Alignment
by: Zhu, Rui, et al.
Published: (2026)

ADORA: Training Reasoning Models with Dynamic Advantage Estimation on Reinforcement Learning
by: Ren, Qingnan, et al.
Published: (2026)

Swarm-STL: A Framework for Motion Planning in Large-Scale, Multi-Swarm Systems
by: Cheng, Shiyu, et al.
Published: (2025)

SwarmPRM: Probabilistic Roadmap Motion Planning for Large-Scale Swarm Robotic Systems
by: Hu, Yunze, et al.
Published: (2024)

QaRL: Rollout-Aligned Quantization-Aware RL for Fast and Stable Training under Training–Inference Mismatch
by: Gu, Hao, et al.
Published: (2026)

Distributed and Decentralized Control and Task Allocation for Flexible Swarms
by: Koifman, Yigal, et al.
Published: (2024)