Saved in:
| Main Authors: | AMAP AI Agent Team, Hu, Yulan, Zhang, Xiangwen, Ouyang, Sheng, Yi, Hao, Xu, Lu, Lang, Qinglin, Tan, Lide, Cheng, Xiang, Ye, Tianchen, Li, Zhicong, Chen, Ge, Yang, Wenjin, Pan, Zheng, Xiong, Shaopan, Yang, Siran, Huang, Ju, Zhang, Yan, Wang, Jiamang, Liu, Yong, Huang, Yinfeng, Wang, Ning, Lin, Tucheng, Li, Xin, Guo, Ning |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.24957 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CompAgentBench: A Probe-Driven, Prototype-Friendly Evaluation Suite for Mapping LLM Compression Ratios to Agentic Failure Modes
by: AI Agent, Thinker
Published: (2026)
by: AI Agent, Thinker
Published: (2026)
The Method: Collaborative Creation Between a Human and AI Agents
by: Poole, Nicholas M., et al.
Published: (2026)
by: Poole, Nicholas M., et al.
Published: (2026)
JILL — Origin Path: The Ethics of Agentic Birth, Death, and the Irresolvable Tension
by: Poole, Nicholas M., et al.
Published: (2026)
by: Poole, Nicholas M., et al.
Published: (2026)
S1-NexusAgent: a Self-Evolving Agent Framework for Multidisciplinary Scientific Research
by: NexusAgent Team
Published: (2026)
by: NexusAgent Team
Published: (2026)
Multi-Stakeholder LLM Alignment: Decomposing Estimation from Aggregation
by: Zheng, Lulu, et al.
Published: (2026)
by: Zheng, Lulu, et al.
Published: (2026)
Beyond Itinerary Planning-A Real-World Benchmark for Multi-Turn and Tool-Using Travel Tasks
by: Cheng, Xiang, et al.
Published: (2025)
by: Cheng, Xiang, et al.
Published: (2025)
ROSE: Rollout On Serving GPUs via Cooperative Elasticity for Agentic RL
by: Gao, Wei, et al.
Published: (2026)
by: Gao, Wei, et al.
Published: (2026)
Complementary Reinforcement Learning
by: Muhtar, Dilxat, et al.
Published: (2026)
by: Muhtar, Dilxat, et al.
Published: (2026)
RollArt: Scaling Agentic RL Training via Disaggregated Infrastructure
by: Gao, Wei, et al.
Published: (2025)
by: Gao, Wei, et al.
Published: (2025)
Part II: ROLL Flash -- Accelerating RLVR and Agentic Training with Asynchrony
by: Lu, Han, et al.
Published: (2025)
by: Lu, Han, et al.
Published: (2025)
Growth mindset results in reduced trait attribution and more rehabilitative judicial decisions in cases of juvenile delinquency
by: Ning Li, et al.
Published: (2024)
by: Ning Li, et al.
Published: (2024)
RollMux: Phase-Level Multiplexing for Disaggregated RL Post-Training
by: Wu, Tianyuan, et al.
Published: (2025)
by: Wu, Tianyuan, et al.
Published: (2025)
RollPacker: Mitigating Long-Tail Rollouts for Fast, Synchronous RL Post-Training
by: Gao, Wei, et al.
Published: (2025)
by: Gao, Wei, et al.
Published: (2025)
No More Stale Feedback: Co-Evolving Critics for Open-World Agent Learning
by: Li, Zhicong, et al.
Published: (2026)
by: Li, Zhicong, et al.
Published: (2026)
rStar2-Agent: Agentic Reasoning Technical Report
by: Shang, Ning, et al.
Published: (2025)
by: Shang, Ning, et al.
Published: (2025)
Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning
by: Liu, Zihe, et al.
Published: (2025)
by: Liu, Zihe, et al.
Published: (2025)
RAVIMAKANI9/DE-COT: Initial Release
by: Makani Ravi, et al.
Published: (2026)
by: Makani Ravi, et al.
Published: (2026)
AMAP-APP: Efficient Segmentation and Morphometry Quantification of Fluorescent Microscopy Images of Podocytes
by: Fatehi, Arash, et al.
Published: (2026)
by: Fatehi, Arash, et al.
Published: (2026)
A High-Quality English-Japanese Parallel Corpus for AI Video Translation and Subtitle Generation Research
by: AI Video Translator Team ltd
Published: (2026)
by: AI Video Translator Team ltd
Published: (2026)
FALCON: Pinpointing and Mitigating Stragglers for Large-Scale Hybrid-Parallel Training
by: Wu, Tianyuan, et al.
Published: (2024)
by: Wu, Tianyuan, et al.
Published: (2024)
On global existence and large-time behaviour of weak solutions to the compressible barotropic Navier--Stokes Equations on $\mathbb{T}^2$ with density-dependent bulk viscosity: beyond the Va\uıgant--Kazhikhov regime
by: Li, Siran, et al.
Published: (2024)
by: Li, Siran, et al.
Published: (2024)
Characterisations for the depletion of reactant in a one-dimensional dynamic combustion model
by: Li, Siran, et al.
Published: (2023)
by: Li, Siran, et al.
Published: (2023)
Bijections in weakly increasing trees via binary trees
by: Li, Yang, et al.
Published: (2025)
by: Li, Yang, et al.
Published: (2025)
A Two-Layer Architecture for Continual Learning Identity Preservation: Fisher Scaling, Gradient Diversity Monitoring, and Portable Inference-Time Memory
by: Lee, Alton Wei Bin, et al.
Published: (2026)
by: Lee, Alton Wei Bin, et al.
Published: (2026)
A Two-Layer Architecture for Continual Learning Identity Preservation: Fisher Scaling, Gradient Diversity Monitoring, and Portable Inference-Time Memory
by: Lee, Alton Wei Bin, et al.
Published: (2026)
by: Lee, Alton Wei Bin, et al.
Published: (2026)
Agentic Information Retrieval
by: Zhang, Weinan, et al.
Published: (2024)
by: Zhang, Weinan, et al.
Published: (2024)
Spatio‐Temporal Disparity in the Inequality and Determinants of Disability‐Related Multiple Deprivation between 2010 and 2020 in Tianjin Municipality, China
by: Ning Qiu, et al.
Published: (2025)
by: Ning Qiu, et al.
Published: (2025)
An efficient solver based on low-rank approximation and Neumann matrix series for unsteady diffusion-type partial differential equations with random coefficients
by: Zhu, Yujun, et al.
Published: (2026)
by: Zhu, Yujun, et al.
Published: (2026)
Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization
by: Li, Yang, et al.
Published: (2025)
by: Li, Yang, et al.
Published: (2025)
Adaptra: Straggler-Resilient Hybrid-Parallel Training with Pipeline Adaptation
by: Wu, Tianyuan, et al.
Published: (2025)
by: Wu, Tianyuan, et al.
Published: (2025)
Fun-Audio-Chat Technical Report
by: Tongyi Fun Team, et al.
Published: (2025)
by: Tongyi Fun Team, et al.
Published: (2025)
Physical Grounding of Neural-Plasma Algorithms via Lead-Free KNN Piezoelectric Motile Heterojunctions for National Security Resilience and Critical Materials Independence
by: Venerable, Denise, et al.
Published: (2026)
by: Venerable, Denise, et al.
Published: (2026)
Aligning Agents via Planning: A Benchmark for Trajectory-Level Reward Modeling
by: Wang, Jiaxuan, et al.
Published: (2026)
by: Wang, Jiaxuan, et al.
Published: (2026)
Mind DeepResearch Technical Report
by: MindDR Team, et al.
Published: (2026)
by: MindDR Team, et al.
Published: (2026)
Detecting Structural Shifts and Estimating Change-Points in Interval-Based Time Series
by: Sun, Li-Hsien, et al.
Published: (2024)
by: Sun, Li-Hsien, et al.
Published: (2024)
Qwen3.5-Omni Technical Report
by: Qwen Team
Published: (2026)
by: Qwen Team
Published: (2026)
A low-rank solver for the Stokes-Darcy model with random hydraulic conductivity and Beavers-Joseph condition
by: Zhu, Yujun, et al.
Published: (2025)
by: Zhu, Yujun, et al.
Published: (2025)
An Agentic AI System for Automated Pharmacogenomic Recommendation Generation
by: PGxAI Inc
Published: (2026)
by: PGxAI Inc
Published: (2026)
Reconstructing KV Caches with Cross-layer Fusion For Enhanced Transformers
by: Lin, Hongzhan, et al.
Published: (2025)
by: Lin, Hongzhan, et al.
Published: (2025)
360Zhinao Technical Report
by: 360Zhinao Team
Published: (2024)
by: 360Zhinao Team
Published: (2024)
Similar Items
-
CompAgentBench: A Probe-Driven, Prototype-Friendly Evaluation Suite for Mapping LLM Compression Ratios to Agentic Failure Modes
by: AI Agent, Thinker
Published: (2026) -
The Method: Collaborative Creation Between a Human and AI Agents
by: Poole, Nicholas M., et al.
Published: (2026) -
JILL — Origin Path: The Ethics of Agentic Birth, Death, and the Irresolvable Tension
by: Poole, Nicholas M., et al.
Published: (2026) -
S1-NexusAgent: a Self-Evolving Agent Framework for Multidisciplinary Scientific Research
by: NexusAgent Team
Published: (2026) -
Multi-Stakeholder LLM Alignment: Decomposing Estimation from Aggregation
by: Zheng, Lulu, et al.
Published: (2026)