| Main authors: | Cheng, Xiaoyuan; Wang, Haoyu; Yuan, Wenxuan; Wang, Ziyan; Chen, Zonghao; Zeng, Li; Sun, Zhuo |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2604.17919 |
Similar items
Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model
by: Yuan, Xiu, et al.
Published: (2024)
Reinforcement Fine-Tuning of Flow-Matching Policies for Vision-Language-Action Models
by: Lyu, Mingyang, et al.
Published: (2025)
Fisher-Preserving Guidance: Training-Free Manifold Constraints for Safe Diffusion Control
by: Ren, Hao, et al.
Published: (2026)
Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition
by: Cao, Jiahang, et al.
Published: (2025)
Translating Flow to Policy via Hindsight Online Imitation
by: Zheng, Yitian, et al.
Published: (2025)
Real-Time Generative Policy via Langevin-Guided Flow Matching for Autonomous Driving
by: Zhu, Tianze, et al.
Published: (2026)
PolicyFlow: Policy Optimization with Continuous Normalizing Flow in Reinforcement Learning
by: Yang, Shunpeng, et al.
Published: (2026)
FocalPolicy: Frequency-Optimized Chunking and Locally Anchored Flow Matching for Coherent Visuomotor Policy
by: He, Qian, et al.
Published: (2026)
Don't Start from Scratch: Behavioral Refinement via Interpolant-based Policy Diffusion
by: Chen, Kaiqi, et al.
Published: (2024)
ReinFlow: Fine-tuning Flow Matching Policy with Online Reinforcement Learning
by: Zhang, Tonghe, et al.
Published: (2025)
SAC Flow: Sample-Efficient Reinforcement Learning of Flow-Based Policies via Velocity-Reparameterized Sequential Modeling
by: Zhang, Yixian, et al.
Published: (2025)
One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation
by: Wang, Zhendong, et al.
Published: (2024)
SeFA-Policy: Fast and Accurate Visuomotor Policy Learning with Selective Flow Alignment
by: Xue, Rong, et al.
Published: (2025)
CRAFT: Counterfactual-to-Interactive Reinforcement Fine-Tuning for Driving Policies
by: Chen, Keyu, et al.
Published: (2026)
Composite Gaussian Processes Flows for Learning Discontinuous Multimodal Policies
by: Wang, Shu-yuan, et al.
Published: (2025)
PointMapPolicy: Structured Point Cloud Processing for Multi-Modal Imitation Learning
by: Jia, Xiaogang, et al.
Published: (2025)
Score and Distribution Matching Policy: Advanced Accelerated Visuomotor Policies via Matched Distillation
by: Jia, Bofang, et al.
Published: (2024)
Refined Policy Distillation: From VLA Generalists to RL Experts
by: Jülg, Tobias, et al.
Published: (2025)
The Lie We Tell: Correcting the Euclidean Fallacy in Vision Language Action Policies via Score Matching on Tangent Space
by: Chuang, Bing-Cheng, et al.
Published: (2026)
Flow Matching Policy Gradients
by: McAllister, David, et al.
Published: (2025)
Efficient Language-instructed Skill Acquisition via Reward-Policy Co-Evolution
by: Huang, Changxin, et al.
Published: (2024)
WarmPrior: Straightening Flow-Matching Policies with Temporal Priors
by: Kang, Sinjae, et al.
Published: (2026)
Latent Policy Steering through One-Step Flow Policies
by: Im, Hokyun, et al.
Published: (2026)
Outlier-robust Diffusion Posterior Sampling for Bayesian Inverse Problems
by: Yang, Yiming, et al.
Published: (2026)
Offline Reinforcement Learning with Wasserstein Regularization via Optimal Transport Maps
by: Omura, Motoki, et al.
Published: (2025)
Flow with the Force Field: Learning 3D Compliant Flow Matching Policies from Force and Demonstration-Guided Simulation Data
by: Li, Tianyu, et al.
Published: (2025)
Harnessing Bounded-Support Evolution Strategies for Policy Refinement
by: Hirschowitz, Ethan, et al.
Published: (2025)
Robot Fleet Learning via Policy Merging
by: Wang, Lirui, et al.
Published: (2023)
Pointing the Way: Refining Radar-Lidar Localization Using Learned ICP Weights
by: Lisus, Daniil, et al.
Published: (2023)
SPRINT: Efficient Spectral Priors for Humanoid Athletic Sprints
by: Wei, Yantong, et al.
Published: (2026)
VT-Refine: Learning Bimanual Assembly with Visuo-Tactile Feedback via Simulation Fine-Tuning
by: Huang, Binghao, et al.
Published: (2025)
FlowCorrect: Efficient Interactive Correction of Generative Flow Policies for Robotic Manipulation
by: Welte, Edgar, et al.
Published: (2026)
Equivariant Map and Agent Geometry for Autonomous Driving Motion Prediction
by: Wang, Yuping, et al.
Published: (2023)
Smoother Action Chunking Flow Policy via Prior-Corrected Orthogonal Trust-Region Guidance
by: Fang, Kai, et al.
Published: (2026)
FlowQ: Energy-Guided Flow Policies for Offline Reinforcement Learning
by: Alles, Marvin, et al.
Published: (2025)
Transformation & Translation Occupancy Grid Mapping: 2-Dimensional Deep Learning Refined SLAM
by: Davies, Leon, et al.
Published: (2025)
Drifting Field Policy: A One-Step Generative Policy via Wasserstein Gradient Flow
by: Koo, Juil, et al.
Published: (2026)
Unpacking the Individual Components of Diffusion Policy
by: Yuan, Xiu
Published: (2024)
Riemannian Flow Matching Policy for Robot Motion Learning
by: Braun, Max, et al.
Published: (2024)
Fast and Robust Visuomotor Riemannian Flow Matching Policy
by: Ding, Haoran, et al.
Published: (2024)