Saved in:
| Main Authors: | Fan, Jiajun, Zhuang, Yuzheng, Liu, Yuecheng, Hao, Jianye, Wang, Bin, Zhu, Jiangcheng, Wang, Hao, Xia, Shu-Tao |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2305.05239 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow
by: Ma, Yueen, et al.
Published: (2025)
by: Ma, Yueen, et al.
Published: (2025)
A Survey on Vision-Language-Action Models for Embodied AI
by: Ma, Yueen, et al.
Published: (2024)
by: Ma, Yueen, et al.
Published: (2024)
CR-Eyes: A Computational Rational Model of Visual Sampling Behavior in Atari Games
by: Lorenz, Martin, et al.
Published: (2026)
by: Lorenz, Martin, et al.
Published: (2026)
Astra: Efficient Transformer Architecture and Contrastive Dynamics Learning for Embodied Instruction Following
by: Ma, Yueen, et al.
Published: (2024)
by: Ma, Yueen, et al.
Published: (2024)
ED2: Environment Dynamics Decomposition World Models for Continuous Control
by: Hao, Jianye, et al.
Published: (2021)
by: Hao, Jianye, et al.
Published: (2021)
Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control
by: Papoudakis, Georgios, et al.
Published: (2025)
by: Papoudakis, Georgios, et al.
Published: (2025)
MedDialBench: Benchmarking LLM Diagnostic Robustness under Parametric Adversarial Patient Behaviors
by: Luo, Xiaotian, et al.
Published: (2026)
by: Luo, Xiaotian, et al.
Published: (2026)
EVLP:Learning Unified Embodied Vision-Language Planner with Reinforced Supervised Fine-Tuning
by: Cai, Xinyan, et al.
Published: (2025)
by: Cai, Xinyan, et al.
Published: (2025)
SCALE: Self-Correcting Visual Navigation for Mobile Robots via Anti-Novelty Estimation
by: Chen, Chang, et al.
Published: (2024)
by: Chen, Chang, et al.
Published: (2024)
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
by: Dong, Zibin, et al.
Published: (2023)
by: Dong, Zibin, et al.
Published: (2023)
OmniEVA: Embodied Versatile Planner via Task-Adaptive 3D-Grounded and Embodiment-aware Reasoning
by: Liu, Yuecheng, et al.
Published: (2025)
by: Liu, Yuecheng, et al.
Published: (2025)
Path planning for unmanned surface vehicle based on predictive artificial potential field. International Journal of Advanced Robotic Systems
by: Song, Jia, et al.
Published: (2026)
by: Song, Jia, et al.
Published: (2026)
Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting
by: Xu, Yiming, et al.
Published: (2024)
by: Xu, Yiming, et al.
Published: (2024)
Learning to Play Atari in a World of Tokens
by: Agarwal, Pranav, et al.
Published: (2024)
by: Agarwal, Pranav, et al.
Published: (2024)
Probing the Impact of Scale on Data-Efficient, Generalist Transformer World Models for Atari
by: Kim, Jooyeon
Published: (2026)
by: Kim, Jooyeon
Published: (2026)
ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards Spatial-Temporal Cognition with Foundation Models
by: Zhang, Lingfeng, et al.
Published: (2024)
by: Zhang, Lingfeng, et al.
Published: (2024)
MotionLLM: Understanding Human Behaviors from Human Motions and Videos
by: Chen, Ling-Hao, et al.
Published: (2024)
by: Chen, Ling-Hao, et al.
Published: (2024)
Diffusion for World Modeling: Visual Details Matter in Atari
by: Alonso, Eloi, et al.
Published: (2024)
by: Alonso, Eloi, et al.
Published: (2024)
More than A Point: Capturing Uncertainty with Adaptive Affordance Heatmaps for Spatial Grounding in Robotic Tasks
by: Shao, Xinyu, et al.
Published: (2025)
by: Shao, Xinyu, et al.
Published: (2025)
HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning
by: Delfosse, Quentin, et al.
Published: (2024)
by: Delfosse, Quentin, et al.
Published: (2024)
Theory of Dielectric Behavior in Composites
by: Hao, Lifeng, et al.
Published: (2025)
by: Hao, Lifeng, et al.
Published: (2025)
Occlusion-Aware 3D Motion Interpretation for Abnormal Behavior Detection
by: Li, Su, et al.
Published: (2024)
by: Li, Su, et al.
Published: (2024)
Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors
by: Fang, Hao, et al.
Published: (2025)
by: Fang, Hao, et al.
Published: (2025)
Behavioral Authentication for Security and Safety
by: Wang, Cheng, et al.
Published: (2023)
by: Wang, Cheng, et al.
Published: (2023)
Generative Modeling for Adversarial Lane-Change Scenarios
by: Zhang, Chuancheng, et al.
Published: (2025)
by: Zhang, Chuancheng, et al.
Published: (2025)
FastAnimate: Towards Learnable Template Construction and Pose Deformation for Fast 3D Human Avatar Animation
by: Shu, Jian, et al.
Published: (2025)
by: Shu, Jian, et al.
Published: (2025)
MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents
by: Wei, Ziming, et al.
Published: (2025)
by: Wei, Ziming, et al.
Published: (2025)
BTC-LLM: Efficient Sub-1-Bit LLM Quantization via Learnable Transformation and Binary Codebook
by: Gu, Hao, et al.
Published: (2025)
by: Gu, Hao, et al.
Published: (2025)
SpatialCoT: Advancing Spatial Reasoning through Coordinate Alignment and Chain-of-Thought for Embodied Task Planning
by: Liu, Yuecheng, et al.
Published: (2025)
by: Liu, Yuecheng, et al.
Published: (2025)
AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing
by: Lian, Niu, et al.
Published: (2025)
by: Lian, Niu, et al.
Published: (2025)
AutoLayout: Closed-Loop Layout Synthesis via Slow-Fast Collaborative Reasoning
by: Chen, Weixing, et al.
Published: (2025)
by: Chen, Weixing, et al.
Published: (2025)
Optimal-Agent-Selection: State-Aware Routing Framework for Efficient Multi-Agent Collaboration
by: Wang, Jingbo, et al.
Published: (2025)
by: Wang, Jingbo, et al.
Published: (2025)
Unexpected Crystallization Behavior of Polypropylene With Low Rubber Content
by: Hengyuan Zhang, et al.
Published: (2026)
by: Hengyuan Zhang, et al.
Published: (2026)
QB-LIF: Learnable-Scale Quantized Burst Neurons for Efficient SNNs
by: Bai, Dewei, et al.
Published: (2026)
by: Bai, Dewei, et al.
Published: (2026)
Efficient Self-Supervised Video Hashing with Selective State Spaces
by: Wang, Jinpeng, et al.
Published: (2024)
by: Wang, Jinpeng, et al.
Published: (2024)
SCB-Dataset3: A Benchmark for Detecting Student Classroom Behavior
by: Yang, Fan, et al.
Published: (2023)
by: Yang, Fan, et al.
Published: (2023)
Atari-GPT: Benchmarking Multimodal Large Language Models as Low-Level Policies in Atari Games
by: Waytowich, Nicholas R., et al.
Published: (2024)
by: Waytowich, Nicholas R., et al.
Published: (2024)
Offline Behavioral Data Selection
by: Lei, Shiye, et al.
Published: (2025)
by: Lei, Shiye, et al.
Published: (2025)
Sample-Efficient Behavior Cloning Using General Domain Knowledge
by: Zhu, Feiyu, et al.
Published: (2025)
by: Zhu, Feiyu, et al.
Published: (2025)
CLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks
by: Fang, Hao, et al.
Published: (2024)
by: Fang, Hao, et al.
Published: (2024)
Similar Items
-
3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow
by: Ma, Yueen, et al.
Published: (2025) -
A Survey on Vision-Language-Action Models for Embodied AI
by: Ma, Yueen, et al.
Published: (2024) -
CR-Eyes: A Computational Rational Model of Visual Sampling Behavior in Atari Games
by: Lorenz, Martin, et al.
Published: (2026) -
Astra: Efficient Transformer Architecture and Contrastive Dynamics Learning for Embodied Instruction Following
by: Ma, Yueen, et al.
Published: (2024) -
ED2: Environment Dynamics Decomposition World Models for Continuous Control
by: Hao, Jianye, et al.
Published: (2021)