:: Library Catalog

Bibliographic Details
Main Authors:	Fan, Jiajun, Zhuang, Yuzheng, Liu, Yuecheng, Hao, Jianye, Wang, Bin, Zhu, Jiangcheng, Wang, Hao, Xia, Shu-Tao
Format:	Preprint
Published:	2023
Subjects:	Machine Learning, Artificial Intelligence
Online Access:	https://arxiv.org/abs/2305.05239

Similar Items

3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow
by: Ma, Yueen, et al.
Published: (2025)

A Survey on Vision-Language-Action Models for Embodied AI
by: Ma, Yueen, et al.
Published: (2024)

CR-Eyes: A Computational Rational Model of Visual Sampling Behavior in Atari Games
by: Lorenz, Martin, et al.
Published: (2026)

Astra: Efficient Transformer Architecture and Contrastive Dynamics Learning for Embodied Instruction Following
by: Ma, Yueen, et al.
Published: (2024)

ED2: Environment Dynamics Decomposition World Models for Continuous Control
by: Hao, Jianye, et al.
Published: (2021)

Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control
by: Papoudakis, Georgios, et al.
Published: (2025)

MedDialBench: Benchmarking LLM Diagnostic Robustness under Parametric Adversarial Patient Behaviors
by: Luo, Xiaotian, et al.
Published: (2026)

EVLP: Learning Unified Embodied Vision-Language Planner with Reinforced Supervised Fine-Tuning
by: Cai, Xinyan, et al.
Published: (2025)

SCALE: Self-Correcting Visual Navigation for Mobile Robots via Anti-Novelty Estimation
by: Chen, Chang, et al.
Published: (2024)

AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
by: Dong, Zibin, et al.
Published: (2023)

OmniEVA: Embodied Versatile Planner via Task-Adaptive 3D-Grounded and Embodiment-aware Reasoning
by: Liu, Yuecheng, et al.
Published: (2025)

Path planning for unmanned surface vehicle based on predictive artificial potential field. International Journal of Advanced Robotic Systems
by: Song, Jia, et al.
Published: (2026)

Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting
by: Xu, Yiming, et al.
Published: (2024)

Learning to Play Atari in a World of Tokens
by: Agarwal, Pranav, et al.
Published: (2024)

Probing the Impact of Scale on Data-Efficient, Generalist Transformer World Models for Atari
by: Kim, Jooyeon
Published: (2026)

ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards Spatial-Temporal Cognition with Foundation Models
by: Zhang, Lingfeng, et al.
Published: (2024)

MotionLLM: Understanding Human Behaviors from Human Motions and Videos
by: Chen, Ling-Hao, et al.
Published: (2024)

Diffusion for World Modeling: Visual Details Matter in Atari
by: Alonso, Eloi, et al.
Published: (2024)

More than A Point: Capturing Uncertainty with Adaptive Affordance Heatmaps for Spatial Grounding in Robotic Tasks
by: Shao, Xinyu, et al.
Published: (2025)

HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning
by: Delfosse, Quentin, et al.
Published: (2024)

Theory of Dielectric Behavior in Composites
by: Hao, Lifeng, et al.
Published: (2025)

Occlusion-Aware 3D Motion Interpretation for Abnormal Behavior Detection
by: Li, Su, et al.
Published: (2024)

Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors
by: Fang, Hao, et al.
Published: (2025)

Behavioral Authentication for Security and Safety
by: Wang, Cheng, et al.
Published: (2023)

Generative Modeling for Adversarial Lane-Change Scenarios
by: Zhang, Chuancheng, et al.
Published: (2025)

FastAnimate: Towards Learnable Template Construction and Pose Deformation for Fast 3D Human Avatar Animation
by: Shu, Jian, et al.
Published: (2025)

MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents
by: Wei, Ziming, et al.
Published: (2025)

BTC-LLM: Efficient Sub-1-Bit LLM Quantization via Learnable Transformation and Binary Codebook
by: Gu, Hao, et al.
Published: (2025)

SpatialCoT: Advancing Spatial Reasoning through Coordinate Alignment and Chain-of-Thought for Embodied Task Planning
by: Liu, Yuecheng, et al.
Published: (2025)

AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing
by: Lian, Niu, et al.
Published: (2025)

AutoLayout: Closed-Loop Layout Synthesis via Slow-Fast Collaborative Reasoning
by: Chen, Weixing, et al.
Published: (2025)

Optimal-Agent-Selection: State-Aware Routing Framework for Efficient Multi-Agent Collaboration
by: Wang, Jingbo, et al.
Published: (2025)

Unexpected Crystallization Behavior of Polypropylene With Low Rubber Content
by: Zhang, Hengyuan, et al.
Published: (2026)

QB-LIF: Learnable-Scale Quantized Burst Neurons for Efficient SNNs
by: Bai, Dewei, et al.
Published: (2026)

Efficient Self-Supervised Video Hashing with Selective State Spaces
by: Wang, Jinpeng, et al.
Published: (2024)

SCB-Dataset3: A Benchmark for Detecting Student Classroom Behavior
by: Yang, Fan, et al.
Published: (2023)

Atari-GPT: Benchmarking Multimodal Large Language Models as Low-Level Policies in Atari Games
by: Waytowich, Nicholas R., et al.
Published: (2024)

Offline Behavioral Data Selection
by: Lei, Shiye, et al.
Published: (2025)

Sample-Efficient Behavior Cloning Using General Domain Knowledge
by: Zhu, Feiyu, et al.
Published: (2025)

CLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks
by: Fang, Hao, et al.
Published: (2024)