| Main Authors: | Shi, Zhenwu, Gong, Jingyu, Wang, Peiwei, Wang, Xingzan, Qian, Tianwen, Li, Wenxi, Fang, Yuan, Xie, Jiao, Ma, Lizhuang, Lin, Shaohui |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.30969 |
Similar Items
DEMOS: Dynamic Environment Motion Synthesis in 3D Scenes via Local Spherical-BEV Perception
by: Gong, Jingyu, et al.
Published: (2024)
HSACNet: Hierarchical Scale-Aware Consistency Regularized Semi-Supervised Change Detection
by: Xu, Qi'ao, et al.
Published: (2025)
OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks
by: Wang, Jiayu, et al.
Published: (2025)
Class-Imbalanced Semi-Supervised Learning for Large-Scale Point Cloud Semantic Segmentation via Decoupling Optimization
by: Li, Mengtian, et al.
Published: (2024)
UniForward: Unified 3D Scene and Semantic Field Reconstruction via Feed-Forward Gaussian Splatting from Only Sparse-View Images
by: Tian, Qijian, et al.
Published: (2025)
FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis
by: Fan, Ke, et al.
Published: (2024)
Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation
by: Fan, Ke, et al.
Published: (2024)
Bridging the 2D-3D Gap: A Hierarchical Semantic-Geometric Map for Vision Language Navigation
by: Li, Kailing, et al.
Published: (2026)
GSCompleter: A Distillation-Free Plugin for Metric-Aware 3D Gaussian Splatting Completion in Seconds
by: Gao, Ao, et al.
Published: (2026)
MXene‐Based Nanosheets Reverse Tumor Hypoxia to Amplify Triple‐Mode Cancer Therapy
by: Wenzhi Yang, et al.
Published: (2025)
CLiViS: Unleashing Cognitive Map through Linguistic-Visual Synergy for Embodied Visual Reasoning
by: Li, Kailing, et al.
Published: (2025)
NeighborRetr: Balancing Hub Centrality in Cross-Modal Retrieval
by: Lin, Zengrong, et al.
Published: (2025)
DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation
by: Gu, Tianjun, et al.
Published: (2025)
Prompt as Free Lunch: Enhancing Diversity in Source-Free Cross-domain Few-shot Learning through Semantic-Guided Prompting
by: Zhuo, Linhai, et al.
Published: (2024)
S2GS: Streaming Semantic Gaussian Splatting for Online Scene Understanding and Reconstruction
by: Zhang, Renhe, et al.
Published: (2026)
Vision-language models lag human performance on physical dynamics and intent reasoning
by: Gu, Tianjun, et al.
Published: (2026)
Empathy Omni: Enabling Empathetic Speech Response Generation through Large Language Models
by: Wang, Haoyu, et al.
Published: (2025)
MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model
by: Wang, Sen, et al.
Published: (2024)
Engineering substrate channeling with synthetic biomolecular condensates for improved ectoine biosynthesis
by: Wang, Tianwen
Published: (2026)
A CVaR‐Based Optimal Perimeter Control Framework With Consideration of Boundary Queue Length
by: Ying Zhang, et al.
Published: (2026)
Can LLMs Reason Like Automated Theorem Provers for Rust Verification? VCoT-Bench: Evaluating via Verification Chain of Thought
by: Xie, Zichen, et al.
Published: (2026)
Omni-Customizer: End-to-End MultiModal Customization for Joint Audio-Video Generation
by: Chen, Yuheng, et al.
Published: (2026)
DreamOmni: Unified Image Generation and Editing
by: Xia, Bin, et al.
Published: (2024)
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
by: Wei, Cong, et al.
Published: (2024)
Quantum transport theory with vector interaction
by: Yu, Peiwei, et al.
Published: (2022)
PFDepth: Heterogeneous Pinhole-Fisheye Joint Depth Estimation via Distortion-aware Gaussian-Splatted Volumetric Fusion
by: Zhang, Zhiwei, et al.
Published: (2025)
Low‐Coordination Configuration Single‐Atom Manganese Nanozymes for NIR‐Imaging‐Oriented Efficient Catalytic Oncotherapy
by: Peiwei Jin, et al.
Published: (2025)
Rethinking Invariance in In-context Learning
by: Fang, Lizhe, et al.
Published: (2025)
Editing Physiological Signals in Videos Using Latent Representations
by: Zhou, Tianwen, et al.
Published: (2025)
Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing
by: Zuo, Yi, et al.
Published: (2024)
Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation
by: Li, Sien, et al.
Published: (2024)
Emphasizing Semantic Consistency of Salient Posture for Speech-Driven Gesture Generation
by: Liu, Fengqi, et al.
Published: (2024)
Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
by: Xu, Pengcheng, et al.
Published: (2024)
Water‐Dispersible MXene Governs Glycolysis for Cancer Synergistic Therapy
by: Jinfeng Liu, et al.
Published: (2025)
OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation
by: Li, Weiqi, et al.
Published: (2024)
Omni$^2$: Unifying Omnidirectional Image Generation and Editing in an Omni Model
by: Yang, Liu, et al.
Published: (2025)
MotionMaster: Training-free Camera Motion Transfer For Video Generation
by: Hu, Teng, et al.
Published: (2024)
OmniMotion-X: Versatile Multimodal Whole-Body Motion Generation
by: Xu, Guowei, et al.
Published: (2025)
Primary Code And Data For Declining Snowpack Delays Spring Green-Up Date through Disrupting the Chilling-Forcing Balancing
by: Wen, Jingyu
Published: (2026)
NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario
by: Qian, Tianwen, et al.
Published: (2023)