:: Library Catalog

Bibliographic Details
Main Authors:	Gao, Jialin, Zhou, Donghao, Liang, Mingjian, Liu, Lihao, Fu, Chi-Wing, Hu, Xiaowei, Heng, Pheng-Ann
Format:	Preprint
Published:	2025
Subjects:	Robotics; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2510.02178

Similar Items

DisCo: Disentangled Control for Realistic Human Dance Generation
by: Wang, Tan, et al.
Published: (2023)

IdentityStory: Taming Your Identity-Preserving Generator for Human-Centric Story Generation
by: Zhou, Donghao, et al.
Published: (2025)

DisCo: Towards Harmonious Disentanglement and Collaboration between Tabular and Semantic Space for Recommendation
by: Du, Kounianhua, et al.
Published: (2024)

Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making
by: Wang, Yihan, et al.
Published: (2025)

DisCo-Speech: Controllable Zero-Shot Speech Generation with A Disentangled Speech Codec
by: Li, Tao, et al.
Published: (2025)

SemLayoutDiff: Semantic Layout Generation with Diffusion Model for Indoor Scene Synthesis
by: Sun, Xiaohao, et al.
Published: (2025)

Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models
by: Xu, Jiaqi, et al.
Published: (2024)

DisCo: Graph-Based Disentangled Contrastive Learning for Cold-Start Cross-Domain Recommendation
by: Li, Hourun, et al.
Published: (2024)

Unveiling Deep Shadows: A Survey and Benchmark on Image and Video Shadow Detection, Removal, and Generation in the Deep Learning Era
by: Hu, Xiaowei, et al.
Published: (2024)

LayoutCoT: Unleashing the Deep Reasoning Potential of Large Language Models for Layout Generation
by: Shi, Hengyu, et al.
Published: (2025)

DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing
by: Chi, Yufeng, et al.
Published: (2025)

Overcoming Support Dilution for Robust Few-shot Semantic Segmentation
by: Tang, Wailing, et al.
Published: (2025)

DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents
by: Xu, Yilun, et al.
Published: (2024)

DisCo: Towards Distinct and Coherent Visual Encapsulation in Video MLLMs
by: Zhao, Jiahe, et al.
Published: (2025)

Video Instance Shadow Detection Under the Sun and Sky
by: Xing, Zhenghao, et al.
Published: (2022)

EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning
by: Xing, Zhenghao, et al.
Published: (2025)

Revisiting Shadow Detection: A New Benchmark Dataset for Complex World
by: Hu, Xiaowei, et al.
Published: (2019)

UniHOPE: A Unified Approach for Hand-Only and Hand-Object Pose Estimation
by: Wang, Yinqiao, et al.
Published: (2025)

SiMA-Hand: Boosting 3D Hand-Mesh Reconstruction by Single-to-Multi-View Adaptation
by: Wang, Yinqiao, et al.
Published: (2024)

DisCo-FLoc: Semantic-Free Floorplan Localization via $SE(2)$-Aware Contrastive Disambiguation
by: Zhong, Ping, et al.
Published: (2026)

StructLayoutFormer: Conditional Structured Layout Generation via Structure Serialization and Disentanglement
by: Hu, Xin, et al.
Published: (2025)

SPATIALGEN: Layout-guided 3D Indoor Scene Generation
by: Fang, Chuan, et al.
Published: (2025)

Perceive-then-Plan: Layout-as-Policy for Monocular 3D Scene Layout Estimation
by: Zhou, Junwei, et al.
Published: (2026)

CasLayout: Cascaded 3D Layout Diffusion for Indoor Scene Synthesis with Implicit Relation Modeling
by: Wu, Yingrui, et al.
Published: (2026)

DisCo: Distributed Contact-Rich Trajectory Optimization for Forceful Multi-Robot Collaboration
by: Shorinwa, Ola, et al.
Published: (2024)

O-DisCo-Edit: Object Distortion Control for Unified Realistic Video Editing
by: Chen, Yuqing, et al.
Published: (2025)

AutoLayout: Closed-Loop Layout Synthesis via Slow-Fast Collaborative Reasoning
by: Chen, Weixing, et al.
Published: (2025)

MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion
by: Peng, Fei, et al.
Published: (2025)

Co-Layout: LLM-driven Co-optimization for Interior Layout
by: Xiang, Chucheng, et al.
Published: (2025)

DisCo-DSO: Coupling Discrete and Continuous Optimization for Efficient Generative Design in Hybrid Spaces
by: Pettit, Jacob F., et al.
Published: (2024)

Coordinated 2D-3D Visualization of Volumetric Medical Data in XR with Multimodal Interactions
by: Liu, Qixuan, et al.
Published: (2025)

MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language Models
by: Yan, Qiao, et al.
Published: (2025)

PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis
by: Lv, Zhengyao, et al.
Published: (2024)

InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior
by: Lin, Chenguo, et al.
Published: (2024)

LayoutAgent: A Vision-Language Agent Guided Compositional Diffusion for Spatial Layout Planning
by: Fan, Zezhong, et al.
Published: (2025)

RoomPlanner: Explicit Layout Planner for Easier LLM-Driven 3D Room Generation
by: Sun, Wenzhuo, et al.
Published: (2025)

Rethinking Intermediate Representation for VLM-based Robot Manipulation
by: Tang, Weiliang, et al.
Published: (2025)

Ev-Layout: A Large-scale Event-based Multi-modal Dataset for Indoor Layout Estimation and Tracking
by: Guo, Xucheng, et al.
Published: (2025)

LPI-RIT at LeWiDi-2025: Improving Distributional Predictions via Metadata and Loss Reweighting with DisCo
by: Sawkar, Mandira, et al.
Published: (2025)

Hand-Shadow Poser
by: Xu, Hao, et al.
Published: (2025)