Saved in:
| Main Authors: | Parolari, Luca, Faccioli, Nicla, Ballan, Lamberto |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.25358 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
7Bench: a Comprehensive Benchmark for Layout-guided Text-to-image Models
by: Izzo, Elena, et al.
Published: (2025)
by: Izzo, Elena, et al.
Published: (2025)
Harlequin: Color-driven Generation of Synthetic Data for Referring Expression Comprehension
by: Parolari, Luca, et al.
Published: (2024)
by: Parolari, Luca, et al.
Published: (2024)
Towards Polyp Counting In Full-Procedure Colonoscopy Videos
by: Parolari, Luca, et al.
Published: (2025)
by: Parolari, Luca, et al.
Published: (2025)
Temporally-Aware Supervised Contrastive Learning for Polyp Counting in Colonoscopy
by: Parolari, Luca, et al.
Published: (2025)
by: Parolari, Luca, et al.
Published: (2025)
Contrastive Learning under Noisy Temporal Self-Supervision for Colonoscopy Videos
by: Parolari, Luca, et al.
Published: (2026)
by: Parolari, Luca, et al.
Published: (2026)
Multiview Progress Prediction of Robot Activities
by: Zoppellari, Elena, et al.
Published: (2026)
by: Zoppellari, Elena, et al.
Published: (2026)
PersONAL: Towards a Comprehensive Benchmark for Personalized Embodied Agents
by: Ziliotto, Filippo, et al.
Published: (2025)
by: Ziliotto, Filippo, et al.
Published: (2025)
You Only Landmark Once: Lightweight U-Net Face Super Resolution with YOLO-World Landmark Heatmaps
by: Carraro, Riccardo, et al.
Published: (2026)
by: Carraro, Riccardo, et al.
Published: (2026)
MLFM: Multi-Layered Feature Maps for Richer Language Understanding in Zero-Shot Semantic Navigation
by: Raychaudhuri, Sonia, et al.
Published: (2025)
by: Raychaudhuri, Sonia, et al.
Published: (2025)
Assessing the Visual Enumeration Abilities of Specialized Counting Architectures and Vision-Language Models
by: Hou, Kuinan, et al.
Published: (2025)
by: Hou, Kuinan, et al.
Published: (2025)
Distilling Knowledge for Short-to-Long Term Trajectory Prediction
by: Das, Sourav, et al.
Published: (2023)
by: Das, Sourav, et al.
Published: (2023)
Following the Human Thread in Social Navigation
by: Scofano, Luca, et al.
Published: (2024)
by: Scofano, Luca, et al.
Published: (2024)
SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation
by: Jia, Chengyou, et al.
Published: (2023)
by: Jia, Chengyou, et al.
Published: (2023)
Spatial Diffusion for Cell Layout Generation
by: Li, Chen, et al.
Published: (2024)
by: Li, Chen, et al.
Published: (2024)
Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors
by: Chen, Minglin, et al.
Published: (2025)
by: Chen, Minglin, et al.
Published: (2025)
SemLayoutDiff: Semantic Layout Generation with Diffusion Model for Indoor Scene Synthesis
by: Sun, Xiaohao, et al.
Published: (2025)
by: Sun, Xiaohao, et al.
Published: (2025)
Bayesian Fields: Task-driven Open-Set Semantic Gaussian Splatting
by: Maggio, Dominic, et al.
Published: (2025)
by: Maggio, Dominic, et al.
Published: (2025)
Open-Set Biometrics: Beyond Good Closed-Set Models
by: Su, Yiyang, et al.
Published: (2024)
by: Su, Yiyang, et al.
Published: (2024)
LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation
by: Zheng, Guangcong, et al.
Published: (2023)
by: Zheng, Guangcong, et al.
Published: (2023)
Layout Stroke Imitation: A Layout Guided Handwriting Stroke Generation for Style Imitation with Diffusion Model
by: Hanif, Sidra, et al.
Published: (2025)
by: Hanif, Sidra, et al.
Published: (2025)
LayoutAgent: A Vision-Language Agent Guided Compositional Diffusion for Spatial Layout Planning
by: Fan, Zezhong, et al.
Published: (2025)
by: Fan, Zezhong, et al.
Published: (2025)
Spatial-DISE: A Unified Benchmark for Evaluating Spatial Reasoning in Vision-Language Models
by: Huang, Xinmiao, et al.
Published: (2025)
by: Huang, Xinmiao, et al.
Published: (2025)
STAY Diffusion: Styled Layout Diffusion Model for Diverse Layout-to-Image Generation
by: Wang, Ruyu, et al.
Published: (2025)
by: Wang, Ruyu, et al.
Published: (2025)
Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models
by: Taghipour, Ashkan, et al.
Published: (2024)
by: Taghipour, Ashkan, et al.
Published: (2024)
MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion
by: Peng, Fei, et al.
Published: (2025)
by: Peng, Fei, et al.
Published: (2025)
Layout Control and Semantic Guidance with Attention Loss Backward for T2I Diffusion Model
by: Li, Guandong
Published: (2024)
by: Li, Guandong
Published: (2024)
POCI-Diff: Position Objects Consistently and Interactively with 3D-Layout Guided Diffusion
by: Rigo, Andrea, et al.
Published: (2026)
by: Rigo, Andrea, et al.
Published: (2026)
CoSMo3D: Open-World Promptable 3D Semantic Part Segmentation through LLM-Guided Canonical Spatial Modeling
by: Jin, Li, et al.
Published: (2026)
by: Jin, Li, et al.
Published: (2026)
Semantic Foam: Unifying Spatial and Semantic Scene Decomposition
by: Sharafeldin, Amr, et al.
Published: (2026)
by: Sharafeldin, Amr, et al.
Published: (2026)
uLayout: Unified Room Layout Estimation for Perspective and Panoramic Images
by: Lee, Jonathan, et al.
Published: (2025)
by: Lee, Jonathan, et al.
Published: (2025)
Layout-Guided Controllable Pathology Image Generation with In-Context Diffusion Transformers
by: Shou, Yuntao, et al.
Published: (2026)
by: Shou, Yuntao, et al.
Published: (2026)
Setting-Matched and Semantics-Scaled Benchmarking of One-Step Generative Models Against Multistep Diffusion and Flow Models
by: Ravishankar, Advaith, et al.
Published: (2026)
by: Ravishankar, Advaith, et al.
Published: (2026)
Mem3R: Streaming 3D Reconstruction with Hybrid Memory via Test-Time Training
by: Liu, Changkun, et al.
Published: (2026)
by: Liu, Changkun, et al.
Published: (2026)
Enhancing Image Layout Control with Loss-Guided Diffusion Models
by: Patel, Zakaria, et al.
Published: (2024)
by: Patel, Zakaria, et al.
Published: (2024)
Uni-Layout: Integrating Human Feedback in Unified Layout Generation and Evaluation
by: Lu, Shuo, et al.
Published: (2025)
by: Lu, Shuo, et al.
Published: (2025)
Consistent Image Layout Editing with Diffusion Models
by: Xia, Tao, et al.
Published: (2025)
by: Xia, Tao, et al.
Published: (2025)
UniLayDiff: A Unified Diffusion Transformer for Content-Aware Layout Generation
by: Liu, Zeyang, et al.
Published: (2025)
by: Liu, Zeyang, et al.
Published: (2025)
LED Benchmark: Diagnosing Structural Layout Errors for Document Layout Analysis
by: Heo, Inbum, et al.
Published: (2025)
by: Heo, Inbum, et al.
Published: (2025)
Guiding Diffusion Models with Semantically Degraded Conditions
by: Han, Shilong, et al.
Published: (2026)
by: Han, Shilong, et al.
Published: (2026)
Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation
by: Zhang, Xiaoyu, et al.
Published: (2024)
by: Zhang, Xiaoyu, et al.
Published: (2024)
Similar Items
-
7Bench: a Comprehensive Benchmark for Layout-guided Text-to-image Models
by: Izzo, Elena, et al.
Published: (2025) -
Harlequin: Color-driven Generation of Synthetic Data for Referring Expression Comprehension
by: Parolari, Luca, et al.
Published: (2024) -
Towards Polyp Counting In Full-Procedure Colonoscopy Videos
by: Parolari, Luca, et al.
Published: (2025) -
Temporally-Aware Supervised Contrastive Learning for Polyp Counting in Colonoscopy
by: Parolari, Luca, et al.
Published: (2025) -
Contrastive Learning under Noisy Temporal Self-Supervision for Colonoscopy Videos
by: Parolari, Luca, et al.
Published: (2026)