:: Library Catalog

Bibliographic Details
Main Authors:	Goyal, Shreya; Khan, Naimul; Chattopadhyay, Chiranjoy; Bhatnagar, Gaurav
Format:	Preprint
Published:	2021
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2103.08297

Similar Items

Knowledge driven Description Synthesis for Floor Plan Interpretation
by: Goyal, Shreya, et al.
Published: (2021)

Enhancing Fruit and Vegetable Detection in Unconstrained Environment with a Novel Dataset
by: Khanna, Sandeep, et al.
Published: (2024)

DynaSeg: A Deep Dynamic Fusion Method for Unsupervised Image Segmentation Incorporating Feature Similarity and Spatial Continuity
by: Guermazi, Boujemaa, et al.
Published: (2024)

DynaGuide: A Generalizable Dynamic Guidance Framework for Unsupervised Semantic Segmentation
by: Guermazi, Boujemaa, et al.
Published: (2026)

FaceGemma: Enhancing Image Captioning with Facial Attributes for Portrait Images
by: Haque, Naimul, et al.
Published: (2023)

Temporal Feature Weaving for Neonatal Echocardiographic Viewpoint Video Classification
by: French, Satchel, et al.
Published: (2025)

Stress Classification from ECG Signals Using Vision Transformer
by: Ahmad, Zeeshan, et al.
Published: (2026)

Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture
by: Athar, ShahRukh, et al.
Published: (2024)

Capture, Canonicalize, Splat: Zero-Shot 3D Gaussian Avatars from Unstructured Phone Images
by: Garbin, Emanuel, et al.
Published: (2025)

STRAP-ViT: Segregated Tokens with Randomized -- Transformations for Defense against Adversarial Patches in ViTs
by: Chattopadhyay, Nandish, et al.
Published: (2026)

AUGCAL: Improving Sim2Real Adaptation by Uncertainty Calibration on Augmented Synthetic Images
by: Chattopadhyay, Prithvijit, et al.
Published: (2023)

GazeProphetV2: Head-Movement-Based Gaze Prediction Enabling Efficient Foveated Rendering on Mobile VR
by: Ebadulla, Farhaan, et al.
Published: (2025)

Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation
by: Eldesokey, Abdelrahman, et al.
Published: (2024)

Maximizing Generalization: The Effect of Different Augmentation Techniques on Lightweight Vision Transformer for Bengali Character Classification
by: Chowdhury, Rafi Hassan, et al.
Published: (2026)

Computer-Aided Layout Generation for Building Design: A Review
by: Liu, Jiachen, et al.
Published: (2025)

Image Demoiréing Using Dual Camera Fusion on Mobile Phones
by: Mei, Yanting, et al.
Published: (2025)

uLayout: Unified Room Layout Estimation for Perspective and Panoramic Images
by: Lee, Jonathan, et al.
Published: (2025)

Efficient Hybrid Zoom using Camera Fusion on Mobile Phones
by: Wu, Xiaotong, et al.
Published: (2024)

CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
by: Zhang, Hui, et al.
Published: (2024)

STAY Diffusion: Styled Layout Diffusion Model for Diverse Layout-to-Image Generation
by: Wang, Ruyu, et al.
Published: (2025)

Tokenizing Buildings: A Transformer for Layout Synthesis
by: de Guevara, Manuel Ladron, et al.
Published: (2025)

Chat2Layout: Interactive 3D Furniture Layout with a Multimodal LLM
by: Wang, Can, et al.
Published: (2024)

Consistent Image Layout Editing with Diffusion Models
by: Xia, Tao, et al.
Published: (2025)

Layout-to-Image Generation with Localized Descriptions using ControlNet with Cross-Attention Control
by: Lukovnikov, Denis, et al.
Published: (2024)

BornoViT: A Novel Efficient Vision Transformer for Bengali Handwritten Basic Characters Classification
by: Chowdhury, Rafi Hassan, et al.
Published: (2026)

ToLo: A Two-Stage, Training-Free Layout-To-Image Generation Framework For High-Overlap Layouts
by: Huang, Linhao, et al.
Published: (2025)

LayoutFlow: Flow Matching for Layout Generation
by: Guerreiro, Julian Jorge Andrade, et al.
Published: (2024)

Dual-Camera Smooth Zoom on Mobile Phones
by: Wu, Renlong, et al.
Published: (2024)

SciPostLayout: A Dataset for Layout Analysis and Layout Generation of Scientific Posters
by: Tanaka, Shohei, et al.
Published: (2024)

Griffin: Generative Reference and Layout Guided Image Composition
by: Mikaeili, Aryan, et al.
Published: (2025)

LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
by: Yu, Ning, et al.
Published: (2022)

MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer
by: Gao, Shanghua, et al.
Published: (2023)

An Object-Centered Data Acquisition Method for 3D Gaussian Splatting using Mobile Phones
by: Zhang, Yuezhe, et al.
Published: (2026)

Synthesizing Iris Images using Generative Adversarial Networks: Survey and Comparative Analysis
by: Yadav, Shivangi, et al.
Published: (2024)

Layout Agnostic Scene Text Image Synthesis with Diffusion Models
by: Zhangli, Qilong, et al.
Published: (2024)

Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
by: Cheng, Jiaxin, et al.
Published: (2024)

Training-Free Layout-to-Image Generation with Marginal Attention Constraints
by: Chen, Huancheng, et al.
Published: (2024)

ConsistCompose: Unified Multimodal Layout Control for Image Composition
by: Shi, Xuanke, et al.
Published: (2025)

PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis
by: Lv, Zhengyao, et al.
Published: (2024)

IMAGHarmony: Controllable Image Editing with Consistent Object Quantity and Layout
by: Shen, Fei, et al.
Published: (2025)