Saved in:
| Main Authors: | Goyal, Shreya, Khan, Naimul, Chattopadhyay, Chiranjoy, Bhatnagar, Gaurav |
|---|---|
| Format: | Preprint |
| Published: |
2021
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2103.08297 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Knowledge driven Description Synthesis for Floor Plan Interpretation
by: Goyal, Shreya, et al.
Published: (2021)
by: Goyal, Shreya, et al.
Published: (2021)
Enhancing Fruit and Vegetable Detection in Unconstrained Environment with a Novel Dataset
by: Khanna, Sandeep, et al.
Published: (2024)
by: Khanna, Sandeep, et al.
Published: (2024)
DynaSeg: A Deep Dynamic Fusion Method for Unsupervised Image Segmentation Incorporating Feature Similarity and Spatial Continuity
by: Guermazi, Boujemaa, et al.
Published: (2024)
by: Guermazi, Boujemaa, et al.
Published: (2024)
DynaGuide: A Generalizable Dynamic Guidance Framework for Unsupervised Semantic Segmentation
by: Guermazi, Boujemaa, et al.
Published: (2026)
by: Guermazi, Boujemaa, et al.
Published: (2026)
FaceGemma: Enhancing Image Captioning with Facial Attributes for Portrait Images
by: Haque, Naimul, et al.
Published: (2023)
by: Haque, Naimul, et al.
Published: (2023)
Temporal Feature Weaving for Neonatal Echocardiographic Viewpoint Video Classification
by: French, Satchel, et al.
Published: (2025)
by: French, Satchel, et al.
Published: (2025)
Stress Classification from ECG Signals Using Vision Transformer
by: Ahmad, Zeeshan, et al.
Published: (2026)
by: Ahmad, Zeeshan, et al.
Published: (2026)
Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture
by: Athar, ShahRukh, et al.
Published: (2024)
by: Athar, ShahRukh, et al.
Published: (2024)
Capture, Canonicalize, Splat: Zero-Shot 3D Gaussian Avatars from Unstructured Phone Images
by: Garbin, Emanuel, et al.
Published: (2025)
by: Garbin, Emanuel, et al.
Published: (2025)
STRAP-ViT: Segregated Tokens with Randomized -- Transformations for Defense against Adversarial Patches in ViTs
by: Chattopadhyay, Nandish, et al.
Published: (2026)
by: Chattopadhyay, Nandish, et al.
Published: (2026)
AUGCAL: Improving Sim2Real Adaptation by Uncertainty Calibration on Augmented Synthetic Images
by: Chattopadhyay, Prithvijit, et al.
Published: (2023)
by: Chattopadhyay, Prithvijit, et al.
Published: (2023)
GazeProphetV2: Head-Movement-Based Gaze Prediction Enabling Efficient Foveated Rendering on Mobile VR
by: Ebadulla, Farhaan, et al.
Published: (2025)
by: Ebadulla, Farhaan, et al.
Published: (2025)
Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation
by: Eldesokey, Abdelrahman, et al.
Published: (2024)
by: Eldesokey, Abdelrahman, et al.
Published: (2024)
Maximizing Generalization: The Effect of Different Augmentation Techniques on Lightweight Vision Transformer for Bengali Character Classification
by: Chowdhury, Rafi Hassan, et al.
Published: (2026)
by: Chowdhury, Rafi Hassan, et al.
Published: (2026)
Computer-Aided Layout Generation for Building Design: A Review
by: Liu, Jiachen, et al.
Published: (2025)
by: Liu, Jiachen, et al.
Published: (2025)
Image Demoiréing Using Dual Camera Fusion on Mobile Phones
by: Mei, Yanting, et al.
Published: (2025)
by: Mei, Yanting, et al.
Published: (2025)
uLayout: Unified Room Layout Estimation for Perspective and Panoramic Images
by: Lee, Jonathan, et al.
Published: (2025)
by: Lee, Jonathan, et al.
Published: (2025)
Efficient Hybrid Zoom using Camera Fusion on Mobile Phones
by: Wu, Xiaotong, et al.
Published: (2024)
by: Wu, Xiaotong, et al.
Published: (2024)
CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
by: Zhang, Hui, et al.
Published: (2024)
by: Zhang, Hui, et al.
Published: (2024)
STAY Diffusion: Styled Layout Diffusion Model for Diverse Layout-to-Image Generation
by: Wang, Ruyu, et al.
Published: (2025)
by: Wang, Ruyu, et al.
Published: (2025)
Tokenizing Buildings: A Transformer for Layout Synthesis
by: de Guevara, Manuel Ladron, et al.
Published: (2025)
by: de Guevara, Manuel Ladron, et al.
Published: (2025)
Chat2Layout: Interactive 3D Furniture Layout with a Multimodal LLM
by: Wang, Can, et al.
Published: (2024)
by: Wang, Can, et al.
Published: (2024)
Consistent Image Layout Editing with Diffusion Models
by: Xia, Tao, et al.
Published: (2025)
by: Xia, Tao, et al.
Published: (2025)
Layout-to-Image Generation with Localized Descriptions using ControlNet with Cross-Attention Control
by: Lukovnikov, Denis, et al.
Published: (2024)
by: Lukovnikov, Denis, et al.
Published: (2024)
BornoViT: A Novel Efficient Vision Transformer for Bengali Handwritten Basic Characters Classification
by: Chowdhury, Rafi Hassan, et al.
Published: (2026)
by: Chowdhury, Rafi Hassan, et al.
Published: (2026)
ToLo: A Two-Stage, Training-Free Layout-To-Image Generation Framework For High-Overlap Layouts
by: Huang, Linhao, et al.
Published: (2025)
by: Huang, Linhao, et al.
Published: (2025)
LayoutFlow: Flow Matching for Layout Generation
by: Guerreiro, Julian Jorge Andrade, et al.
Published: (2024)
by: Guerreiro, Julian Jorge Andrade, et al.
Published: (2024)
Dual-Camera Smooth Zoom on Mobile Phones
by: Wu, Renlong, et al.
Published: (2024)
by: Wu, Renlong, et al.
Published: (2024)
SciPostLayout: A Dataset for Layout Analysis and Layout Generation of Scientific Posters
by: Tanaka, Shohei, et al.
Published: (2024)
by: Tanaka, Shohei, et al.
Published: (2024)
Griffin: Generative Reference and Layout Guided Image Composition
by: Mikaeili, Aryan, et al.
Published: (2025)
by: Mikaeili, Aryan, et al.
Published: (2025)
LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
by: Yu, Ning, et al.
Published: (2022)
by: Yu, Ning, et al.
Published: (2022)
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer
by: Gao, Shanghua, et al.
Published: (2023)
by: Gao, Shanghua, et al.
Published: (2023)
An Object-Centered Data Acquisition Method for 3D Gaussian Splatting using Mobile Phones
by: Zhang, Yuezhe, et al.
Published: (2026)
by: Zhang, Yuezhe, et al.
Published: (2026)
Synthesizing Iris Images using Generative Adversarial Networks: Survey and Comparative Analysis
by: Yadav, Shivangi, et al.
Published: (2024)
by: Yadav, Shivangi, et al.
Published: (2024)
Layout Agnostic Scene Text Image Synthesis with Diffusion Models
by: Zhangli, Qilong, et al.
Published: (2024)
by: Zhangli, Qilong, et al.
Published: (2024)
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
by: Cheng, Jiaxin, et al.
Published: (2024)
by: Cheng, Jiaxin, et al.
Published: (2024)
Training-Free Layout-to-Image Generation with Marginal Attention Constraints
by: Chen, Huancheng, et al.
Published: (2024)
by: Chen, Huancheng, et al.
Published: (2024)
ConsistCompose: Unified Multimodal Layout Control for Image Composition
by: Shi, Xuanke, et al.
Published: (2025)
by: Shi, Xuanke, et al.
Published: (2025)
PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis
by: Lv, Zhengyao, et al.
Published: (2024)
by: Lv, Zhengyao, et al.
Published: (2024)
IMAGHarmony: Controllable Image Editing with Consistent Object Quantity and Layout
by: Shen, Fei, et al.
Published: (2025)
by: Shen, Fei, et al.
Published: (2025)
Similar Items
-
Knowledge driven Description Synthesis for Floor Plan Interpretation
by: Goyal, Shreya, et al.
Published: (2021) -
Enhancing Fruit and Vegetable Detection in Unconstrained Environment with a Novel Dataset
by: Khanna, Sandeep, et al.
Published: (2024) -
DynaSeg: A Deep Dynamic Fusion Method for Unsupervised Image Segmentation Incorporating Feature Similarity and Spatial Continuity
by: Guermazi, Boujemaa, et al.
Published: (2024) -
DynaGuide: A Generalizable Dynamic Guidance Framework for Unsupervised Semantic Segmentation
by: Guermazi, Boujemaa, et al.
Published: (2026) -
FaceGemma: Enhancing Image Captioning with Facial Attributes for Portrait Images
by: Haque, Naimul, et al.
Published: (2023)