:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Swami, Kunal, Chittersu, Raghu, Rathore, Yuvraj, Irny, Rajeev, Doodekula, Shashavali, Shukla, Alok
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.14237
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

PromptArtisan: Multi-instruction Image Editing in Single Pass with Complete Attention Control
by: Swami, Kunal, et al.
Published: (2025)

Insert In Style: A Zero-Shot Generative Framework for Harmonious Cross-Domain Object Composition
by: Chittersu, Raghu Vamsi, et al.
Published: (2025)

Adjust Your Focus: Defocus Deblurring From Dual-Pixel Images Using Explicit Multi-Scale Cross-Correlation
by: Swami, Kunal
Published: (2025)

Generating Part-Based Global Explanations Via Correspondence
by: Rathore, Kunal, et al.
Published: (2025)

Freq-DP Net: A Dual-Branch Network for Fence Removal using Dual-Pixel and Fourier Priors
by: Swami, Kunal, et al.
Published: (2026)

Toward Intelligent Scene Augmentation for Context-Aware Object Placement and Sponsor-Logo Integration
by: Saraswat, Unnati, et al.
Published: (2025)

Zero-Shot Product Attribute Labeling with Vision-Language Models: A Three-Tier Evaluation Framework
by: Shukla, Shubham, et al.
Published: (2026)

Can GPT-4o mini and Gemini 2.0 Flash Predict Fine-Grained Fashion Product Attributes? A Zero-Shot Analysis
by: Shukla, Shubham, et al.
Published: (2025)

Semantic-Guided 3D Gaussian Splatting for Transient Object Removal
by: Prabakaran, Aditi, et al.
Published: (2026)

DiffPop: Plausibility-Guided Object Placement Diffusion for Image Composition
by: Liu, Jiacheng, et al.
Published: (2024)

Object Placement for Anything
by: Gao, Bingjie, et al.
Published: (2025)

Visually Interpretable Subtask Reasoning for Visual Question Answering
by: Cheng, Yu, et al.
Published: (2025)

Toward Strategy Identification and Subtask Decomposition In Task Exploration
by: Odem, Tom
Published: (2025)

DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning
by: Swami, Kunal, et al.
Published: (2025)

Object Pose Estimation through Dexterous Touch
by: Shahidzadeh, Amir-Hossein, et al.
Published: (2025)

TouchAnything: Diffusion-Guided 3D Reconstruction from Sparse Robot Touches
by: Gu, Langzhe, et al.
Published: (2026)

Instruction Guided Multi Object Image Editing with Quantity and Layout Consistency
by: Tan, Jiaqi, et al.
Published: (2025)

Planning Robot Placement for Object Grasping
by: Saini, Manish, et al.
Published: (2024)

HiddenObjects: Scalable Diffusion-Distilled Spatial Priors for Object Placement
by: Schouten, Marco, et al.
Published: (2026)

PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes
by: Abdelreheem, Ahmed, et al.
Published: (2025)

Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection
by: Inoue, Riku, et al.
Published: (2025)

AFTER: Mitigating the Object Hallucination of LVLM via Adaptive Factual-Guided Activation Editing
by: Wang, Tianbo, et al.
Published: (2026)

Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation
by: Li, Jiusi, et al.
Published: (2025)

Imagining the Unseen: Generative Location Modeling for Object Placement
by: Yun, Jooyeol, et al.
Published: (2024)

DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction
by: Wang, Miaowei, et al.
Published: (2025)

InteracTalker: Prompt-Based Human-Object Interaction with Co-Speech Gesture Generation
by: Rajan, Sreehari, et al.
Published: (2025)

PseudoTouch: Efficiently Imaging the Surface Feel of Objects for Robotic Manipulation
by: Röfer, Adrian, et al.
Published: (2024)

Touch-R1: Reinforcing Touch Reasoning in MLLMs
by: Lai, Yingxin, et al.
Published: (2026)

Subtask-Aware Visual Reward Learning from Segmented Demonstrations
by: Kim, Changyeon, et al.
Published: (2025)

A$^2$-Edit: Precise Reference-Guided Image Editing of Arbitrary Objects and Ambiguous Masks
by: Zheng, Huayu, et al.
Published: (2026)

Explainability-Inspired Layer-Wise Pruning of Deep Neural Networks for Efficient Object Detection
by: Shukla, Abhinav, et al.
Published: (2026)

FlowDC: Flow-Based Decoupling-Decay for Complex Image Editing
by: Jiang, Yilei, et al.
Published: (2025)

Zero-shot Face Editing via ID-Attribute Decoupled Inversion
by: Hou, Yang, et al.
Published: (2025)

ACE-LoRA: Adaptive Orthogonal Decoupling for Continual Image Editing
by: Liu, Yuehao, et al.
Published: (2026)

FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing
by: Wu, Junyi, et al.
Published: (2025)

Spatial Transform Decoupling for Oriented Object Detection
by: Yu, Hongtian, et al.
Published: (2023)

BOOTPLACE: Bootstrapped Object Placement with Detection Transformers
by: Zhou, Hang, et al.
Published: (2025)

RA-Touch: Retrieval-Augmented Touch Understanding with Enriched Visual Data
by: Cho, Yoorhim, et al.
Published: (2025)

HipyrNet: Hypernet-Guided Feature Pyramid network for mixed-exposure correction
by: Rathore, Shaurya Singh, et al.
Published: (2025)

Controllable 3D Placement of Objects with Scene-Aware Diffusion Models
by: Omran, Mohamed, et al.
Published: (2025)