Saved in:
| Main Authors: | Swami, Kunal, Chittersu, Raghu, Rathore, Yuvraj, Irny, Rajeev, Doodekula, Shashavali, Shukla, Alok |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.14237 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
PromptArtisan: Multi-instruction Image Editing in Single Pass with Complete Attention Control
by: Swami, Kunal, et al.
Published: (2025)
by: Swami, Kunal, et al.
Published: (2025)
Insert In Style: A Zero-Shot Generative Framework for Harmonious Cross-Domain Object Composition
by: Chittersu, Raghu Vamsi, et al.
Published: (2025)
by: Chittersu, Raghu Vamsi, et al.
Published: (2025)
Adjust Your Focus: Defocus Deblurring From Dual-Pixel Images Using Explicit Multi-Scale Cross-Correlation
by: Swami, Kunal
Published: (2025)
by: Swami, Kunal
Published: (2025)
Generating Part-Based Global Explanations Via Correspondence
by: Rathore, Kunal, et al.
Published: (2025)
by: Rathore, Kunal, et al.
Published: (2025)
Freq-DP Net: A Dual-Branch Network for Fence Removal using Dual-Pixel and Fourier Priors
by: Swami, Kunal, et al.
Published: (2026)
by: Swami, Kunal, et al.
Published: (2026)
Toward Intelligent Scene Augmentation for Context-Aware Object Placement and Sponsor-Logo Integration
by: Saraswat, Unnati, et al.
Published: (2025)
by: Saraswat, Unnati, et al.
Published: (2025)
Zero-Shot Product Attribute Labeling with Vision-Language Models: A Three-Tier Evaluation Framework
by: Shukla, Shubham, et al.
Published: (2026)
by: Shukla, Shubham, et al.
Published: (2026)
Can GPT-4o mini and Gemini 2.0 Flash Predict Fine-Grained Fashion Product Attributes? A Zero-Shot Analysis
by: Shukla, Shubham, et al.
Published: (2025)
by: Shukla, Shubham, et al.
Published: (2025)
Semantic-Guided 3D Gaussian Splatting for Transient Object Removal
by: Prabakaran, Aditi, et al.
Published: (2026)
by: Prabakaran, Aditi, et al.
Published: (2026)
DiffPop: Plausibility-Guided Object Placement Diffusion for Image Composition
by: Liu, Jiacheng, et al.
Published: (2024)
by: Liu, Jiacheng, et al.
Published: (2024)
Object Placement for Anything
by: Gao, Bingjie, et al.
Published: (2025)
by: Gao, Bingjie, et al.
Published: (2025)
Visually Interpretable Subtask Reasoning for Visual Question Answering
by: Cheng, Yu, et al.
Published: (2025)
by: Cheng, Yu, et al.
Published: (2025)
Toward Strategy Identification and Subtask Decomposition In Task Exploration
by: Odem, Tom
Published: (2025)
by: Odem, Tom
Published: (2025)
DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning
by: Swami, Kunal, et al.
Published: (2025)
by: Swami, Kunal, et al.
Published: (2025)
Object Pose Estimation through Dexterous Touch
by: Shahidzadeh, Amir-Hossein, et al.
Published: (2025)
by: Shahidzadeh, Amir-Hossein, et al.
Published: (2025)
TouchAnything: Diffusion-Guided 3D Reconstruction from Sparse Robot Touches
by: Gu, Langzhe, et al.
Published: (2026)
by: Gu, Langzhe, et al.
Published: (2026)
Instruction Guided Multi Object Image Editing with Quantity and Layout Consistency
by: Tan, Jiaqi, et al.
Published: (2025)
by: Tan, Jiaqi, et al.
Published: (2025)
Planning Robot Placement for Object Grasping
by: Saini, Manish, et al.
Published: (2024)
by: Saini, Manish, et al.
Published: (2024)
HiddenObjects: Scalable Diffusion-Distilled Spatial Priors for Object Placement
by: Schouten, Marco, et al.
Published: (2026)
by: Schouten, Marco, et al.
Published: (2026)
PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes
by: Abdelreheem, Ahmed, et al.
Published: (2025)
by: Abdelreheem, Ahmed, et al.
Published: (2025)
Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection
by: Inoue, Riku, et al.
Published: (2025)
by: Inoue, Riku, et al.
Published: (2025)
AFTER: Mitigating the Object Hallucination of LVLM via Adaptive Factual-Guided Activation Editing
by: Wang, Tianbo, et al.
Published: (2026)
by: Wang, Tianbo, et al.
Published: (2026)
Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation
by: Li, Jiusi, et al.
Published: (2025)
by: Li, Jiusi, et al.
Published: (2025)
Imagining the Unseen: Generative Location Modeling for Object Placement
by: Yun, Jooyeol, et al.
Published: (2024)
by: Yun, Jooyeol, et al.
Published: (2024)
DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction
by: Wang, Miaowei, et al.
Published: (2025)
by: Wang, Miaowei, et al.
Published: (2025)
InteracTalker: Prompt-Based Human-Object Interaction with Co-Speech Gesture Generation
by: Rajan, Sreehari, et al.
Published: (2025)
by: Rajan, Sreehari, et al.
Published: (2025)
PseudoTouch: Efficiently Imaging the Surface Feel of Objects for Robotic Manipulation
by: Röfer, Adrian, et al.
Published: (2024)
by: Röfer, Adrian, et al.
Published: (2024)
Touch-R1: Reinforcing Touch Reasoning in MLLMs
by: Lai, Yingxin, et al.
Published: (2026)
by: Lai, Yingxin, et al.
Published: (2026)
Subtask-Aware Visual Reward Learning from Segmented Demonstrations
by: Kim, Changyeon, et al.
Published: (2025)
by: Kim, Changyeon, et al.
Published: (2025)
A$^2$-Edit: Precise Reference-Guided Image Editing of Arbitrary Objects and Ambiguous Masks
by: Zheng, Huayu, et al.
Published: (2026)
by: Zheng, Huayu, et al.
Published: (2026)
Explainability-Inspired Layer-Wise Pruning of Deep Neural Networks for Efficient Object Detection
by: Shukla, Abhinav, et al.
Published: (2026)
by: Shukla, Abhinav, et al.
Published: (2026)
FlowDC: Flow-Based Decoupling-Decay for Complex Image Editing
by: Jiang, Yilei, et al.
Published: (2025)
by: Jiang, Yilei, et al.
Published: (2025)
Zero-shot Face Editing via ID-Attribute Decoupled Inversion
by: Hou, Yang, et al.
Published: (2025)
by: Hou, Yang, et al.
Published: (2025)
ACE-LoRA: Adaptive Orthogonal Decoupling for Continual Image Editing
by: Liu, Yuehao, et al.
Published: (2026)
by: Liu, Yuehao, et al.
Published: (2026)
FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing
by: Wu, Junyi, et al.
Published: (2025)
by: Wu, Junyi, et al.
Published: (2025)
Spatial Transform Decoupling for Oriented Object Detection
by: Yu, Hongtian, et al.
Published: (2023)
by: Yu, Hongtian, et al.
Published: (2023)
BOOTPLACE: Bootstrapped Object Placement with Detection Transformers
by: Zhou, Hang, et al.
Published: (2025)
by: Zhou, Hang, et al.
Published: (2025)
RA-Touch: Retrieval-Augmented Touch Understanding with Enriched Visual Data
by: Cho, Yoorhim, et al.
Published: (2025)
by: Cho, Yoorhim, et al.
Published: (2025)
HipyrNet: Hypernet-Guided Feature Pyramid network for mixed-exposure correction
by: Rathore, Shaurya Singh, et al.
Published: (2025)
by: Rathore, Shaurya Singh, et al.
Published: (2025)
Controllable 3D Placement of Objects with Scene-Aware Diffusion Models
by: Omran, Mohamed, et al.
Published: (2025)
by: Omran, Mohamed, et al.
Published: (2025)
Similar Items
-
PromptArtisan: Multi-instruction Image Editing in Single Pass with Complete Attention Control
by: Swami, Kunal, et al.
Published: (2025) -
Insert In Style: A Zero-Shot Generative Framework for Harmonious Cross-Domain Object Composition
by: Chittersu, Raghu Vamsi, et al.
Published: (2025) -
Adjust Your Focus: Defocus Deblurring From Dual-Pixel Images Using Explicit Multi-Scale Cross-Correlation
by: Swami, Kunal
Published: (2025) -
Generating Part-Based Global Explanations Via Correspondence
by: Rathore, Kunal, et al.
Published: (2025) -
Freq-DP Net: A Dual-Branch Network for Fence Removal using Dual-Pixel and Fourier Priors
by: Swami, Kunal, et al.
Published: (2026)