:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Goswami, Ashish, Modi, Satyam Kumar, Deshineni, Santhosh Rishi, Singh, Harman, P, Prathosh A., Singla, Parag
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2412.06089
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Towards Scene Graph Anticipation
by: Peddi, Rohith, et al.
Published: (2024)

Towards Unbiased and Robust Spatio-Temporal Scene Graph Generation and Anticipation
by: Peddi, Rohith, et al.
Published: (2024)

TensoIS: A Step Towards Feed-Forward Tensorial Inverse Subsurface Scattering for Perlin Distributed Heterogeneous Media
by: Tiwari, Ashish, et al.
Published: (2025)

LangDAug: Langevin Data Augmentation for Multi-Source Domain Generalization in Medical Image Segmentation
by: Tiwary, Piyush, et al.
Published: (2025)

Towards Spatio-Temporal World Scene Graph Generation from Monocular Videos
by: Peddi, Rohith, et al.
Published: (2026)

BoxCell: Leveraging SAM for Cell Segmentation with Box Supervision
by: Tyagi, Aayush Kumar, et al.
Published: (2023)

GraPLUS: Graph-based Placement Using Semantics for Image Composition
by: Khaleghi, Mir Mohammad, et al.
Published: (2025)

Factual and Edit-Sensitive Graph-to-Sequence Generation via Graph-Aware Adaptive Noising
by: Shahane, Aditya Hemant, et al.
Published: (2026)

Learning Disentangled Representation in Object-Centric Models for Visual Dynamics Prediction via Transformers
by: Gandhi, Sanket, et al.
Published: (2024)

Generative AI for Enhanced Wildfire Detection: Bridging the Synthetic-Real Domain Gap
by: Gaba, Satyam
Published: (2025)

The Percept-V Challenge: Can Multimodal LLMs Crack Simple Perception Problems?
by: Ghosh, Samrajnee, et al.
Published: (2025)

MANTA: Physics-Informed Generalized Underwater Object Tracking
by: Srinath, Suhas, et al.
Published: (2025)

Adapt then Unlearn: Exploring Parameter Space Semantics for Unlearning in Generative Adversarial Networks
by: Tiwary, Piyush, et al.
Published: (2023)

A Diffusion-Driven Fine-Grained Nodule Synthesis Framework for Enhanced Lung Nodule Detection from Chest Radiographs
by: Goyal, Aryan, et al.
Published: (2026)

UnDIVE: Generalized Underwater Video Enhancement Using Generative Priors
by: Srinath, Suhas, et al.
Published: (2024)

Partially Blinded Unlearning: Class Unlearning for Deep Networks a Bayesian Perspective
by: Panda, Subhodip, et al.
Published: (2024)

GenSelfDiff-HIS: Generative Self-Supervision Using Diffusion for Histopathological Image Segmentation
by: Purma, Vishnuvardhan, et al.
Published: (2023)

ShapeGraFormer: GraFormer-Based Network for Hand-Object Reconstruction from a Single Depth Map
by: Aboukhadra, Ahmed Tawfik, et al.
Published: (2023)

GraCo: Granularity-Controllable Interactive Segmentation
by: Zhao, Yian, et al.
Published: (2024)

Optimal Kinematic Synthesis and Prototype Development of Knee Exoskeleton
by: Gautam, Shashank Mani, et al.
Published: (2024)

Asynchronous Perception Machine For Efficient Test-Time-Training
by: Modi, Rajat, et al.
Published: (2024)

GraSP-VLA: Graph-based Symbolic Action Representation for Long-Horizon Planning with VLA Policies
by: Neau, Maëlic, et al.
Published: (2025)

DriveIndia: An Object Detection Dataset for Diverse Indian Traffic Scenes
by: Kumar, Rishav, et al.
Published: (2025)

LunarDepthNet: Generation of Digital Elevation Models using Deep Learning and Monocular Satellite Images
by: Aadi, Aaranay, et al.
Published: (2026)

Improving Long-Tailed Object Detection with Balanced Group Softmax and Metric Learning
by: Gaba, Satyam
Published: (2025)

DeiTFake: Deepfake Detection Model using DeiT Multi-Stage Training
by: Kumar, Saksham, et al.
Published: (2025)

StegaVision: Enhancing Steganography with Attention Mechanism
by: Kumar, Abhinav, et al.
Published: (2024)

CoNO: Complex Neural Operator for Continous Dynamical Physical Systems
by: Tiwari, Karn, et al.
Published: (2024)

Neural Compound-Word (Sandhi) Generation and Splitting in Sanskrit Language
by: Dave, Sushant, et al.
Published: (2020)

Solar potential analysis over Indian cities using high-resolution satellite imagery and DEM
by: Singla, Jai
Published: (2024)

Step1X-Edit: A Practical Framework for General Image Editing
by: Liu, Shiyu, et al.
Published: (2025)

GraVoS: Voxel Selection for 3D Point-Cloud Detection
by: Shrout, Oren, et al.
Published: (2022)

GraFIQs: Face Image Quality Assessment Using Gradient Magnitudes
by: Kolf, Jan Niklas, et al.
Published: (2024)

Learning to Recover from Plan Execution Errors during Robot Manipulation: A Neuro-symbolic Approach
by: Kalithasan, Namasivayam, et al.
Published: (2024)

CityGuessr: City-Level Video Geo-Localization on a Global Scale
by: Kulkarni, Parth Parag, et al.
Published: (2024)

Tokenization Allows Multimodal Large Language Models to Understand, Generate and Edit Architectural Floor Plans
by: Qin, Sizhong, et al.
Published: (2026)

Counterfactual Edits for Generative Evaluation
by: Lymperaiou, Maria, et al.
Published: (2023)

On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes
by: Modi, Rajat, et al.
Published: (2024)

LoMOE: Localized Multi-Object Editing via Multi-Diffusion
by: Chakrabarty, Goirik, et al.
Published: (2024)

Modality Agnostic Efficient Long Range Encoder
by: Parag, Toufiq, et al.
Published: (2025)