Saved in:
| Main Authors: | Goswami, Ashish, Modi, Satyam Kumar, Deshineni, Santhosh Rishi, Singh, Harman, P, Prathosh A., Singla, Parag |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.06089 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards Scene Graph Anticipation
by: Peddi, Rohith, et al.
Published: (2024)
by: Peddi, Rohith, et al.
Published: (2024)
Towards Unbiased and Robust Spatio-Temporal Scene Graph Generation and Anticipation
by: Peddi, Rohith, et al.
Published: (2024)
by: Peddi, Rohith, et al.
Published: (2024)
TensoIS: A Step Towards Feed-Forward Tensorial Inverse Subsurface Scattering for Perlin Distributed Heterogeneous Media
by: Tiwari, Ashish, et al.
Published: (2025)
by: Tiwari, Ashish, et al.
Published: (2025)
LangDAug: Langevin Data Augmentation for Multi-Source Domain Generalization in Medical Image Segmentation
by: Tiwary, Piyush, et al.
Published: (2025)
by: Tiwary, Piyush, et al.
Published: (2025)
Towards Spatio-Temporal World Scene Graph Generation from Monocular Videos
by: Peddi, Rohith, et al.
Published: (2026)
by: Peddi, Rohith, et al.
Published: (2026)
BoxCell: Leveraging SAM for Cell Segmentation with Box Supervision
by: Tyagi, Aayush Kumar, et al.
Published: (2023)
by: Tyagi, Aayush Kumar, et al.
Published: (2023)
GraPLUS: Graph-based Placement Using Semantics for Image Composition
by: Khaleghi, Mir Mohammad, et al.
Published: (2025)
by: Khaleghi, Mir Mohammad, et al.
Published: (2025)
Factual and Edit-Sensitive Graph-to-Sequence Generation via Graph-Aware Adaptive Noising
by: Shahane, Aditya Hemant, et al.
Published: (2026)
by: Shahane, Aditya Hemant, et al.
Published: (2026)
Learning Disentangled Representation in Object-Centric Models for Visual Dynamics Prediction via Transformers
by: Gandhi, Sanket, et al.
Published: (2024)
by: Gandhi, Sanket, et al.
Published: (2024)
Generative AI for Enhanced Wildfire Detection: Bridging the Synthetic-Real Domain Gap
by: Gaba, Satyam
Published: (2025)
by: Gaba, Satyam
Published: (2025)
The Percept-V Challenge: Can Multimodal LLMs Crack Simple Perception Problems?
by: Ghosh, Samrajnee, et al.
Published: (2025)
by: Ghosh, Samrajnee, et al.
Published: (2025)
MANTA: Physics-Informed Generalized Underwater Object Tracking
by: Srinath, Suhas, et al.
Published: (2025)
by: Srinath, Suhas, et al.
Published: (2025)
Adapt then Unlearn: Exploring Parameter Space Semantics for Unlearning in Generative Adversarial Networks
by: Tiwary, Piyush, et al.
Published: (2023)
by: Tiwary, Piyush, et al.
Published: (2023)
A Diffusion-Driven Fine-Grained Nodule Synthesis Framework for Enhanced Lung Nodule Detection from Chest Radiographs
by: Goyal, Aryan, et al.
Published: (2026)
by: Goyal, Aryan, et al.
Published: (2026)
UnDIVE: Generalized Underwater Video Enhancement Using Generative Priors
by: Srinath, Suhas, et al.
Published: (2024)
by: Srinath, Suhas, et al.
Published: (2024)
Partially Blinded Unlearning: Class Unlearning for Deep Networks a Bayesian Perspective
by: Panda, Subhodip, et al.
Published: (2024)
by: Panda, Subhodip, et al.
Published: (2024)
GenSelfDiff-HIS: Generative Self-Supervision Using Diffusion for Histopathological Image Segmentation
by: Purma, Vishnuvardhan, et al.
Published: (2023)
by: Purma, Vishnuvardhan, et al.
Published: (2023)
ShapeGraFormer: GraFormer-Based Network for Hand-Object Reconstruction from a Single Depth Map
by: Aboukhadra, Ahmed Tawfik, et al.
Published: (2023)
by: Aboukhadra, Ahmed Tawfik, et al.
Published: (2023)
GraCo: Granularity-Controllable Interactive Segmentation
by: Zhao, Yian, et al.
Published: (2024)
by: Zhao, Yian, et al.
Published: (2024)
Optimal Kinematic Synthesis and Prototype Development of Knee Exoskeleton
by: Gautam, Shashank Mani, et al.
Published: (2024)
by: Gautam, Shashank Mani, et al.
Published: (2024)
Asynchronous Perception Machine For Efficient Test-Time-Training
by: Modi, Rajat, et al.
Published: (2024)
by: Modi, Rajat, et al.
Published: (2024)
GraSP-VLA: Graph-based Symbolic Action Representation for Long-Horizon Planning with VLA Policies
by: Neau, Maëlic, et al.
Published: (2025)
by: Neau, Maëlic, et al.
Published: (2025)
DriveIndia: An Object Detection Dataset for Diverse Indian Traffic Scenes
by: Kumar, Rishav, et al.
Published: (2025)
by: Kumar, Rishav, et al.
Published: (2025)
LunarDepthNet: Generation of Digital Elevation Models using Deep Learning and Monocular Satellite Images
by: Aadi, Aaranay, et al.
Published: (2026)
by: Aadi, Aaranay, et al.
Published: (2026)
Improving Long-Tailed Object Detection with Balanced Group Softmax and Metric Learning
by: Gaba, Satyam
Published: (2025)
by: Gaba, Satyam
Published: (2025)
DeiTFake: Deepfake Detection Model using DeiT Multi-Stage Training
by: Kumar, Saksham, et al.
Published: (2025)
by: Kumar, Saksham, et al.
Published: (2025)
StegaVision: Enhancing Steganography with Attention Mechanism
by: Kumar, Abhinav, et al.
Published: (2024)
by: Kumar, Abhinav, et al.
Published: (2024)
CoNO: Complex Neural Operator for Continous Dynamical Physical Systems
by: Tiwari, Karn, et al.
Published: (2024)
by: Tiwari, Karn, et al.
Published: (2024)
Neural Compound-Word (Sandhi) Generation and Splitting in Sanskrit Language
by: Dave, Sushant, et al.
Published: (2020)
by: Dave, Sushant, et al.
Published: (2020)
Solar potential analysis over Indian cities using high-resolution satellite imagery and DEM
by: Singla, Jai
Published: (2024)
by: Singla, Jai
Published: (2024)
Step1X-Edit: A Practical Framework for General Image Editing
by: Liu, Shiyu, et al.
Published: (2025)
by: Liu, Shiyu, et al.
Published: (2025)
GraVoS: Voxel Selection for 3D Point-Cloud Detection
by: Shrout, Oren, et al.
Published: (2022)
by: Shrout, Oren, et al.
Published: (2022)
GraFIQs: Face Image Quality Assessment Using Gradient Magnitudes
by: Kolf, Jan Niklas, et al.
Published: (2024)
by: Kolf, Jan Niklas, et al.
Published: (2024)
Learning to Recover from Plan Execution Errors during Robot Manipulation: A Neuro-symbolic Approach
by: Kalithasan, Namasivayam, et al.
Published: (2024)
by: Kalithasan, Namasivayam, et al.
Published: (2024)
CityGuessr: City-Level Video Geo-Localization on a Global Scale
by: Kulkarni, Parth Parag, et al.
Published: (2024)
by: Kulkarni, Parth Parag, et al.
Published: (2024)
Tokenization Allows Multimodal Large Language Models to Understand, Generate and Edit Architectural Floor Plans
by: Qin, Sizhong, et al.
Published: (2026)
by: Qin, Sizhong, et al.
Published: (2026)
Counterfactual Edits for Generative Evaluation
by: Lymperaiou, Maria, et al.
Published: (2023)
by: Lymperaiou, Maria, et al.
Published: (2023)
On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes
by: Modi, Rajat, et al.
Published: (2024)
by: Modi, Rajat, et al.
Published: (2024)
LoMOE: Localized Multi-Object Editing via Multi-Diffusion
by: Chakrabarty, Goirik, et al.
Published: (2024)
by: Chakrabarty, Goirik, et al.
Published: (2024)
Modality Agnostic Efficient Long Range Encoder
by: Parag, Toufiq, et al.
Published: (2025)
by: Parag, Toufiq, et al.
Published: (2025)
Similar Items
-
Towards Scene Graph Anticipation
by: Peddi, Rohith, et al.
Published: (2024) -
Towards Unbiased and Robust Spatio-Temporal Scene Graph Generation and Anticipation
by: Peddi, Rohith, et al.
Published: (2024) -
TensoIS: A Step Towards Feed-Forward Tensorial Inverse Subsurface Scattering for Perlin Distributed Heterogeneous Media
by: Tiwari, Ashish, et al.
Published: (2025) -
LangDAug: Langevin Data Augmentation for Multi-Source Domain Generalization in Medical Image Segmentation
by: Tiwary, Piyush, et al.
Published: (2025) -
Towards Spatio-Temporal World Scene Graph Generation from Monocular Videos
by: Peddi, Rohith, et al.
Published: (2026)