:: Library Catalog

Bibliographic Details
Main Authors:	Jaiswal, Abhishek, Srivastava, Nisheeth
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Machine Learning
Online Access:	https://arxiv.org/abs/2507.11642

Similar Items

Learning to Play Video Games with Intuitive Physics Priors
by: Jaiswal, Abhishek, et al.
Published: (2024)

Real-Time Feedback and Benchmark Dataset for Isometric Pose Evaluation
by: Jaiswal, Abhishek, et al.
Published: (2025)

Benchmarking Reliability of Deep Learning Models for Pathological Gait Classification
by: Jaiswal, Abhishek, et al.
Published: (2024)

Style-based Clustering of Visual Artworks and the Play of Neural Style-Representations
by: Dangeti, Abhishek, et al.
Published: (2024)

Pain in 3D: Generating Controllable Synthetic Faces for Automated Pain Assessment
by: Lin, Xin Lei, et al.
Published: (2025)

SitPose: Real-Time Detection of Sitting Posture and Sedentary Behavior Using Ensemble Learning With Depth Sensor
by: Jin, Hang, et al.
Published: (2024)

DIAL: Decoupling Intent and Action via Latent World Modeling for End-to-End VLA
by: Chen, Yi, et al.
Published: (2026)

CART: Compositional Auto-Regressive Transformer for Image Generation
by: Roheda, Siddharth, et al.
Published: (2024)

Few-shot multi-token DreamBooth with LoRa for style-consistent character generation
by: Pascual, Ruben, et al.
Published: (2025)

Value-Driven Mixed-Precision Quantization for Patch-Based Inference on Microcontrollers
by: Tao, Wei, et al.
Published: (2024)

Research on Driver Facial Fatigue Detection Based on Yolov8 Model
by: Zhou, Chang, et al.
Published: (2024)

Plug and Play Active Learning for Object Detection
by: Yang, Chenhongyi, et al.
Published: (2022)

Rate-In: Information-Driven Adaptive Dropout Rates for Improved Inference-Time Uncertainty Estimation
by: Zeevi, Tal, et al.
Published: (2024)

Skeleton-Based Posture Classification to Promote Safer Walker-Assisted Gait in Older Adults
by: M., Sergio D. Sierra, et al.
Published: (2026)

Pavement Fatigue Crack Detection and Severity Classification Based on Convolutional Neural Network
by: Wang, Zhen, et al.
Published: (2024)

Momentum Guidance: Plug-and-Play Guidance for Flow Models
by: Liao, Runlong, et al.
Published: (2026)

From Intent to Execution: Multimodal Chain-of-Thought Reinforcement Learning for Precise CAD Code Generation
by: Niu, Ke, et al.
Published: (2025)

Adaptive Moments are Surprisingly Effective for Plug-and-Play Diffusion Sampling
by: Belardi, Christian, et al.
Published: (2026)

Playing the network backward: A Game Theoretic Attribution Framework
by: Zimmermann, Jakob Paul, et al.
Published: (2026)

Action Dubber: Timing Audible Actions via Inflectional Flow
by: Wan, Wenlong, et al.
Published: (2025)

Plug-and-Play Image Restoration with Flow Matching: A Continuous Viewpoint
by: Jia, Fan, et al.
Published: (2025)

Leveraging Data to Say No: Memory Augmented Plug-and-Play Selective Prediction
by: Sarkar, Aditya, et al.
Published: (2026)

PnP-Flow: Plug-and-Play Image Restoration with Flow Matching
by: Martin, Ségolène, et al.
Published: (2024)

Multi-class Seismic Building Damage Assessment from InSAR Imagery using Quadratic Variational Causal Bayesian Inference
by: Li, Xuechun, et al.
Published: (2025)

Transfer Learning with Point Transformers
by: Gupta, Kartik, et al.
Published: (2024)

Learning to Drive via Asymmetric Self-Play
by: Zhang, Chris, et al.
Published: (2024)

Multi-State-Action Tokenisation in Decision Transformers for Multi-Discrete Action Spaces
by: Moodley, Perusha, et al.
Published: (2024)

Investigating the Quality of DermaMNIST and Fitzpatrick17k Dermatological Image Datasets
by: Abhishek, Kumar, et al.
Published: (2024)

Zero-Shot Visual Reasoning by Vision-Language Models: Benchmarking and Analysis
by: Nagar, Aishik, et al.
Published: (2024)

Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action Environments
by: Guruprasad, Pranav, et al.
Published: (2025)

VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance
by: Srivastava, Divyansh, et al.
Published: (2024)

SAIP: A Plug-and-Play Scale-adaptive Module in Diffusion-based Inverse Problems
by: Wang, Lingyu, et al.
Published: (2025)

Self-supervised Deep Hyperspectral Inpainting with the Plug and Play and Deep Image Prior Models
by: Li, Shuo, et al.
Published: (2025)

Toward Real-World Adoption of Portrait Relighting via Hybrid Domain Knowledge Fusion
by: Huang, Qian, et al.
Published: (2026)

Forget Vectors at Play: Universal Input Perturbations Driving Machine Unlearning in Image Classification
by: Sun, Changchang, et al.
Published: (2024)

Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios
by: Jaiswal, Shantanu, et al.
Published: (2024)

ZigMa: A DiT-style Zigzag Mamba Diffusion Model
by: Hu, Vincent Tao, et al.
Published: (2024)

Smart Pressure e-Mat for Human Sleeping Posture and Dynamic Activity Recognition
by: Yuan, Liangqi, et al.
Published: (2023)

Taylor Videos for Action Recognition
by: Wang, Lei, et al.
Published: (2024)

ConceptMix++: Leveling the Playing Field in Text-to-Image Benchmarking via Iterative Prompt Optimization
by: Gan, Haosheng, et al.
Published: (2025)