Saved in:
| Main Authors: | Jaiswal, Abhishek, Srivastava, Nisheeth |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.11642 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Learning to Play Video Games with Intuitive Physics Priors
by: Jaiswal, Abhishek, et al.
Published: (2024)
by: Jaiswal, Abhishek, et al.
Published: (2024)
Real-Time Feedback and Benchmark Dataset for Isometric Pose Evaluation
by: Jaiswal, Abhishek, et al.
Published: (2025)
by: Jaiswal, Abhishek, et al.
Published: (2025)
Benchmarking Reliability of Deep Learning Models for Pathological Gait Classification
by: Jaiswal, Abhishek, et al.
Published: (2024)
by: Jaiswal, Abhishek, et al.
Published: (2024)
Style-based Clustering of Visual Artworks and the Play of Neural Style-Representations
by: Dangeti, Abhishek, et al.
Published: (2024)
by: Dangeti, Abhishek, et al.
Published: (2024)
Pain in 3D: Generating Controllable Synthetic Faces for Automated Pain Assessment
by: Lin, Xin Lei, et al.
Published: (2025)
by: Lin, Xin Lei, et al.
Published: (2025)
SitPose: Real-Time Detection of Sitting Posture and Sedentary Behavior Using Ensemble Learning With Depth Sensor
by: Jin, Hang, et al.
Published: (2024)
by: Jin, Hang, et al.
Published: (2024)
DIAL: Decoupling Intent and Action via Latent World Modeling for End-to-End VLA
by: Chen, Yi, et al.
Published: (2026)
by: Chen, Yi, et al.
Published: (2026)
CART: Compositional Auto-Regressive Transformer for Image Generation
by: Roheda, Siddharth, et al.
Published: (2024)
by: Roheda, Siddharth, et al.
Published: (2024)
Few-shot multi-token DreamBooth with LoRa for style-consistent character generation
by: Pascual, Ruben, et al.
Published: (2025)
by: Pascual, Ruben, et al.
Published: (2025)
Value-Driven Mixed-Precision Quantization for Patch-Based Inference on Microcontrollers
by: Tao, Wei, et al.
Published: (2024)
by: Tao, Wei, et al.
Published: (2024)
Research on Driver Facial Fatigue Detection Based on Yolov8 Model
by: Zhou, Chang, et al.
Published: (2024)
by: Zhou, Chang, et al.
Published: (2024)
Plug and Play Active Learning for Object Detection
by: Yang, Chenhongyi, et al.
Published: (2022)
by: Yang, Chenhongyi, et al.
Published: (2022)
Rate-In: Information-Driven Adaptive Dropout Rates for Improved Inference-Time Uncertainty Estimation
by: Zeevi, Tal, et al.
Published: (2024)
by: Zeevi, Tal, et al.
Published: (2024)
Skeleton-Based Posture Classification to Promote Safer Walker-Assisted Gait in Older Adults
by: M., Sergio D. Sierra, et al.
Published: (2026)
by: M., Sergio D. Sierra, et al.
Published: (2026)
Pavement Fatigue Crack Detection and Severity Classification Based on Convolutional Neural Network
by: Wang, Zhen, et al.
Published: (2024)
by: Wang, Zhen, et al.
Published: (2024)
Momentum Guidance: Plug-and-Play Guidance for Flow Models
by: Liao, Runlong, et al.
Published: (2026)
by: Liao, Runlong, et al.
Published: (2026)
From Intent to Execution: Multimodal Chain-of-Thought Reinforcement Learning for Precise CAD Code Generation
by: Niu, Ke, et al.
Published: (2025)
by: Niu, Ke, et al.
Published: (2025)
Adaptive Moments are Surprisingly Effective for Plug-and-Play Diffusion Sampling
by: Belardi, Christian, et al.
Published: (2026)
by: Belardi, Christian, et al.
Published: (2026)
Playing the network backward: A Game Theoretic Attribution Framework
by: Zimmermann, Jakob Paul, et al.
Published: (2026)
by: Zimmermann, Jakob Paul, et al.
Published: (2026)
Action Dubber: Timing Audible Actions via Inflectional Flow
by: Wan, Wenlong, et al.
Published: (2025)
by: Wan, Wenlong, et al.
Published: (2025)
Plug-and-Play Image Restoration with Flow Matching: A Continuous Viewpoint
by: Jia, Fan, et al.
Published: (2025)
by: Jia, Fan, et al.
Published: (2025)
Leveraging Data to Say No: Memory Augmented Plug-and-Play Selective Prediction
by: Sarkar, Aditya, et al.
Published: (2026)
by: Sarkar, Aditya, et al.
Published: (2026)
PnP-Flow: Plug-and-Play Image Restoration with Flow Matching
by: Martin, Ségolène, et al.
Published: (2024)
by: Martin, Ségolène, et al.
Published: (2024)
Multi-class Seismic Building Damage Assessment from InSAR Imagery using Quadratic Variational Causal Bayesian Inference
by: Li, Xuechun, et al.
Published: (2025)
by: Li, Xuechun, et al.
Published: (2025)
Transfer Learning with Point Transformers
by: Gupta, Kartik, et al.
Published: (2024)
by: Gupta, Kartik, et al.
Published: (2024)
Learning to Drive via Asymmetric Self-Play
by: Zhang, Chris, et al.
Published: (2024)
by: Zhang, Chris, et al.
Published: (2024)
Multi-State-Action Tokenisation in Decision Transformers for Multi-Discrete Action Spaces
by: Moodley, Perusha, et al.
Published: (2024)
by: Moodley, Perusha, et al.
Published: (2024)
Investigating the Quality of DermaMNIST and Fitzpatrick17k Dermatological Image Datasets
by: Abhishek, Kumar, et al.
Published: (2024)
by: Abhishek, Kumar, et al.
Published: (2024)
Zero-Shot Visual Reasoning by Vision-Language Models: Benchmarking and Analysis
by: Nagar, Aishik, et al.
Published: (2024)
by: Nagar, Aishik, et al.
Published: (2024)
Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action Environments
by: Guruprasad, Pranav, et al.
Published: (2025)
by: Guruprasad, Pranav, et al.
Published: (2025)
VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance
by: Srivastava, Divyansh, et al.
Published: (2024)
by: Srivastava, Divyansh, et al.
Published: (2024)
SAIP: A Plug-and-Play Scale-adaptive Module in Diffusion-based Inverse Problems
by: Wang, Lingyu, et al.
Published: (2025)
by: Wang, Lingyu, et al.
Published: (2025)
Self-supervised Deep Hyperspectral Inpainting with the Plug and Play and Deep Image Prior Models
by: Li, Shuo, et al.
Published: (2025)
by: Li, Shuo, et al.
Published: (2025)
Toward Real-World Adoption of Portrait Relighting via Hybrid Domain Knowledge Fusion
by: Huang, Qian, et al.
Published: (2026)
by: Huang, Qian, et al.
Published: (2026)
Forget Vectors at Play: Universal Input Perturbations Driving Machine Unlearning in Image Classification
by: Sun, Changchang, et al.
Published: (2024)
by: Sun, Changchang, et al.
Published: (2024)
Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios
by: Jaiswal, Shantanu, et al.
Published: (2024)
by: Jaiswal, Shantanu, et al.
Published: (2024)
ZigMa: A DiT-style Zigzag Mamba Diffusion Model
by: Hu, Vincent Tao, et al.
Published: (2024)
by: Hu, Vincent Tao, et al.
Published: (2024)
Smart Pressure e-Mat for Human Sleeping Posture and Dynamic Activity Recognition
by: Yuan, Liangqi, et al.
Published: (2023)
by: Yuan, Liangqi, et al.
Published: (2023)
Taylor Videos for Action Recognition
by: Wang, Lei, et al.
Published: (2024)
by: Wang, Lei, et al.
Published: (2024)
ConceptMix++: Leveling the Playing Field in Text-to-Image Benchmarking via Iterative Prompt Optimization
by: Gan, Haosheng, et al.
Published: (2025)
by: Gan, Haosheng, et al.
Published: (2025)
Similar Items
-
Learning to Play Video Games with Intuitive Physics Priors
by: Jaiswal, Abhishek, et al.
Published: (2024) -
Real-Time Feedback and Benchmark Dataset for Isometric Pose Evaluation
by: Jaiswal, Abhishek, et al.
Published: (2025) -
Benchmarking Reliability of Deep Learning Models for Pathological Gait Classification
by: Jaiswal, Abhishek, et al.
Published: (2024) -
Style-based Clustering of Visual Artworks and the Play of Neural Style-Representations
by: Dangeti, Abhishek, et al.
Published: (2024) -
Pain in 3D: Generating Controllable Synthetic Faces for Automated Pain Assessment
by: Lin, Xin Lei, et al.
Published: (2025)