Bibliographic Details
Main Authors:	Setu, Jyotirmay Nag; Desai, Kevin; Quarles, John
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2510.10422

Similar Items

Mazed and Confused: A Dataset of Cybersickness, Working Memory, Mental Load, Physical Load, and Attention During a Real Walking Task in VR
by: Setu, Jyotirmay Nag, et al.
Published: (2024)

PMPNet: Pixel Movement Prediction Network for Monocular Depth Estimation in Dynamic Scenes
by: Peng, Kebin, et al.
Published: (2024)

AG-EgoPose: Leveraging Action-Guided Motion and Kinematic Joint Encoding for Egocentric 3D Pose Estimation
by: Azam, Md Mushfiqur, et al.
Published: (2026)

TempGlitch: Evaluating Vision-Language Models for Temporal Glitch Detection in Gameplay Videos
by: Yu, Yakun, et al.
Published: (2026)

PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos
by: Cao, Meng, et al.
Published: (2024)

Gameplay Highlights Generation
by: Edithal, Vignesh, et al.
Published: (2025)

Order from Chaos: Physical World Understanding from Glitchy Gameplay Videos
by: Cao, Meng, et al.
Published: (2026)

Towards Consumer-Grade Cybersickness Prediction: Multi-Model Alignment for Real-Time Vision-Only Inference
by: Zhu, Yitong, et al.
Published: (2025)

TRACE: Temporal Radiology with Anatomical Change Explanation for Grounded X-ray Report Generation
by: Aranya, OFM Riaz Rahman, et al.
Published: (2026)

Learning Transferable Temporal Primitives for Video Reasoning via Synthetic Videos
by: Jiang, Songtao, et al.
Published: (2026)

SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration
by: Wang, Jianyi, et al.
Published: (2025)

Revealing Human Attention Patterns from Gameplay Analysis for Reinforcement Learning
by: Krauss, Henrik, et al.
Published: (2025)

$R^2$-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding
by: Liu, Ye, et al.
Published: (2024)

CollabVR: Collaborative Video Reasoning with Vision-Language and Video Generation Models
by: Kim, Joowon, et al.
Published: (2026)

Temporal Reasoning Transfer from Text to Video
by: Li, Lei, et al.
Published: (2024)

MoA-VR: A Mixture-of-Agents System Towards All-in-One Video Restoration
by: Liu, Lu, et al.
Published: (2025)

Deep Brain Net: An Optimized Deep Learning Model for Brain tumor Detection in MRI Images Using EfficientNetB0 and ResNet50 with Transfer Learning
by: Onah, Daniel, et al.
Published: (2025)

Concepts in Motion: Temporal Concept Bottleneck Model for Interpretable Video Classification
by: Knab, Patrick, et al.
Published: (2025)

Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding
by: Nguyen, Thong, et al.
Published: (2025)

VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning
by: Wang, Qunzhong, et al.
Published: (2025)

Yoga Pose Classification Using Transfer Learning
by: Akash, M. M., et al.
Published: (2024)

EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models
by: Cai, Yufei, et al.
Published: (2025)

Vivid-VR: Distilling Concepts from Text-to-Video Diffusion Transformer for Photorealistic Video Restoration
by: Bai, Haoran, et al.
Published: (2025)

Enhancing Cocoa Pod Disease Classification via Transfer Learning and Ensemble Methods: Toward Robust Predictive Modeling
by: Anduyan, Devina, et al.
Published: (2025)

Find Matching Faces Based On Face Parameters
by: Bhatt, Setu A., et al.
Published: (2025)

Temporal Contrastive Learning for Video Temporal Reasoning in Large Vision-Language Models
by: Souza, Rafael, et al.
Published: (2024)

Temporal Feature Weaving for Neonatal Echocardiographic Viewpoint Video Classification
by: French, Satchel, et al.
Published: (2025)

CoVR-R: Reason-Aware Composed Video Retrieval
by: Thawakar, Omkar, et al.
Published: (2026)

Pixels to Play: A Foundation Model for 3D Gameplay
by: Yue, Yuguang, et al.
Published: (2025)

Classification of Geographical Land Structure Using Convolution Neural Network and Transfer Learning
by: Zaid, Mustafa M. Abd, et al.
Published: (2024)

A Survey on 3D Egocentric Human Pose Estimation
by: Azam, Md Mushfiqur, et al.
Published: (2024)

Efficient Transfer Learning for Video-language Foundation Models
by: Chen, Haoxing, et al.
Published: (2024)

Towards Long-Form Spatio-Temporal Video Grounding
by: Gu, Xin, et al.
Published: (2026)

Object-WIPER: Training-Free Object and Associated Effect Removal in Videos
by: Kushwaha, Saksham Singh, et al.
Published: (2026)

CollideNet: Hierarchical Multi-scale Video Representation Learning with Disentanglement for Time-To-Collision Forecasting
by: Desai, Nishq Poorav, et al.
Published: (2026)

ThermalTap: Passive Application Fingerprinting in VR Headsets via Thermal Side Channels
by: Akram, Mahsin Bin, et al.
Published: (2026)

Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding
by: Wasim, Syed Talal, et al.
Published: (2023)

ImViD: Immersive Volumetric Videos for Enhanced VR Engagement
by: Yang, Zhengxian, et al.
Published: (2025)

CoVR-2: Automatic Data Construction for Composed Video Retrieval
by: Ventura, Lucas, et al.
Published: (2023)

Learning Temporally Consistent Video Depth from Video Diffusion Priors
by: Shao, Jiahao, et al.
Published: (2024)