Saved in:
| Main Authors: | Setu, Jyotirmay Nag, Desai, Kevin, Quarles, John |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.10422 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Mazed and Confused: A Dataset of Cybersickness, Working Memory, Mental Load, Physical Load, and Attention During a Real Walking Task in VR
by: Setu, Jyotirmay Nag, et al.
Published: (2024)
by: Setu, Jyotirmay Nag, et al.
Published: (2024)
PMPNet: Pixel Movement Prediction Network for Monocular Depth Estimation in Dynamic Scenes
by: Peng, Kebin, et al.
Published: (2024)
by: Peng, Kebin, et al.
Published: (2024)
AG-EgoPose: Leveraging Action-Guided Motion and Kinematic Joint Encoding for Egocentric 3D Pose Estimation
by: Azam, Md Mushfiqur, et al.
Published: (2026)
by: Azam, Md Mushfiqur, et al.
Published: (2026)
TempGlitch: Evaluating Vision-Language Models for Temporal Glitch Detection in Gameplay Videos
by: Yu, Yakun, et al.
Published: (2026)
by: Yu, Yakun, et al.
Published: (2026)
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos
by: Cao, Meng, et al.
Published: (2024)
by: Cao, Meng, et al.
Published: (2024)
Gameplay Highlights Generation
by: Edithal, Vignesh, et al.
Published: (2025)
by: Edithal, Vignesh, et al.
Published: (2025)
Order from Chaos: Physical World Understanding from Glitchy Gameplay Videos
by: Cao, Meng, et al.
Published: (2026)
by: Cao, Meng, et al.
Published: (2026)
Towards Consumer-Grade Cybersickness Prediction: Multi-Model Alignment for Real-Time Vision-Only Inference
by: Zhu, Yitong, et al.
Published: (2025)
by: Zhu, Yitong, et al.
Published: (2025)
TRACE: Temporal Radiology with Anatomical Change Explanation for Grounded X-ray Report Generation
by: Aranya, OFM Riaz Rahman, et al.
Published: (2026)
by: Aranya, OFM Riaz Rahman, et al.
Published: (2026)
Learning Transferable Temporal Primitives for Video Reasoning via Synthetic Videos
by: Jiang, Songtao, et al.
Published: (2026)
by: Jiang, Songtao, et al.
Published: (2026)
SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration
by: Wang, Jianyi, et al.
Published: (2025)
by: Wang, Jianyi, et al.
Published: (2025)
Revealing Human Attention Patterns from Gameplay Analysis for Reinforcement Learning
by: Krauss, Henrik, et al.
Published: (2025)
by: Krauss, Henrik, et al.
Published: (2025)
$R^2$-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding
by: Liu, Ye, et al.
Published: (2024)
by: Liu, Ye, et al.
Published: (2024)
CollabVR: Collaborative Video Reasoning with Vision-Language and Video Generation Models
by: Kim, Joowon, et al.
Published: (2026)
by: Kim, Joowon, et al.
Published: (2026)
Temporal Reasoning Transfer from Text to Video
by: Li, Lei, et al.
Published: (2024)
by: Li, Lei, et al.
Published: (2024)
MoA-VR: A Mixture-of-Agents System Towards All-in-One Video Restoration
by: Liu, Lu, et al.
Published: (2025)
by: Liu, Lu, et al.
Published: (2025)
Deep Brain Net: An Optimized Deep Learning Model for Brain tumor Detection in MRI Images Using EfficientNetB0 and ResNet50 with Transfer Learning
by: Onah, Daniel, et al.
Published: (2025)
by: Onah, Daniel, et al.
Published: (2025)
Concepts in Motion: Temporal Concept Bottleneck Model for Interpretable Video Classification
by: Knab, Patrick, et al.
Published: (2025)
by: Knab, Patrick, et al.
Published: (2025)
Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding
by: Nguyen, Thong, et al.
Published: (2025)
by: Nguyen, Thong, et al.
Published: (2025)
VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning
by: Wang, Qunzhong, et al.
Published: (2025)
by: Wang, Qunzhong, et al.
Published: (2025)
Yoga Pose Classification Using Transfer Learning
by: Akash, M. M., et al.
Published: (2024)
by: Akash, M. M., et al.
Published: (2024)
EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models
by: Cai, Yufei, et al.
Published: (2025)
by: Cai, Yufei, et al.
Published: (2025)
Vivid-VR: Distilling Concepts from Text-to-Video Diffusion Transformer for Photorealistic Video Restoration
by: Bai, Haoran, et al.
Published: (2025)
by: Bai, Haoran, et al.
Published: (2025)
Enhancing Cocoa Pod Disease Classification via Transfer Learning and Ensemble Methods: Toward Robust Predictive Modeling
by: Anduyan, Devina, et al.
Published: (2025)
by: Anduyan, Devina, et al.
Published: (2025)
Find Matching Faces Based On Face Parameters
by: Bhatt, Setu A., et al.
Published: (2025)
by: Bhatt, Setu A., et al.
Published: (2025)
Temporal Contrastive Learning for Video Temporal Reasoning in Large Vision-Language Models
by: Souza, Rafael, et al.
Published: (2024)
by: Souza, Rafael, et al.
Published: (2024)
Temporal Feature Weaving for Neonatal Echocardiographic Viewpoint Video Classification
by: French, Satchel, et al.
Published: (2025)
by: French, Satchel, et al.
Published: (2025)
CoVR-R:Reason-Aware Composed Video Retrieval
by: Thawakar, Omkar, et al.
Published: (2026)
by: Thawakar, Omkar, et al.
Published: (2026)
Pixels to Play: A Foundation Model for 3D Gameplay
by: Yue, Yuguang, et al.
Published: (2025)
by: Yue, Yuguang, et al.
Published: (2025)
Classification of Geographical Land Structure Using Convolution Neural Network and Transfer Learning
by: Zaid, Mustafa M. Abd, et al.
Published: (2024)
by: Zaid, Mustafa M. Abd, et al.
Published: (2024)
A Survey on 3D Egocentric Human Pose Estimation
by: Azam, Md Mushfiqur, et al.
Published: (2024)
by: Azam, Md Mushfiqur, et al.
Published: (2024)
Efficient Transfer Learning for Video-language Foundation Models
by: Chen, Haoxing, et al.
Published: (2024)
by: Chen, Haoxing, et al.
Published: (2024)
Towards Long-Form Spatio-Temporal Video Grounding
by: Gu, Xin, et al.
Published: (2026)
by: Gu, Xin, et al.
Published: (2026)
Object-WIPER : Training-Free Object and Associated Effect Removal in Videos
by: Kushwaha, Saksham Singh, et al.
Published: (2026)
by: Kushwaha, Saksham Singh, et al.
Published: (2026)
CollideNet: Hierarchical Multi-scale Video Representation Learning with Disentanglement for Time-To-Collision Forecasting
by: Desai, Nishq Poorav, et al.
Published: (2026)
by: Desai, Nishq Poorav, et al.
Published: (2026)
ThermalTap: Passive Application Fingerprinting in VR Headsets via Thermal Side Channels
by: Akram, Mahsin Bin, et al.
Published: (2026)
by: Akram, Mahsin Bin, et al.
Published: (2026)
Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding
by: Wasim, Syed Talal, et al.
Published: (2023)
by: Wasim, Syed Talal, et al.
Published: (2023)
ImViD: Immersive Volumetric Videos for Enhanced VR Engagement
by: Yang, Zhengxian, et al.
Published: (2025)
by: Yang, Zhengxian, et al.
Published: (2025)
CoVR-2: Automatic Data Construction for Composed Video Retrieval
by: Ventura, Lucas, et al.
Published: (2023)
by: Ventura, Lucas, et al.
Published: (2023)
Learning Temporally Consistent Video Depth from Video Diffusion Priors
by: Shao, Jiahao, et al.
Published: (2024)
by: Shao, Jiahao, et al.
Published: (2024)
Similar Items
-
Mazed and Confused: A Dataset of Cybersickness, Working Memory, Mental Load, Physical Load, and Attention During a Real Walking Task in VR
by: Setu, Jyotirmay Nag, et al.
Published: (2024) -
PMPNet: Pixel Movement Prediction Network for Monocular Depth Estimation in Dynamic Scenes
by: Peng, Kebin, et al.
Published: (2024) -
AG-EgoPose: Leveraging Action-Guided Motion and Kinematic Joint Encoding for Egocentric 3D Pose Estimation
by: Azam, Md Mushfiqur, et al.
Published: (2026) -
TempGlitch: Evaluating Vision-Language Models for Temporal Glitch Detection in Gameplay Videos
by: Yu, Yakun, et al.
Published: (2026) -
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos
by: Cao, Meng, et al.
Published: (2024)