Saved in:
| Main Authors: | Vellenga, Koen, Steinhauer, H. Joe, Andersson, Jonas, Sjögren, Anders |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.05006 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Last Layer Hamiltonian Monte Carlo
by: Vellenga, Koen, et al.
Published: (2025)
by: Vellenga, Koen, et al.
Published: (2025)
Taylor Videos for Action Recognition
by: Wang, Lei, et al.
Published: (2024)
by: Wang, Lei, et al.
Published: (2024)
Video RWKV:Video Action Recognition Based RWKV
by: Yin, Zhuowen, et al.
Published: (2024)
by: Yin, Zhuowen, et al.
Published: (2024)
CM2-Net: Continual Cross-Modal Mapping Network for Driver Action Recognition
by: Wang, Ruoyu, et al.
Published: (2024)
by: Wang, Ruoyu, et al.
Published: (2024)
Uncertainties of Latent Representations in Computer Vision
by: Kirchhof, Michael
Published: (2024)
by: Kirchhof, Michael
Published: (2024)
Latent Action Pretraining from Videos
by: Ye, Seonghyeon, et al.
Published: (2024)
by: Ye, Seonghyeon, et al.
Published: (2024)
EZ-CLIP: Efficient Zeroshot Video Action Recognition
by: Ahmad, Shahzad, et al.
Published: (2023)
by: Ahmad, Shahzad, et al.
Published: (2023)
Midway Network: Learning Representations for Recognition and Motion from Latent Dynamics
by: Hoang, Christopher, et al.
Published: (2025)
by: Hoang, Christopher, et al.
Published: (2025)
Designing deep neural networks for driver intention recognition
by: Vellenga, Koen, et al.
Published: (2024)
by: Vellenga, Koen, et al.
Published: (2024)
VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning
by: Lin, Han, et al.
Published: (2024)
by: Lin, Han, et al.
Published: (2024)
Action-slot: Visual Action-centric Representations for Multi-label Atomic Activity Recognition in Traffic Scenes
by: Kung, Chi-Hsi, et al.
Published: (2023)
by: Kung, Chi-Hsi, et al.
Published: (2023)
Advancing Compressed Video Action Recognition through Progressive Knowledge Distillation
by: Soufleri, Efstathia, et al.
Published: (2024)
by: Soufleri, Efstathia, et al.
Published: (2024)
VideoNet: A Large-Scale Dataset for Domain-Specific Action Recognition
by: Yadav, Tanush, et al.
Published: (2026)
by: Yadav, Tanush, et al.
Published: (2026)
Olaf-World: Orienting Latent Actions for Video World Modeling
by: Jiang, Yuxin, et al.
Published: (2026)
by: Jiang, Yuxin, et al.
Published: (2026)
Being-H0.7: A Latent World-Action Model from Egocentric Videos
by: Luo, Hao, et al.
Published: (2026)
by: Luo, Hao, et al.
Published: (2026)
DeCo-VAE: Learning Compact Latents for Video Reconstruction via Decoupled Representation
by: Yin, Xiangchen, et al.
Published: (2025)
by: Yin, Xiangchen, et al.
Published: (2025)
Exploring Video-Based Driver Activity Recognition under Noisy Labels
by: Fan, Linjuan, et al.
Published: (2025)
by: Fan, Linjuan, et al.
Published: (2025)
When Spatial meets Temporal in Action Recognition
by: Chen, Huilin, et al.
Published: (2024)
by: Chen, Huilin, et al.
Published: (2024)
Canonical Latent Representations in Conditional Diffusion Models
by: Xu, Yitao, et al.
Published: (2025)
by: Xu, Yitao, et al.
Published: (2025)
Segment to Focus: Guiding Latent Action Models in the Presence of Distractors
by: Fechner, Marcus, et al.
Published: (2026)
by: Fechner, Marcus, et al.
Published: (2026)
Improving Out-of-distribution Human Activity Recognition via IMU-Video Cross-modal Representation Learning
by: Cheshmi, Seyyed Saeid, et al.
Published: (2025)
by: Cheshmi, Seyyed Saeid, et al.
Published: (2025)
Machine Learning-Based Vehicle Intention Trajectory Recognition and Prediction for Autonomous Driving
by: Yu, Hanyi, et al.
Published: (2024)
by: Yu, Hanyi, et al.
Published: (2024)
Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections
by: Zhou, Youwei, et al.
Published: (2024)
by: Zhou, Youwei, et al.
Published: (2024)
ReSpike: Residual Frames-based Hybrid Spiking Neural Networks for Efficient Action Recognition
by: Xiao, Shiting, et al.
Published: (2024)
by: Xiao, Shiting, et al.
Published: (2024)
CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition
by: Wen, Yuhang, et al.
Published: (2024)
by: Wen, Yuhang, et al.
Published: (2024)
GraphAU-Pain: Graph-based Action Unit Representation for Pain Intensity Estimation
by: Wang, Zhiyu, et al.
Published: (2025)
by: Wang, Zhiyu, et al.
Published: (2025)
Box2Flow: Instance-based Action Flow Graphs from Videos
by: Li, Jiatong, et al.
Published: (2024)
by: Li, Jiatong, et al.
Published: (2024)
LLVD: LSTM-based Explicit Motion Modeling in Latent Space for Blind Video Denoising
by: Rashid, Loay, et al.
Published: (2025)
by: Rashid, Loay, et al.
Published: (2025)
Gaze-Guided Graph Neural Network for Action Anticipation Conditioned on Intention
by: Ozdel, Suleyman, et al.
Published: (2024)
by: Ozdel, Suleyman, et al.
Published: (2024)
Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition
by: Li, Kun, et al.
Published: (2024)
by: Li, Kun, et al.
Published: (2024)
On the Utility of 3D Hand Poses for Action Recognition
by: Shamil, Md Salman, et al.
Published: (2024)
by: Shamil, Md Salman, et al.
Published: (2024)
Latent Equivariant Operators for Robust Object Recognition: Promises and Challenges
by: Dinh, Minh, et al.
Published: (2026)
by: Dinh, Minh, et al.
Published: (2026)
Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition
by: Aganian, Dustin, et al.
Published: (2025)
by: Aganian, Dustin, et al.
Published: (2025)
Enhancing Interpretability of Sparse Latent Representations with Class Information
by: Abiz, Farshad Sangari, et al.
Published: (2025)
by: Abiz, Farshad Sangari, et al.
Published: (2025)
Hierarchical Uncertainty Estimation for Learning-based Registration in Neuroimaging
by: Hu, Xiaoling, et al.
Published: (2024)
by: Hu, Xiaoling, et al.
Published: (2024)
TrajFusionNet: Pedestrian Crossing Intention Prediction via Fusion of Sequential and Visual Trajectory Representations
by: Landry, François G., et al.
Published: (2025)
by: Landry, François G., et al.
Published: (2025)
Motus: A Unified Latent Action World Model
by: Bi, Hongzhe, et al.
Published: (2025)
by: Bi, Hongzhe, et al.
Published: (2025)
Balancing the Scales: Enhancing Fairness in Facial Expression Recognition with Latent Alignment
by: Rizvi, Syed Sameen Ahmad, et al.
Published: (2024)
by: Rizvi, Syed Sameen Ahmad, et al.
Published: (2024)
Mind2Drive: Predicting Driver Intentions from EEG in Real-world On-Road Driving
by: Alosaimi, Ghadah, et al.
Published: (2026)
by: Alosaimi, Ghadah, et al.
Published: (2026)
Isometric Representation Learning for Disentangled Latent Space of Diffusion Models
by: Hahm, Jaehoon, et al.
Published: (2024)
by: Hahm, Jaehoon, et al.
Published: (2024)
Similar Items
-
Last Layer Hamiltonian Monte Carlo
by: Vellenga, Koen, et al.
Published: (2025) -
Taylor Videos for Action Recognition
by: Wang, Lei, et al.
Published: (2024) -
Video RWKV:Video Action Recognition Based RWKV
by: Yin, Zhuowen, et al.
Published: (2024) -
CM2-Net: Continual Cross-Modal Mapping Network for Driver Action Recognition
by: Wang, Ruoyu, et al.
Published: (2024) -
Uncertainties of Latent Representations in Computer Vision
by: Kirchhof, Michael
Published: (2024)