:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Vellenga, Koen, Steinhauer, H. Joe, Andersson, Jonas, Sjögren, Anders
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Machine Learning
Online Access:	https://arxiv.org/abs/2510.05006
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Last Layer Hamiltonian Monte Carlo
by: Vellenga, Koen, et al.
Published: (2025)

Taylor Videos for Action Recognition
by: Wang, Lei, et al.
Published: (2024)

Video RWKV:Video Action Recognition Based RWKV
by: Yin, Zhuowen, et al.
Published: (2024)

CM2-Net: Continual Cross-Modal Mapping Network for Driver Action Recognition
by: Wang, Ruoyu, et al.
Published: (2024)

Uncertainties of Latent Representations in Computer Vision
by: Kirchhof, Michael
Published: (2024)

Latent Action Pretraining from Videos
by: Ye, Seonghyeon, et al.
Published: (2024)

EZ-CLIP: Efficient Zeroshot Video Action Recognition
by: Ahmad, Shahzad, et al.
Published: (2023)

Midway Network: Learning Representations for Recognition and Motion from Latent Dynamics
by: Hoang, Christopher, et al.
Published: (2025)

Designing deep neural networks for driver intention recognition
by: Vellenga, Koen, et al.
Published: (2024)

VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning
by: Lin, Han, et al.
Published: (2024)

Action-slot: Visual Action-centric Representations for Multi-label Atomic Activity Recognition in Traffic Scenes
by: Kung, Chi-Hsi, et al.
Published: (2023)

Advancing Compressed Video Action Recognition through Progressive Knowledge Distillation
by: Soufleri, Efstathia, et al.
Published: (2024)

VideoNet: A Large-Scale Dataset for Domain-Specific Action Recognition
by: Yadav, Tanush, et al.
Published: (2026)

Olaf-World: Orienting Latent Actions for Video World Modeling
by: Jiang, Yuxin, et al.
Published: (2026)

Being-H0.7: A Latent World-Action Model from Egocentric Videos
by: Luo, Hao, et al.
Published: (2026)

DeCo-VAE: Learning Compact Latents for Video Reconstruction via Decoupled Representation
by: Yin, Xiangchen, et al.
Published: (2025)

Exploring Video-Based Driver Activity Recognition under Noisy Labels
by: Fan, Linjuan, et al.
Published: (2025)

When Spatial meets Temporal in Action Recognition
by: Chen, Huilin, et al.
Published: (2024)

Canonical Latent Representations in Conditional Diffusion Models
by: Xu, Yitao, et al.
Published: (2025)

Segment to Focus: Guiding Latent Action Models in the Presence of Distractors
by: Fechner, Marcus, et al.
Published: (2026)

Improving Out-of-distribution Human Activity Recognition via IMU-Video Cross-modal Representation Learning
by: Cheshmi, Seyyed Saeid, et al.
Published: (2025)

Machine Learning-Based Vehicle Intention Trajectory Recognition and Prediction for Autonomous Driving
by: Yu, Hanyi, et al.
Published: (2024)

Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections
by: Zhou, Youwei, et al.
Published: (2024)

ReSpike: Residual Frames-based Hybrid Spiking Neural Networks for Efficient Action Recognition
by: Xiao, Shiting, et al.
Published: (2024)

CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition
by: Wen, Yuhang, et al.
Published: (2024)

GraphAU-Pain: Graph-based Action Unit Representation for Pain Intensity Estimation
by: Wang, Zhiyu, et al.
Published: (2025)

Box2Flow: Instance-based Action Flow Graphs from Videos
by: Li, Jiatong, et al.
Published: (2024)

LLVD: LSTM-based Explicit Motion Modeling in Latent Space for Blind Video Denoising
by: Rashid, Loay, et al.
Published: (2025)

Gaze-Guided Graph Neural Network for Action Anticipation Conditioned on Intention
by: Ozdel, Suleyman, et al.
Published: (2024)

Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition
by: Li, Kun, et al.
Published: (2024)

On the Utility of 3D Hand Poses for Action Recognition
by: Shamil, Md Salman, et al.
Published: (2024)

Latent Equivariant Operators for Robust Object Recognition: Promises and Challenges
by: Dinh, Minh, et al.
Published: (2026)

Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition
by: Aganian, Dustin, et al.
Published: (2025)

Enhancing Interpretability of Sparse Latent Representations with Class Information
by: Abiz, Farshad Sangari, et al.
Published: (2025)

Hierarchical Uncertainty Estimation for Learning-based Registration in Neuroimaging
by: Hu, Xiaoling, et al.
Published: (2024)

TrajFusionNet: Pedestrian Crossing Intention Prediction via Fusion of Sequential and Visual Trajectory Representations
by: Landry, François G., et al.
Published: (2025)

Motus: A Unified Latent Action World Model
by: Bi, Hongzhe, et al.
Published: (2025)

Balancing the Scales: Enhancing Fairness in Facial Expression Recognition with Latent Alignment
by: Rizvi, Syed Sameen Ahmad, et al.
Published: (2024)

Mind2Drive: Predicting Driver Intentions from EEG in Real-world On-Road Driving
by: Alosaimi, Ghadah, et al.
Published: (2026)

Isometric Representation Learning for Disentangled Latent Space of Diffusion Models
by: Hahm, Jaehoon, et al.
Published: (2024)