Saved in:
| Main Authors: | Mai, Jinjie, Hamdi, Abdullah, Giancola, Silvio, Zhao, Chen, Ghanem, Bernard |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.08023 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MVTN: Learning Multi-View Transformations for 3D Understanding
by: Hamdi, Abdullah, et al.
Published: (2022)
by: Hamdi, Abdullah, et al.
Published: (2022)
TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks
by: Mai, Jinjie, et al.
Published: (2024)
by: Mai, Jinjie, et al.
Published: (2024)
VARS: Video Assistant Referee System for Automated Soccer Decision Making from Multiple Views
by: Held, Jan, et al.
Published: (2023)
by: Held, Jan, et al.
Published: (2023)
Investigating Event-Based Cameras for Video Frame Interpolation in Sports
by: Deckyvere, Antoine, et al.
Published: (2024)
by: Deckyvere, Antoine, et al.
Published: (2024)
Towards AI-Powered Video Assistant Referee System (VARS) for Association Football
by: Held, Jan, et al.
Published: (2024)
by: Held, Jan, et al.
Published: (2024)
Deep learning for action spotting in association football videos
by: Giancola, Silvio, et al.
Published: (2024)
by: Giancola, Silvio, et al.
Published: (2024)
3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes
by: Held, Jan, et al.
Published: (2024)
by: Held, Jan, et al.
Published: (2024)
GoTTA be Diverse: Rethinking Memory Policies for Test-Time Adaptation
by: Alhuwaider, Shyma, et al.
Published: (2026)
by: Alhuwaider, Shyma, et al.
Published: (2026)
Pixels or Positions? Benchmarking Modalities in Group Activity Recognition
by: Karki, Drishya, et al.
Published: (2025)
by: Karki, Drishya, et al.
Published: (2025)
Learning Semantic Segmentation with Query Points Supervision on Aerial Images
by: Rivier, Santiago, et al.
Published: (2023)
by: Rivier, Santiago, et al.
Published: (2023)
SoccerNet-Caption: Dense Video Captioning for Soccer Broadcasts Commentaries
by: Mkhallati, Hassan, et al.
Published: (2023)
by: Mkhallati, Hassan, et al.
Published: (2023)
Triangle Splatting for Real-Time Radiance Field Rendering
by: Held, Jan, et al.
Published: (2025)
by: Held, Jan, et al.
Published: (2025)
SPARF: Large-Scale Learning of 3D Sparse Radiance Fields from Few Input Images
by: Hamdi, Abdullah, et al.
Published: (2022)
by: Hamdi, Abdullah, et al.
Published: (2022)
Pix4Point: Image Pretrained Standard Transformers for 3D Point Cloud Understanding
by: Qian, Guocheng, et al.
Published: (2022)
by: Qian, Guocheng, et al.
Published: (2022)
X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Model
by: Held, Jan, et al.
Published: (2024)
by: Held, Jan, et al.
Published: (2024)
OSL-ActionSpotting: A Unified Library for Action Spotting in Sports Videos
by: Benzakour, Yassine, et al.
Published: (2024)
by: Benzakour, Yassine, et al.
Published: (2024)
GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering
by: Hamdi, Abdullah, et al.
Published: (2024)
by: Hamdi, Abdullah, et al.
Published: (2024)
SoccerLens: Grounded Soccer Video Understanding Beyond Accuracy
by: Elsharkawi, Ismael, et al.
Published: (2026)
by: Elsharkawi, Ismael, et al.
Published: (2026)
4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding
by: Zhu, Wenxuan, et al.
Published: (2025)
by: Zhu, Wenxuan, et al.
Published: (2025)
Exploring Missing Modality in Multimodal Egocentric Datasets
by: Ramazanova, Merey, et al.
Published: (2024)
by: Ramazanova, Merey, et al.
Published: (2024)
Towards Active Learning for Action Spotting in Association Football Videos
by: Giancola, Silvio, et al.
Published: (2023)
by: Giancola, Silvio, et al.
Published: (2023)
SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in Soccer Videos
by: Cioppa, Anthony, et al.
Published: (2022)
by: Cioppa, Anthony, et al.
Published: (2022)
Video Self-Stitching Graph Network for Temporal Action Localization
by: Zhao, Chen, et al.
Published: (2020)
by: Zhao, Chen, et al.
Published: (2020)
Test-Time Adaptation for Combating Missing Modalities in Egocentric Videos
by: Ramazanova, Merey, et al.
Published: (2024)
by: Ramazanova, Merey, et al.
Published: (2024)
Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders
by: Eymaël, Alexandre, et al.
Published: (2024)
by: Eymaël, Alexandre, et al.
Published: (2024)
Vivid-ZOO: Multi-View Video Generation with Diffusion Model
by: Li, Bing, et al.
Published: (2024)
by: Li, Bing, et al.
Published: (2024)
Can Video Diffusion Model Reconstruct 4D Geometry?
by: Mai, Jinjie, et al.
Published: (2025)
by: Mai, Jinjie, et al.
Published: (2025)
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
by: Thoker, Fida Mohammad, et al.
Published: (2025)
by: Thoker, Fida Mohammad, et al.
Published: (2025)
Camera Relocalization in Shadow-free Neural Radiance Fields
by: Xu, Shiyao, et al.
Published: (2024)
by: Xu, Shiyao, et al.
Published: (2024)
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
by: Liu, Shuming, et al.
Published: (2025)
by: Liu, Shuming, et al.
Published: (2025)
WSCLoc: Weakly-Supervised Sparse-View Camera Relocalization
by: Wang, Jialu, et al.
Published: (2024)
by: Wang, Jialu, et al.
Published: (2024)
Differentiable Product Quantization for Memory Efficient Camera Relocalization
by: Laskar, Zakaria, et al.
Published: (2024)
by: Laskar, Zakaria, et al.
Published: (2024)
OmniEgoCap: Camera-Agnostic Sequence-Level Egocentric Motion Reconstruction
by: Cho, Kyungwon, et al.
Published: (2025)
by: Cho, Kyungwon, et al.
Published: (2025)
Semantic Object-level Modeling for Robust Visual Camera Relocalization
by: Zhu, Yifan, et al.
Published: (2024)
by: Zhu, Yifan, et al.
Published: (2024)
EasyV2V: A High-quality Instruction-based Video Editing Framework
by: Mai, Jinjie, et al.
Published: (2025)
by: Mai, Jinjie, et al.
Published: (2025)
Estimating 2D Camera Motion with Hybrid Motion Basis
by: Li, Haipeng, et al.
Published: (2025)
by: Li, Haipeng, et al.
Published: (2025)
EffLoc: Lightweight Vision Transformer for Efficient 6-DOF Camera Relocalization
by: Xiao, Zhendong, et al.
Published: (2024)
by: Xiao, Zhendong, et al.
Published: (2024)
Action Anticipation from SoccerNet Football Video Broadcasts
by: Dalal, Mohamad, et al.
Published: (2025)
by: Dalal, Mohamad, et al.
Published: (2025)
Improved 3D Point-Line Mapping Regression for Camera Relocalization
by: Bui, Bach-Thuan, et al.
Published: (2025)
by: Bui, Bach-Thuan, et al.
Published: (2025)
BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding
by: Liu, Shuming, et al.
Published: (2025)
by: Liu, Shuming, et al.
Published: (2025)
Similar Items
-
MVTN: Learning Multi-View Transformations for 3D Understanding
by: Hamdi, Abdullah, et al.
Published: (2022) -
TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks
by: Mai, Jinjie, et al.
Published: (2024) -
VARS: Video Assistant Referee System for Automated Soccer Decision Making from Multiple Views
by: Held, Jan, et al.
Published: (2023) -
Investigating Event-Based Cameras for Video Frame Interpolation in Sports
by: Deckyvere, Antoine, et al.
Published: (2024) -
Towards AI-Powered Video Assistant Referee System (VARS) for Association Football
by: Held, Jan, et al.
Published: (2024)