:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Mai, Jinjie, Hamdi, Abdullah, Giancola, Silvio, Zhao, Chen, Ghanem, Bernard
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2407.08023
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

MVTN: Learning Multi-View Transformations for 3D Understanding
by: Hamdi, Abdullah, et al.
Published: (2022)

TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks
by: Mai, Jinjie, et al.
Published: (2024)

VARS: Video Assistant Referee System for Automated Soccer Decision Making from Multiple Views
by: Held, Jan, et al.
Published: (2023)

Investigating Event-Based Cameras for Video Frame Interpolation in Sports
by: Deckyvere, Antoine, et al.
Published: (2024)

Towards AI-Powered Video Assistant Referee System (VARS) for Association Football
by: Held, Jan, et al.
Published: (2024)

Deep learning for action spotting in association football videos
by: Giancola, Silvio, et al.
Published: (2024)

3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes
by: Held, Jan, et al.
Published: (2024)

GoTTA be Diverse: Rethinking Memory Policies for Test-Time Adaptation
by: Alhuwaider, Shyma, et al.
Published: (2026)

Pixels or Positions? Benchmarking Modalities in Group Activity Recognition
by: Karki, Drishya, et al.
Published: (2025)

Learning Semantic Segmentation with Query Points Supervision on Aerial Images
by: Rivier, Santiago, et al.
Published: (2023)

SoccerNet-Caption: Dense Video Captioning for Soccer Broadcasts Commentaries
by: Mkhallati, Hassan, et al.
Published: (2023)

Triangle Splatting for Real-Time Radiance Field Rendering
by: Held, Jan, et al.
Published: (2025)

SPARF: Large-Scale Learning of 3D Sparse Radiance Fields from Few Input Images
by: Hamdi, Abdullah, et al.
Published: (2022)

Pix4Point: Image Pretrained Standard Transformers for 3D Point Cloud Understanding
by: Qian, Guocheng, et al.
Published: (2022)

X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Model
by: Held, Jan, et al.
Published: (2024)

OSL-ActionSpotting: A Unified Library for Action Spotting in Sports Videos
by: Benzakour, Yassine, et al.
Published: (2024)

GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering
by: Hamdi, Abdullah, et al.
Published: (2024)

SoccerLens: Grounded Soccer Video Understanding Beyond Accuracy
by: Elsharkawi, Ismael, et al.
Published: (2026)

4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding
by: Zhu, Wenxuan, et al.
Published: (2025)

Exploring Missing Modality in Multimodal Egocentric Datasets
by: Ramazanova, Merey, et al.
Published: (2024)

Towards Active Learning for Action Spotting in Association Football Videos
by: Giancola, Silvio, et al.
Published: (2023)

SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in Soccer Videos
by: Cioppa, Anthony, et al.
Published: (2022)

Video Self-Stitching Graph Network for Temporal Action Localization
by: Zhao, Chen, et al.
Published: (2020)

Test-Time Adaptation for Combating Missing Modalities in Egocentric Videos
by: Ramazanova, Merey, et al.
Published: (2024)

Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders
by: Eymaël, Alexandre, et al.
Published: (2024)

Vivid-ZOO: Multi-View Video Generation with Diffusion Model
by: Li, Bing, et al.
Published: (2024)

Can Video Diffusion Model Reconstruct 4D Geometry?
by: Mai, Jinjie, et al.
Published: (2025)

SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
by: Thoker, Fida Mohammad, et al.
Published: (2025)

Camera Relocalization in Shadow-free Neural Radiance Fields
by: Xu, Shiyao, et al.
Published: (2024)

OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
by: Liu, Shuming, et al.
Published: (2025)

WSCLoc: Weakly-Supervised Sparse-View Camera Relocalization
by: Wang, Jialu, et al.
Published: (2024)

Differentiable Product Quantization for Memory Efficient Camera Relocalization
by: Laskar, Zakaria, et al.
Published: (2024)

OmniEgoCap: Camera-Agnostic Sequence-Level Egocentric Motion Reconstruction
by: Cho, Kyungwon, et al.
Published: (2025)

Semantic Object-level Modeling for Robust Visual Camera Relocalization
by: Zhu, Yifan, et al.
Published: (2024)

EasyV2V: A High-quality Instruction-based Video Editing Framework
by: Mai, Jinjie, et al.
Published: (2025)

Estimating 2D Camera Motion with Hybrid Motion Basis
by: Li, Haipeng, et al.
Published: (2025)

EffLoc: Lightweight Vision Transformer for Efficient 6-DOF Camera Relocalization
by: Xiao, Zhendong, et al.
Published: (2024)

Action Anticipation from SoccerNet Football Video Broadcasts
by: Dalal, Mohamad, et al.
Published: (2025)

Improved 3D Point-Line Mapping Regression for Camera Relocalization
by: Bui, Bach-Thuan, et al.
Published: (2025)

BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding
by: Liu, Shuming, et al.
Published: (2025)