:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhao, Chen, Thabet, Ali, Ghanem, Bernard
Format:	Preprint
Published:	2020
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2011.14598
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Harnessing Temporal Causality for Advanced Temporal Action Detection
by: Liu, Shuming, et al.
Published: (2024)

End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
by: Liu, Shuming, et al.
Published: (2023)

OSL-ActionSpotting: A Unified Library for Action Spotting in Sports Videos
by: Benzakour, Yassine, et al.
Published: (2024)

BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding
by: Liu, Shuming, et al.
Published: (2025)

SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
by: Thoker, Fida Mohammad, et al.
Published: (2025)

Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization
by: Mai, Jinjie, et al.
Published: (2024)

Towards Active Learning for Action Spotting in Association Football Videos
by: Giancola, Silvio, et al.
Published: (2023)

TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos
by: Zhang, Chen-Lin, et al.
Published: (2025)

StabStitch++: Unsupervised Online Video Stitching with Spatiotemporal Bidirectional Warps
by: Nie, Lang, et al.
Published: (2025)

OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
by: Liu, Shuming, et al.
Published: (2025)

Test-Time Adaptation for Combating Missing Modalities in Egocentric Videos
by: Ramazanova, Merey, et al.
Published: (2024)

$β$-CLIP: Text-Conditioned Contrastive Learning for Multi-Granular Vision-Language Alignment
by: Zohra, Fatimah, et al.
Published: (2025)

Eliminating Warping Shakes for Unsupervised Online Video Stitching
by: Nie, Lang, et al.
Published: (2024)

Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized?
by: Xie, Jianyang, et al.
Published: (2025)

Style Transfer: From Stitching to Neural Networks
by: Xu, Xinhe, et al.
Published: (2024)

TrackMAE: Video Representation Learning via Track Mask and Predict
by: Vandeghen, Renaud, et al.
Published: (2026)

DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition
by: Ullah, Hayat, et al.
Published: (2025)

Joint Self-Supervised Video Alignment and Action Segmentation
by: Ali, Ali Shah, et al.
Published: (2025)

Investigating Event-Based Cameras for Video Frame Interpolation in Sports
by: Deckyvere, Antoine, et al.
Published: (2024)

AutoSew: A Geometric Approach to Stitching Prediction with Graph Neural Networks
by: Ríos-Navarro, Pablo, et al.
Published: (2026)

SoccerNet-Caption: Dense Video Captioning for Soccer Broadcasts Commentaries
by: Mkhallati, Hassan, et al.
Published: (2023)

SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning
by: Thoker, Fida Mohammad, et al.
Published: (2025)

Leveraging Local Patch Alignment to Seam-cutting for Large Parallax Image Stitching
by: Liao, Tianli, et al.
Published: (2023)

MoCA-Video: Motion-Aware Concept Alignment for Consistent Video Editing
by: Zhang, Tong, et al.
Published: (2025)

ResidualViT for Efficient Temporally Dense Video Encoding
by: Soldan, Mattia, et al.
Published: (2025)

Temporal Divide-and-Conquer Anomaly Actions Localization in Semi-Supervised Videos with Hierarchical Transformer
by: Osman, Nada, et al.
Published: (2024)

Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator
by: Go, Hyojun, et al.
Published: (2025)

Stitch-a-Demo: Video Demonstrations from Multistep Descriptions
by: Wu, Chi Hsuan, et al.
Published: (2025)

Technical Report for ActivityNet Challenge 2022 -- Temporal Action Localization
by: Chen, Shimin, et al.
Published: (2024)

UniStitch: Unifying Semantic and Geometric Features for Image Stitching
by: Mei, Yuan, et al.
Published: (2026)

Stitch Contrast and Segment_Learning a Human Action Segmentation Model Using Trimmed Skeleton Videos
by: Tian, Haitao, et al.
Published: (2024)

Temporal Zoom Networks: Distance Regression and Continuous Depth for Efficient Action Localization
by: Shihab, Ibne Farabi, et al.
Published: (2025)

TDS-CLIP: Temporal Difference Side Network for Efficient VideoAction Recognition
by: Wang, Bin, et al.
Published: (2024)

Vivid-ZOO: Multi-View Video Generation with Diffusion Model
by: Li, Bing, et al.
Published: (2024)

Can Video Diffusion Model Reconstruct 4D Geometry?
by: Mai, Jinjie, et al.
Published: (2025)

UniVerse-1: Unified Audio-Video Generation via Stitching of Experts
by: Wang, Duomin, et al.
Published: (2025)

STAT: Towards Generalizable Temporal Action Localization
by: Liu, Yangcen, et al.
Published: (2024)

VARS: Video Assistant Referee System for Automated Soccer Decision Making from Multiple Views
by: Held, Jan, et al.
Published: (2023)

Leveraging Temporal Contextualization for Video Action Recognition
by: Kim, Minji, et al.
Published: (2024)

Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization
by: Hyun, Jeongseok, et al.
Published: (2024)