Saved in:
| Main Authors: | Zhao, Chen, Thabet, Ali, Ghanem, Bernard |
|---|---|
| Format: | Preprint |
| Published: |
2020
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2011.14598 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Harnessing Temporal Causality for Advanced Temporal Action Detection
by: Liu, Shuming, et al.
Published: (2024)
by: Liu, Shuming, et al.
Published: (2024)
End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
by: Liu, Shuming, et al.
Published: (2023)
by: Liu, Shuming, et al.
Published: (2023)
OSL-ActionSpotting: A Unified Library for Action Spotting in Sports Videos
by: Benzakour, Yassine, et al.
Published: (2024)
by: Benzakour, Yassine, et al.
Published: (2024)
BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding
by: Liu, Shuming, et al.
Published: (2025)
by: Liu, Shuming, et al.
Published: (2025)
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
by: Thoker, Fida Mohammad, et al.
Published: (2025)
by: Thoker, Fida Mohammad, et al.
Published: (2025)
Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization
by: Mai, Jinjie, et al.
Published: (2024)
by: Mai, Jinjie, et al.
Published: (2024)
Towards Active Learning for Action Spotting in Association Football Videos
by: Giancola, Silvio, et al.
Published: (2023)
by: Giancola, Silvio, et al.
Published: (2023)
TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos
by: Zhang, Chen-Lin, et al.
Published: (2025)
by: Zhang, Chen-Lin, et al.
Published: (2025)
StabStitch++: Unsupervised Online Video Stitching with Spatiotemporal Bidirectional Warps
by: Nie, Lang, et al.
Published: (2025)
by: Nie, Lang, et al.
Published: (2025)
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
by: Liu, Shuming, et al.
Published: (2025)
by: Liu, Shuming, et al.
Published: (2025)
Test-Time Adaptation for Combating Missing Modalities in Egocentric Videos
by: Ramazanova, Merey, et al.
Published: (2024)
by: Ramazanova, Merey, et al.
Published: (2024)
$β$-CLIP: Text-Conditioned Contrastive Learning for Multi-Granular Vision-Language Alignment
by: Zohra, Fatimah, et al.
Published: (2025)
by: Zohra, Fatimah, et al.
Published: (2025)
Eliminating Warping Shakes for Unsupervised Online Video Stitching
by: Nie, Lang, et al.
Published: (2024)
by: Nie, Lang, et al.
Published: (2024)
Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized?
by: Xie, Jianyang, et al.
Published: (2025)
by: Xie, Jianyang, et al.
Published: (2025)
Style Transfer: From Stitching to Neural Networks
by: Xu, Xinhe, et al.
Published: (2024)
by: Xu, Xinhe, et al.
Published: (2024)
TrackMAE: Video Representation Learning via Track Mask and Predict
by: Vandeghen, Renaud, et al.
Published: (2026)
by: Vandeghen, Renaud, et al.
Published: (2026)
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition
by: Ullah, Hayat, et al.
Published: (2025)
by: Ullah, Hayat, et al.
Published: (2025)
Joint Self-Supervised Video Alignment and Action Segmentation
by: Ali, Ali Shah, et al.
Published: (2025)
by: Ali, Ali Shah, et al.
Published: (2025)
Investigating Event-Based Cameras for Video Frame Interpolation in Sports
by: Deckyvere, Antoine, et al.
Published: (2024)
by: Deckyvere, Antoine, et al.
Published: (2024)
AutoSew: A Geometric Approach to Stitching Prediction with Graph Neural Networks
by: Ríos-Navarro, Pablo, et al.
Published: (2026)
by: Ríos-Navarro, Pablo, et al.
Published: (2026)
SoccerNet-Caption: Dense Video Captioning for Soccer Broadcasts Commentaries
by: Mkhallati, Hassan, et al.
Published: (2023)
by: Mkhallati, Hassan, et al.
Published: (2023)
SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning
by: Thoker, Fida Mohammad, et al.
Published: (2025)
by: Thoker, Fida Mohammad, et al.
Published: (2025)
Leveraging Local Patch Alignment to Seam-cutting for Large Parallax Image Stitching
by: Liao, Tianli, et al.
Published: (2023)
by: Liao, Tianli, et al.
Published: (2023)
MoCA-Video: Motion-Aware Concept Alignment for Consistent Video Editing
by: Zhang, Tong, et al.
Published: (2025)
by: Zhang, Tong, et al.
Published: (2025)
ResidualViT for Efficient Temporally Dense Video Encoding
by: Soldan, Mattia, et al.
Published: (2025)
by: Soldan, Mattia, et al.
Published: (2025)
Temporal Divide-and-Conquer Anomaly Actions Localization in Semi-Supervised Videos with Hierarchical Transformer
by: Osman, Nada, et al.
Published: (2024)
by: Osman, Nada, et al.
Published: (2024)
Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator
by: Go, Hyojun, et al.
Published: (2025)
by: Go, Hyojun, et al.
Published: (2025)
Stitch-a-Demo: Video Demonstrations from Multistep Descriptions
by: Wu, Chi Hsuan, et al.
Published: (2025)
by: Wu, Chi Hsuan, et al.
Published: (2025)
Technical Report for ActivityNet Challenge 2022 -- Temporal Action Localization
by: Chen, Shimin, et al.
Published: (2024)
by: Chen, Shimin, et al.
Published: (2024)
UniStitch: Unifying Semantic and Geometric Features for Image Stitching
by: Mei, Yuan, et al.
Published: (2026)
by: Mei, Yuan, et al.
Published: (2026)
Stitch Contrast and Segment_Learning a Human Action Segmentation Model Using Trimmed Skeleton Videos
by: Tian, Haitao, et al.
Published: (2024)
by: Tian, Haitao, et al.
Published: (2024)
Temporal Zoom Networks: Distance Regression and Continuous Depth for Efficient Action Localization
by: Shihab, Ibne Farabi, et al.
Published: (2025)
by: Shihab, Ibne Farabi, et al.
Published: (2025)
TDS-CLIP: Temporal Difference Side Network for Efficient VideoAction Recognition
by: Wang, Bin, et al.
Published: (2024)
by: Wang, Bin, et al.
Published: (2024)
Vivid-ZOO: Multi-View Video Generation with Diffusion Model
by: Li, Bing, et al.
Published: (2024)
by: Li, Bing, et al.
Published: (2024)
Can Video Diffusion Model Reconstruct 4D Geometry?
by: Mai, Jinjie, et al.
Published: (2025)
by: Mai, Jinjie, et al.
Published: (2025)
UniVerse-1: Unified Audio-Video Generation via Stitching of Experts
by: Wang, Duomin, et al.
Published: (2025)
by: Wang, Duomin, et al.
Published: (2025)
STAT: Towards Generalizable Temporal Action Localization
by: Liu, Yangcen, et al.
Published: (2024)
by: Liu, Yangcen, et al.
Published: (2024)
VARS: Video Assistant Referee System for Automated Soccer Decision Making from Multiple Views
by: Held, Jan, et al.
Published: (2023)
by: Held, Jan, et al.
Published: (2023)
Leveraging Temporal Contextualization for Video Action Recognition
by: Kim, Minji, et al.
Published: (2024)
by: Kim, Minji, et al.
Published: (2024)
Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization
by: Hyun, Jeongseok, et al.
Published: (2024)
by: Hyun, Jeongseok, et al.
Published: (2024)
Similar Items
-
Harnessing Temporal Causality for Advanced Temporal Action Detection
by: Liu, Shuming, et al.
Published: (2024) -
End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
by: Liu, Shuming, et al.
Published: (2023) -
OSL-ActionSpotting: A Unified Library for Action Spotting in Sports Videos
by: Benzakour, Yassine, et al.
Published: (2024) -
BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding
by: Liu, Shuming, et al.
Published: (2025) -
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
by: Thoker, Fida Mohammad, et al.
Published: (2025)