:: Library Catalog

Bibliographic Details
Main Authors:	Yi, Jinhui; Luo, Yanan; Deichmann, Marion; Schaaf, Gabriel; Gall, Juergen
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2409.00903

Similar Items

Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models
by: Yi, Jinhui, et al.
Published: (2024)

Rethinking temporal self-similarity for repetitive action counting
by: Luo, Yanan, et al.
Published: (2024)

MV-RoMa: From Pairwise Matching into Multi-View Track Reconstruction
by: Lee, Jongmin, et al.
Published: (2026)

MV3DIS: Multi-View Mask Matching via 3D Guides for Zero-Shot 3D Instance Segmentation
by: Zhao, Yibo, et al.
Published: (2026)

ADA-Track++: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association
by: Ding, Shuxiao, et al.
Published: (2024)

MatchAttention: Matching the Relative Positions for High-Resolution Cross-View Matching
by: Yan, Tingman, et al.
Published: (2025)

MV-TAP: Tracking Any Point in Multi-View Videos
by: Koo, Jahyeok, et al.
Published: (2025)

Learning a Neural Association Network for Self-supervised Multi-Object Tracking
by: Li, Shuai, et al.
Published: (2024)

ViewBridge: Revisiting Cross-View Localization from Image Matching
by: Xia, Panwang, et al.
Published: (2025)

Towards Generalizing Temporal Action Segmentation to Unseen Views
by: Bahrami, Emad, et al.
Published: (2025)

Identifying Spatio-Temporal Drivers of Extreme Events
by: Eddin, Mohamad Hakam Shams, et al.
Published: (2024)

LC-SLab -- An Object-based Deep Learning Framework for Large-scale Land Cover Classification from Satellite Imagery and Sparse In-situ Labels
by: Leonhardt, Johannes, et al.
Published: (2025)

Analysis of Plant Nutrient Deficiencies Using Multi-Spectral Imaging and Optimized Segmentation Model
by: Wu, Ji-Yan, et al.
Published: (2025)

Self-Supervised Partial Cycle-Consistency for Multi-View Matching
by: Taggenbrock, Fedor, et al.
Published: (2025)

Local Adaptive Clustering Based Image Matching for Automatic Visual Identification
by: Wang, Zhizhen
Published: (2024)

FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training
by: Yin, Ruihong, et al.
Published: (2024)

MV-SAM3D: Adaptive Multi-View Fusion for Layout-Aware 3D Generation
by: Li, Baicheng, et al.
Published: (2026)

MV-VTON: Multi-View Virtual Try-On with Diffusion Models
by: Wang, Haoyu, et al.
Published: (2024)

MV2MAE: Multi-View Video Masked Autoencoders
by: Shah, Ketul, et al.
Published: (2024)

CamC2V: Context-aware Controllable Video Generation
by: Denninger, Luis, et al.
Published: (2025)

MultiMatch: Multi-task Learning for Semi-supervised Domain Generalization
by: Qi, Lei, et al.
Published: (2022)

View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV
by: Ji, Deyi, et al.
Published: (2024)

SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction
by: Pallotta, Enrico, et al.
Published: (2025)

Fréchet Wavelet Distance: A Domain-Agnostic Metric for Image Generation
by: Veeramacheneni, Lokesh, et al.
Published: (2023)

Sequence-Adaptive Video Prediction in Continuous Streams using Diffusion Noise Optimization
by: Azar, Sina Mokhtarzadeh, et al.
Published: (2025)

Video Panels for Long Video Understanding
by: Doorenbos, Lars, et al.
Published: (2025)

MV-GMN: State Space Model for Multi-View Action Recognition
by: Lin, Yuhui, et al.
Published: (2025)

Multi-Memory Matching for Unsupervised Visible-Infrared Person Re-Identification
by: Shi, Jiangming, et al.
Published: (2024)

Using Visual Anomaly Detection for Task Execution Monitoring
by: Thoduka, Santosh, et al.
Published: (2021)

Hierarchical Vector Quantization for Unsupervised Action Segmentation
by: Spurio, Federico, et al.
Published: (2024)

Reference-Free Omnidirectional Stereo Matching via Multi-View Consistency Maximization
by: Xu, Lehuai, et al.
Published: (2026)

Adaptive Multi-Modal Cross-Entropy Loss for Stereo Matching
by: Xu, Peng, et al.
Published: (2023)

MV-DETR: Multi-modality indoor object detection by Multi-View DEtection TRansformers
by: Dong, Zichao, et al.
Published: (2024)

GLEAM: Learning to Match and Explain in Cross-View Geo-Localization
by: Lu, Xudong, et al.
Published: (2025)

MVAM: Multi-View Attention Method for Fine-grained Image-Text Matching
by: Cui, Wanqing, et al.
Published: (2024)

FireANTs: Adaptive Riemannian Optimization for Multi-Scale Diffeomorphic Matching
by: Jena, Rohit, et al.
Published: (2024)

StableMamba: Distillation-free Scaling of Large SSMs for Images and Videos
by: Suleman, Hamid, et al.
Published: (2024)

Skeleton Motion Words for Unsupervised Skeleton-Based Temporal Action Segmentation
by: Gökay, Uzay, et al.
Published: (2025)

FlowNar: Scalable Streaming Narration for Long-Form Videos
by: Zhong, Zeyun, et al.
Published: (2026)

DcMatch: Unsupervised Multi-Shape Matching with Dual-Level Consistency
by: Ye, Tianwei, et al.
Published: (2025)