Saved in:
| Main Authors: | Yi, Jinhui, Luo, Yanan, Deichmann, Marion, Schaaf, Gabriel, Gall, Juergen |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.00903 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models
by: Yi, Jinhui, et al.
Published: (2024)
by: Yi, Jinhui, et al.
Published: (2024)
Rethinking temporal self-similarity for repetitive action counting
by: Luo, Yanan, et al.
Published: (2024)
by: Luo, Yanan, et al.
Published: (2024)
MV-RoMa: From Pairwise Matching into Multi-View Track Reconstruction
by: Lee, Jongmin, et al.
Published: (2026)
by: Lee, Jongmin, et al.
Published: (2026)
MV3DIS: Multi-View Mask Matching via 3D Guides for Zero-Shot 3D Instance Segmentation
by: Zhao, Yibo, et al.
Published: (2026)
by: Zhao, Yibo, et al.
Published: (2026)
ADA-Track++: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association
by: Ding, Shuxiao, et al.
Published: (2024)
by: Ding, Shuxiao, et al.
Published: (2024)
MatchAttention: Matching the Relative Positions for High-Resolution Cross-View Matching
by: Yan, Tingman, et al.
Published: (2025)
by: Yan, Tingman, et al.
Published: (2025)
MV-TAP: Tracking Any Point in Multi-View Videos
by: Koo, Jahyeok, et al.
Published: (2025)
by: Koo, Jahyeok, et al.
Published: (2025)
Learning a Neural Association Network for Self-supervised Multi-Object Tracking
by: Li, Shuai, et al.
Published: (2024)
by: Li, Shuai, et al.
Published: (2024)
ViewBridge:Revisiting Cross-View Localization from Image Matching
by: Xia, Panwang, et al.
Published: (2025)
by: Xia, Panwang, et al.
Published: (2025)
Towards Generalizing Temporal Action Segmentation to Unseen Views
by: Bahrami, Emad, et al.
Published: (2025)
by: Bahrami, Emad, et al.
Published: (2025)
Identifying Spatio-Temporal Drivers of Extreme Events
by: Eddin, Mohamad Hakam Shams, et al.
Published: (2024)
by: Eddin, Mohamad Hakam Shams, et al.
Published: (2024)
LC-SLab -- An Object-based Deep Learning Framework for Large-scale Land Cover Classification from Satellite Imagery and Sparse In-situ Labels
by: Leonhardt, Johannes, et al.
Published: (2025)
by: Leonhardt, Johannes, et al.
Published: (2025)
Analysis of Plant Nutrient Deficiencies Using Multi-Spectral Imaging and Optimized Segmentation Model
by: Wu, Ji-Yan, et al.
Published: (2025)
by: Wu, Ji-Yan, et al.
Published: (2025)
Self-Supervised Partial Cycle-Consistency for Multi-View Matching
by: Taggenbrock, Fedor, et al.
Published: (2025)
by: Taggenbrock, Fedor, et al.
Published: (2025)
Local Adaptive Clustering Based Image Matching for Automatic Visual Identification
by: Wang, Zhizhen
Published: (2024)
by: Wang, Zhizhen
Published: (2024)
FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training
by: Yin, Ruihong, et al.
Published: (2024)
by: Yin, Ruihong, et al.
Published: (2024)
MV-SAM3D: Adaptive Multi-View Fusion for Layout-Aware 3D Generation
by: Li, Baicheng, et al.
Published: (2026)
by: Li, Baicheng, et al.
Published: (2026)
MV-VTON: Multi-View Virtual Try-On with Diffusion Models
by: Wang, Haoyu, et al.
Published: (2024)
by: Wang, Haoyu, et al.
Published: (2024)
MV2MAE: Multi-View Video Masked Autoencoders
by: Shah, Ketul, et al.
Published: (2024)
by: Shah, Ketul, et al.
Published: (2024)
CamC2V: Context-aware Controllable Video Generation
by: Denninger, Luis, et al.
Published: (2025)
by: Denninger, Luis, et al.
Published: (2025)
MultiMatch: Multi-task Learning for Semi-supervised Domain Generalization
by: Qi, Lei, et al.
Published: (2022)
by: Qi, Lei, et al.
Published: (2022)
View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV
by: Ji, Deyi, et al.
Published: (2024)
by: Ji, Deyi, et al.
Published: (2024)
SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction
by: Pallotta, Enrico, et al.
Published: (2025)
by: Pallotta, Enrico, et al.
Published: (2025)
Fréchet Wavelet Distance: A Domain-Agnostic Metric for Image Generation
by: Veeramacheneni, Lokesh, et al.
Published: (2023)
by: Veeramacheneni, Lokesh, et al.
Published: (2023)
Sequence-Adaptive Video Prediction in Continuous Streams using Diffusion Noise Optimization
by: Azar, Sina Mokhtarzadeh, et al.
Published: (2025)
by: Azar, Sina Mokhtarzadeh, et al.
Published: (2025)
Video Panels for Long Video Understanding
by: Doorenbos, Lars, et al.
Published: (2025)
by: Doorenbos, Lars, et al.
Published: (2025)
MV-GMN: State Space Model for Multi-View Action Recognition
by: Lin, Yuhui, et al.
Published: (2025)
by: Lin, Yuhui, et al.
Published: (2025)
Multi-Memory Matching for Unsupervised Visible-Infrared Person Re-Identification
by: Shi, Jiangming, et al.
Published: (2024)
by: Shi, Jiangming, et al.
Published: (2024)
Using Visual Anomaly Detection for Task Execution Monitoring
by: Thoduka, Santosh, et al.
Published: (2021)
by: Thoduka, Santosh, et al.
Published: (2021)
Hierarchical Vector Quantization for Unsupervised Action Segmentation
by: Spurio, Federico, et al.
Published: (2024)
by: Spurio, Federico, et al.
Published: (2024)
Reference-Free Omnidirectional Stereo Matching via Multi-View Consistency Maximization
by: Xu, Lehuai, et al.
Published: (2026)
by: Xu, Lehuai, et al.
Published: (2026)
Adaptive Multi-Modal Cross-Entropy Loss for Stereo Matching
by: Xu, Peng, et al.
Published: (2023)
by: Xu, Peng, et al.
Published: (2023)
MV-DETR: Multi-modality indoor object detection by Multi-View DEtecton TRansformers
by: Dong, Zichao, et al.
Published: (2024)
by: Dong, Zichao, et al.
Published: (2024)
GLEAM: Learning to Match and Explain in Cross-View Geo-Localization
by: Lu, Xudong, et al.
Published: (2025)
by: Lu, Xudong, et al.
Published: (2025)
MVAM: Multi-View Attention Method for Fine-grained Image-Text Matching
by: Cui, Wanqing, et al.
Published: (2024)
by: Cui, Wanqing, et al.
Published: (2024)
FireANTs: Adaptive Riemannian Optimization for Multi-Scale Diffeomorphic Matching
by: Jena, Rohit, et al.
Published: (2024)
by: Jena, Rohit, et al.
Published: (2024)
StableMamba: Distillation-free Scaling of Large SSMs for Images and Videos
by: Suleman, Hamid, et al.
Published: (2024)
by: Suleman, Hamid, et al.
Published: (2024)
Skeleton Motion Words for Unsupervised Skeleton-Based Temporal Action Segmentation
by: Gökay, Uzay, et al.
Published: (2025)
by: Gökay, Uzay, et al.
Published: (2025)
FlowNar: Scalable Streaming Narration for Long-Form Videos
by: Zhong, Zeyun, et al.
Published: (2026)
by: Zhong, Zeyun, et al.
Published: (2026)
DcMatch: Unsupervised Multi-Shape Matching with Dual-Level Consistency
by: Ye, Tianwei, et al.
Published: (2025)
by: Ye, Tianwei, et al.
Published: (2025)
Similar Items
-
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models
by: Yi, Jinhui, et al.
Published: (2024) -
Rethinking temporal self-similarity for repetitive action counting
by: Luo, Yanan, et al.
Published: (2024) -
MV-RoMa: From Pairwise Matching into Multi-View Track Reconstruction
by: Lee, Jongmin, et al.
Published: (2026) -
MV3DIS: Multi-View Mask Matching via 3D Guides for Zero-Shot 3D Instance Segmentation
by: Zhao, Yibo, et al.
Published: (2026) -
ADA-Track++: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association
by: Ding, Shuxiao, et al.
Published: (2024)