:: Library Catalog

Bibliographic Details
Main Authors:	Viertola, Ilpo; Iashin, Vladimir; Rahtu, Esa
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2509.26604

Similar Items

Temporally Aligned Audio for Video with Autoregression
by: Viertola, Ilpo, et al.
Published: (2024)

Synchformer: Efficient Synchronization from Sparse Cues
by: Iashin, Vladimir, et al.
Published: (2024)

PanDepth: Joint Panoptic Segmentation and Depth Completion
by: Lagos, Juan, et al.
Published: (2022)

GS-Pose: Generalizable Segmentation-based 6D Object Pose Estimation with 3D Gaussian Splatting
by: Cai, Dingding, et al.
Published: (2024)

The Weighting Game: Evaluating Quality of Explainability Methods
by: Raatikainen, Lassi, et al.
Published: (2022)

SemSegDepth: A Combined Model for Semantic Segmentation and Depth Completion
by: Lagos, Juan Pablo, et al.
Published: (2022)

Beyond Visual Field of View: Perceiving 3D Environment with Echoes and Vision
by: Zhu, Lingyu, et al.
Published: (2022)

3D Gaussian Splatting with Fisheye Images: Field of View Analysis and Depth-Based Initialization
by: Gunes, Ulas, et al.
Published: (2025)

UDGS-SLAM: UniDepth Assisted Gaussian Splatting for Monocular SLAM
by: Mansour, Mostafa, et al.
Published: (2024)

FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking
by: Gunes, Ulas, et al.
Published: (2025)

DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing
by: Turkulainen, Matias, et al.
Published: (2024)

HybVIO: Pushing the Limits of Real-time Visual-inertial Odometry
by: Seiskari, Otto, et al.
Published: (2021)

MuSHRoom: Multi-Sensor Hybrid Room Dataset for Joint 3D Reconstruction and Novel View Synthesis
by: Ren, Xuqian, et al.
Published: (2023)

AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones
by: Ren, Xuqian, et al.
Published: (2024)

Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion
by: Seiskari, Otto, et al.
Published: (2024)

Self-supervised Learning on Camera Trap Footage Yields a Strong Universal Face Embedder
by: Iashin, Vladimir, et al.
Published: (2025)

NN-VVC: Versatile Video Coding boosted by self-supervisedly learned image coding for machines
by: Ahonen, Jukka I., et al.
Published: (2024)

Putting the Object Back into Video Object Segmentation
by: Cheng, Ho Kei, et al.
Published: (2023)

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
by: Huang, Shaofei, et al.
Published: (2024)

Robust Promptable Video Object Segmentation
by: Lee, Sohyun, et al.
Published: (2026)

Multi-Granularity Video Object Segmentation
by: Lim, Sangbeom, et al.
Published: (2024)

Online Reasoning Video Object Segmentation
by: Liu, Jinyuan, et al.
Published: (2026)

Improving Unsupervised Video Object Segmentation via Fake Flow Generation
by: Cho, Suhwan, et al.
Published: (2024)

Bridging the gap between image coding for machines and humans
by: Le, Nam, et al.
Published: (2024)

Harnessing Vision-Language Pretrained Models with Temporal-Aware Adaptation for Referring Video Object Segmentation
by: Zhou, Zikun, et al.
Published: (2024)

Video Object Segmentation with Dynamic Query Modulation
by: Zhou, Hantao, et al.
Published: (2024)

Full-Duplex Strategy for Video Object Segmentation
by: Ji, Ge-Peng, et al.
Published: (2021)

ClickVOS: Click Video Object Segmentation
by: Guo, Pinxue, et al.
Published: (2024)

One-shot Training for Video Object Segmentation
by: Chen, Baiyu, et al.
Published: (2024)

Scalable Video Object Segmentation with Identification Mechanism
by: Yang, Zongxin, et al.
Published: (2022)

Dual Prototype Attention for Unsupervised Video Object Segmentation
by: Cho, Suhwan, et al.
Published: (2022)

Point-VOS: Pointing Up Video Object Segmentation
by: Zulfikar, Idil Esen, et al.
Published: (2024)

Training-Free Robust Interactive Video Object Segmentation
by: Wei, Xiaoli, et al.
Published: (2024)

Guided Slot Attention for Unsupervised Video Object Segmentation
by: Lee, Minhyeok, et al.
Published: (2023)

ActionVOS: Actions as Prompts for Video Object Segmentation
by: Ouyang, Liangyang, et al.
Published: (2024)

Event-assisted Low-Light Video Object Segmentation
by: Li, Hebei, et al.
Published: (2024)

Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future
by: Xu, Guoping, et al.
Published: (2025)

HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness
by: Xue, Zihui, et al.
Published: (2024)

CAVIS: Context-Aware Video Instance Segmentation
by: Lee, Seunghun, et al.
Published: (2024)

Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes
by: Wang, Yaoting, et al.
Published: (2024)