Library Catalog

Bibliographic Details
Main Authors:	Zhao, Hongshen; Tai, Jingkang; Wu, Yuhang; Zhang, Wenkang; Lan, Xi; Wang, Shangyan; Zhang, Tianyu; Yang, Wankou
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2603.24006

Similar Items

ClickVOS: Click Video Object Segmentation
by: Guo, Pinxue, et al.
Published: (2024)

Mamba-Adaptor: State Space Model Adaptor for Visual Recognition
by: Xie, Fei, et al.
Published: (2025)

Point-VOS: Pointing Up Video Object Segmentation
by: Zulfikar, Idil Esen, et al.
Published: (2024)

ActionVOS: Actions as Prompts for Video Object Segmentation
by: Ouyang, Liangyang, et al.
Published: (2024)

OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework
by: Li, Wanyun, et al.
Published: (2024)

Object-level Geometric Structure Preserving for Natural Image Stitching
by: Cai, Wenxiao, et al.
Published: (2024)

VDD: Varied Drone Dataset for Semantic Segmentation
by: Cai, Wenxiao, et al.
Published: (2023)

LiVOS: Light Video Object Segmentation with Gated Linear Matching
by: Liu, Qin, et al.
Published: (2024)

UW-GS: Distractor-Aware 3D Gaussian Splatting for Enhanced Underwater Scene Reconstruction
by: Wang, Haoran, et al.
Published: (2024)

DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation
by: Fedynyak, Volodymyr, et al.
Published: (2024)

InceptionMamba: An Efficient Hybrid Network with Large Band Convolution and Bottleneck Mamba
by: Wang, Yuhang, et al.
Published: (2025)

Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset
by: Lian, Shijie, et al.
Published: (2024)

UW-3DGS: Underwater 3D Reconstruction with Physics-Aware Gaussian Splatting
by: Xing, Wenpeng, et al.
Published: (2025)

Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track
by: Pan, Feiyu, et al.
Published: (2024)

PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
by: Dai, Ming, et al.
Published: (2025)

PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation
by: Yan, Shilin, et al.
Published: (2023)

The Common Objects Underwater (COU) Dataset for Robust Underwater Object Detection
by: Mukherjee, Rishi, et al.
Published: (2025)

LOVO: Efficient Complex Object Query in Large-Scale Video Datasets
by: Liu, Yuxin, et al.
Published: (2025)

M$^3$-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation
by: Chen, Zixuan, et al.
Published: (2024)

SVAC: Scaling Is All You Need For Referring Video Object Segmentation
by: Zhang, Li, et al.
Published: (2025)

Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection
by: Gao, Shixuan, et al.
Published: (2024)

Global Motion Understanding in Large-Scale Video Object Segmentation
by: Fedynyak, Volodymyr, et al.
Published: (2024)

UNO: Unifying One-stage Video Scene Graph Generation via Object-Centric Visual Representation Learning
by: Le, Huy, et al.
Published: (2025)

SFFR: Spatial-Frequency Feature Reconstruction for Multispectral Aerial Object Detection
by: Zuo, Xin, et al.
Published: (2025)

2nd Place Report of MOSEv2 Challenge 2025: Concept Guided Video Object Segmentation via SeC
by: Zhang, Zhixiong, et al.
Published: (2025)

MSVCOD: A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection
by: Gao, Shuyong, et al.
Published: (2025)

MomentSeg: Moment-Centric Sampling for Enhanced Video Pixel Understanding
by: Dai, Ming, et al.
Published: (2025)

UW-SDF: Exploiting Hybrid Geometric Priors for Neural SDF Reconstruction from Underwater Multi-view Monocular Images
by: Chen, Zeyu, et al.
Published: (2024)

Advancing Complex Video Object Segmentation via Progressive Concept Construction
by: Zhang, Zhixiong, et al.
Published: (2025)

Implicit Motion-Compensated Network for Unsupervised Video Object Segmentation
by: Xi, Lin, et al.
Published: (2022)

RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation
by: Li, Yonglin, et al.
Published: (2023)

MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
by: Rong, Fu, et al.
Published: (2025)

Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
by: Hu, Xiantao, et al.
Published: (2024)

WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark
by: Zhang, Chunhui, et al.
Published: (2024)

EM-KD: Distilling Efficient Multimodal Large Language Model with Unbalanced Vision Tokens
by: Feng, Ze, et al.
Published: (2025)

Online Unsupervised Video Object Segmentation via Contrastive Motion Clustering
by: Xi, Lin, et al.
Published: (2023)

Online Reasoning Video Object Segmentation
by: Liu, Jinyuan, et al.
Published: (2026)

Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation
by: Zhang, Mengchen, et al.
Published: (2024)

Adaptive Multi-source Predictor for Zero-shot Video Object Segmentation
by: Zhao, Xiaoqi, et al.
Published: (2023)

Multispectral State-Space Feature Fusion: Bridging Shared and Cross-Parametric Interactions for Object Detection
by: Shen, Jifeng, et al.
Published: (2025)