| Main Authors: | Zhao, Hongshen, Tai, Jingkang, Wu, Yuhang, Zhang, Wenkang, Lan, Xi, Wang, Shangyan, Zhang, Tianyu, Yang, Wankou |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2603.24006 |
Similar Items
ClickVOS: Click Video Object Segmentation
by: Guo, Pinxue, et al.
Published: (2024)
Mamba-Adaptor: State Space Model Adaptor for Visual Recognition
by: Xie, Fei, et al.
Published: (2025)
Point-VOS: Pointing Up Video Object Segmentation
by: Zulfikar, Idil Esen, et al.
Published: (2024)
ActionVOS: Actions as Prompts for Video Object Segmentation
by: Ouyang, Liangyang, et al.
Published: (2024)
OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework
by: Li, Wanyun, et al.
Published: (2024)
Object-level Geometric Structure Preserving for Natural Image Stitching
by: Cai, Wenxiao, et al.
Published: (2024)
VDD: Varied Drone Dataset for Semantic Segmentation
by: Cai, Wenxiao, et al.
Published: (2023)
LiVOS: Light Video Object Segmentation with Gated Linear Matching
by: Liu, Qin, et al.
Published: (2024)
UW-GS: Distractor-Aware 3D Gaussian Splatting for Enhanced Underwater Scene Reconstruction
by: Wang, Haoran, et al.
Published: (2024)
DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation
by: Fedynyak, Volodymyr, et al.
Published: (2024)
InceptionMamba: An Efficient Hybrid Network with Large Band Convolution and Bottleneck Mamba
by: Wang, Yuhang, et al.
Published: (2025)
Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset
by: Lian, Shijie, et al.
Published: (2024)
UW-3DGS: Underwater 3D Reconstruction with Physics-Aware Gaussian Splatting
by: Xing, Wenpeng, et al.
Published: (2025)
Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track
by: Pan, Feiyu, et al.
Published: (2024)
PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
by: Dai, Ming, et al.
Published: (2025)
PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation
by: Yan, Shilin, et al.
Published: (2023)
The Common Objects Underwater (COU) Dataset for Robust Underwater Object Detection
by: Mukherjee, Rishi, et al.
Published: (2025)
LOVO: Efficient Complex Object Query in Large-Scale Video Datasets
by: Liu, Yuxin, et al.
Published: (2025)
M$^3$-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation
by: Chen, Zixuan, et al.
Published: (2024)
SVAC: Scaling Is All You Need For Referring Video Object Segmentation
by: Zhang, Li, et al.
Published: (2025)
Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection
by: Gao, Shixuan, et al.
Published: (2024)
Global Motion Understanding in Large-Scale Video Object Segmentation
by: Fedynyak, Volodymyr, et al.
Published: (2024)
UNO: Unifying One-stage Video Scene Graph Generation via Object-Centric Visual Representation Learning
by: Le, Huy, et al.
Published: (2025)
SFFR: Spatial-Frequency Feature Reconstruction for Multispectral Aerial Object Detection
by: Zuo, Xin, et al.
Published: (2025)
2nd Place Report of MOSEv2 Challenge 2025: Concept Guided Video Object Segmentation via SeC
by: Zhang, Zhixiong, et al.
Published: (2025)
MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection
by: Gao, Shuyong, et al.
Published: (2025)
MomentSeg: Moment-Centric Sampling for Enhanced Video Pixel Understanding
by: Dai, Ming, et al.
Published: (2025)
UW-SDF: Exploiting Hybrid Geometric Priors for Neural SDF Reconstruction from Underwater Multi-view Monocular Images
by: Chen, Zeyu, et al.
Published: (2024)
Advancing Complex Video Object Segmentation via Progressive Concept Construction
by: Zhang, Zhixiong, et al.
Published: (2025)
Implicit Motion-Compensated Network for Unsupervised Video Object Segmentation
by: Xi, Lin, et al.
Published: (2022)
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation
by: Li, Yonglin, et al.
Published: (2023)
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
by: Rong, Fu, et al.
Published: (2025)
Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
by: Hu, Xiantao, et al.
Published: (2024)
WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark
by: Zhang, Chunhui, et al.
Published: (2024)
EM-KD: Distilling Efficient Multimodal Large Language Model with Unbalanced Vision Tokens
by: Feng, Ze, et al.
Published: (2025)
Online Unsupervised Video Object Segmentation via Contrastive Motion Clustering
by: Xi, Lin, et al.
Published: (2023)
Online Reasoning Video Object Segmentation
by: Liu, Jinyuan, et al.
Published: (2026)
Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation
by: Zhang, Mengchen, et al.
Published: (2024)
Adaptive Multi-source Predictor for Zero-shot Video Object Segmentation
by: Zhao, Xiaoqi, et al.
Published: (2023)
Multispectral State-Space Feature Fusion: Bridging Shared and Cross-Parametric Interactions for Object Detection
by: Shen, Jifeng, et al.
Published: (2025)