:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Liao, Haojin, Song, Xiaolin, Zhao, Sicheng, Zhang, Shanghang, Yue, Xiangyu, Yao, Xingxu, Zhang, Yueming, Xing, Tengfei, Xu, Pengfei, Wang, Qiang
Format:	Preprint
Published:	2021
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2110.14240
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SAMSON: 3rd Place Solution of LSVOS 2025 VOS Challenge
by: Xie, Yujie, et al.
Published: (2025)

First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Atomic Activity Recognition 2024
by: Li, Ruyang, et al.
Published: (2024)

3rd Place Solution to Large-scale Fine-grained Food Recognition
by: Zhong, Yang, et al.
Published: (2025)

The Instance-centric Transformer for the RVOS Track of LSVOS Challenge: 3rd Place Solution
by: Cao, Bin, et al.
Published: (2024)

FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution
by: Wang, Mengjiao, et al.
Published: (2025)

3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
by: Wu, Ruipu, et al.
Published: (2024)

First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024
by: Zhang, Tengfei, et al.
Published: (2024)

LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS
by: Liu, Xinyu, et al.
Published: (2024)

MapVision: CVPR 2024 Autonomous Grand Challenge Mapless Driving Tech Report
by: Yang, Zhongyu, et al.
Published: (2024)

3rd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation
by: Liu, Xinyu, et al.
Published: (2024)

3rd Place Solution to ICCV LargeFineFoodAI Retrieval
by: Zhong, Yang, et al.
Published: (2025)

Second Place Solution of WSDM2023 Toloka Visual Question Answering Challenge
by: Wu, Xiangyu, et al.
Published: (2024)

The 3rd Place Solution of CCIR CUP 2025: A Framework for Retrieval-Augmented Generation in Multi-Turn Legal Conversation
by: Li, Da, et al.
Published: (2025)

Multi-source Domain Adaptation for Panoramic Semantic Segmentation
by: Jiang, Jing, et al.
Published: (2024)

A generalization of Reifenberg's theorem in R^N for flat cones
by: Liang, Xiangyu, et al.
Published: (2026)

vFusedSeg3D: 3rd Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation
by: Amjad, Osama, et al.
Published: (2024)

Enriched Feature Representation and Motion Prediction Module for MOSEv2 Track of 7th LSVOS Challenge: 3rd Place Solution
by: Lim, Chang Soo, et al.
Published: (2025)

An Effective End-to-End Solution for Multimodal Action Recognition
by: Wang, Songping, et al.
Published: (2025)

First Place Solution of 2023 Global Artificial Intelligence Technology Innovation Competition Track 1
by: Wu, Xiangyu, et al.
Published: (2024)

Generalized Categories Discovery for Long-tailed Recognition
by: Li, Ziyun, et al.
Published: (2023)

1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
by: Chen, Mingjie, et al.
Published: (2024)

PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic Pituitary Surgery
by: Das, Adrito, et al.
Published: (2024)

AutoMR: A Universal Time Series Motion Recognition Pipeline
by: Zhang, Likun, et al.
Published: (2025)

No Vision, No Wearables: 5G-based 2D Human Pose Recognition with Integrated Sensing and Communications
by: Li, Haojin, et al.
Published: (2025)

Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction
by: Yang, Senqiao, et al.
Published: (2023)

2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
by: Wu, Biao, et al.
Published: (2024)

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
by: Liu, Jiaming, et al.
Published: (2023)

3rd Place Solution for MeViS Track in CVPR 2024 PVUW workshop: Motion Expression guided Video Segmentation
by: Pan, Feiyu, et al.
Published: (2024)

More is Better: Deep Domain Adaptation with Multiple Sources
by: Zhao, Sicheng, et al.
Published: (2024)

4th PVUW MeViS 3rd Place Report: Sa2VA
by: Yuan, Haobo, et al.
Published: (2025)

Learning from Different Samples: A Source-free Framework for Semi-supervised Domain Adaptation
by: Huang, Xinyang, et al.
Published: (2024)

METTL16 and YTHDC1 Regulate Spermatogonial Differentiation via m6A
by: Xueying Gu, et al.
Published: (2024)

PathRWKV: Enhancing Whole Slide Image Inference with Asymmetric Recurrent Modeling
by: Zhang, Tianyi, et al.
Published: (2025)

Improving VisNet for Object Recognition
by: Serj, Mehdi Fatan, et al.
Published: (2025)

A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge -- Multi-Task Robustness Track
by: Chen, Zehui, et al.
Published: (2024)

BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection
by: Liu, Jiaming, et al.
Published: (2022)

Texo: Formula Recognition within 20M Parameters
by: Mao, Sicheng
Published: (2026)

SAGE: Spatial-visual Adaptive Graph Exploration for Efficient Visual Place Recognition
by: Chen, Shunpeng, et al.
Published: (2025)

Query-Based Adaptive Aggregation for Multi-Dataset Joint Training Toward Universal Visual Place Recognition
by: Xiao, Jiuhong, et al.
Published: (2025)

The Solution for the CVPR2023 NICE Image Captioning Challenge
by: Wu, Xiangyu, et al.
Published: (2023)