Saved in:
| Main Authors: | Liao, Haojin, Song, Xiaolin, Zhao, Sicheng, Zhang, Shanghang, Yue, Xiangyu, Yao, Xingxu, Zhang, Yueming, Xing, Tengfei, Xu, Pengfei, Wang, Qiang |
|---|---|
| Format: | Preprint |
| Published: |
2021
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2110.14240 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SAMSON: 3rd Place Solution of LSVOS 2025 VOS Challenge
by: Xie, Yujie, et al.
Published: (2025)
by: Xie, Yujie, et al.
Published: (2025)
First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Atomic Activity Recognition 2024
by: Li, Ruyang, et al.
Published: (2024)
by: Li, Ruyang, et al.
Published: (2024)
3rd Place Solution to Large-scale Fine-grained Food Recognition
by: Zhong, Yang, et al.
Published: (2025)
by: Zhong, Yang, et al.
Published: (2025)
The Instance-centric Transformer for the RVOS Track of LSVOS Challenge: 3rd Place Solution
by: Cao, Bin, et al.
Published: (2024)
by: Cao, Bin, et al.
Published: (2024)
FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution
by: Wang, Mengjiao, et al.
Published: (2025)
by: Wang, Mengjiao, et al.
Published: (2025)
3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
by: Wu, Ruipu, et al.
Published: (2024)
by: Wu, Ruipu, et al.
Published: (2024)
First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024
by: Zhang, Tengfei, et al.
Published: (2024)
by: Zhang, Tengfei, et al.
Published: (2024)
LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS
by: Liu, Xinyu, et al.
Published: (2024)
by: Liu, Xinyu, et al.
Published: (2024)
MapVision: CVPR 2024 Autonomous Grand Challenge Mapless Driving Tech Report
by: Yang, Zhongyu, et al.
Published: (2024)
by: Yang, Zhongyu, et al.
Published: (2024)
3rd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation
by: Liu, Xinyu, et al.
Published: (2024)
by: Liu, Xinyu, et al.
Published: (2024)
3rd Place Solution to ICCV LargeFineFoodAI Retrieval
by: Zhong, Yang, et al.
Published: (2025)
by: Zhong, Yang, et al.
Published: (2025)
Second Place Solution of WSDM2023 Toloka Visual Question Answering Challenge
by: Wu, Xiangyu, et al.
Published: (2024)
by: Wu, Xiangyu, et al.
Published: (2024)
The 3rd Place Solution of CCIR CUP 2025: A Framework for Retrieval-Augmented Generation in Multi-Turn Legal Conversation
by: Li, Da, et al.
Published: (2025)
by: Li, Da, et al.
Published: (2025)
Multi-source Domain Adaptation for Panoramic Semantic Segmentation
by: Jiang, Jing, et al.
Published: (2024)
by: Jiang, Jing, et al.
Published: (2024)
A generalization of Reifenberg's theorem in R^N for flat cones
by: Liang, Xiangyu, et al.
Published: (2026)
by: Liang, Xiangyu, et al.
Published: (2026)
vFusedSeg3D: 3rd Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation
by: Amjad, Osama, et al.
Published: (2024)
by: Amjad, Osama, et al.
Published: (2024)
Enriched Feature Representation and Motion Prediction Module for MOSEv2 Track of 7th LSVOS Challenge: 3rd Place Solution
by: Lim, Chang Soo, et al.
Published: (2025)
by: Lim, Chang Soo, et al.
Published: (2025)
An Effective End-to-End Solution for Multimodal Action Recognition
by: Wang, Songping, et al.
Published: (2025)
by: Wang, Songping, et al.
Published: (2025)
First Place Solution of 2023 Global Artificial Intelligence Technology Innovation Competition Track 1
by: Wu, Xiangyu, et al.
Published: (2024)
by: Wu, Xiangyu, et al.
Published: (2024)
Generalized Categories Discovery for Long-tailed Recognition
by: Li, Ziyun, et al.
Published: (2023)
by: Li, Ziyun, et al.
Published: (2023)
1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
by: Chen, Mingjie, et al.
Published: (2024)
by: Chen, Mingjie, et al.
Published: (2024)
PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic Pituitary Surgery
by: Das, Adrito, et al.
Published: (2024)
by: Das, Adrito, et al.
Published: (2024)
AutoMR: A Universal Time Series Motion Recognition Pipeline
by: Zhang, Likun, et al.
Published: (2025)
by: Zhang, Likun, et al.
Published: (2025)
No Vision, No Wearables: 5G-based 2D Human Pose Recognition with Integrated Sensing and Communications
by: Li, Haojin, et al.
Published: (2025)
by: Li, Haojin, et al.
Published: (2025)
Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction
by: Yang, Senqiao, et al.
Published: (2023)
by: Yang, Senqiao, et al.
Published: (2023)
2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
by: Wu, Biao, et al.
Published: (2024)
by: Wu, Biao, et al.
Published: (2024)
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
by: Liu, Jiaming, et al.
Published: (2023)
by: Liu, Jiaming, et al.
Published: (2023)
3rd Place Solution for MeViS Track in CVPR 2024 PVUW workshop: Motion Expression guided Video Segmentation
by: Pan, Feiyu, et al.
Published: (2024)
by: Pan, Feiyu, et al.
Published: (2024)
More is Better: Deep Domain Adaptation with Multiple Sources
by: Zhao, Sicheng, et al.
Published: (2024)
by: Zhao, Sicheng, et al.
Published: (2024)
4th PVUW MeViS 3rd Place Report: Sa2VA
by: Yuan, Haobo, et al.
Published: (2025)
by: Yuan, Haobo, et al.
Published: (2025)
Learning from Different Samples: A Source-free Framework for Semi-supervised Domain Adaptation
by: Huang, Xinyang, et al.
Published: (2024)
by: Huang, Xinyang, et al.
Published: (2024)
METTL16 and YTHDC1 Regulate Spermatogonial Differentiation via m6A
by: Xueying Gu, et al.
Published: (2024)
by: Xueying Gu, et al.
Published: (2024)
PathRWKV: Enhancing Whole Slide Image Inference with Asymmetric Recurrent Modeling
by: Zhang, Tianyi, et al.
Published: (2025)
by: Zhang, Tianyi, et al.
Published: (2025)
Improving VisNet for Object Recognition
by: Serj, Mehdi Fatan, et al.
Published: (2025)
by: Serj, Mehdi Fatan, et al.
Published: (2025)
A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge -- Multi-Task Robustness Track
by: Chen, Zehui, et al.
Published: (2024)
by: Chen, Zehui, et al.
Published: (2024)
BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection
by: Liu, Jiaming, et al.
Published: (2022)
by: Liu, Jiaming, et al.
Published: (2022)
Texo: Formula Recognition within 20M Parameters
by: Mao, Sicheng
Published: (2026)
by: Mao, Sicheng
Published: (2026)
SAGE: Spatial-visual Adaptive Graph Exploration for Efficient Visual Place Recognition
by: Chen, Shunpeng, et al.
Published: (2025)
by: Chen, Shunpeng, et al.
Published: (2025)
Query-Based Adaptive Aggregation for Multi-Dataset Joint Training Toward Universal Visual Place Recognition
by: Xiao, Jiuhong, et al.
Published: (2025)
by: Xiao, Jiuhong, et al.
Published: (2025)
The Solution for the CVPR2023 NICE Image Captioning Challenge
by: Wu, Xiangyu, et al.
Published: (2023)
by: Wu, Xiangyu, et al.
Published: (2023)
Similar Items
-
SAMSON: 3rd Place Solution of LSVOS 2025 VOS Challenge
by: Xie, Yujie, et al.
Published: (2025) -
First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Atomic Activity Recognition 2024
by: Li, Ruyang, et al.
Published: (2024) -
3rd Place Solution to Large-scale Fine-grained Food Recognition
by: Zhong, Yang, et al.
Published: (2025) -
The Instance-centric Transformer for the RVOS Track of LSVOS Challenge: 3rd Place Solution
by: Cao, Bin, et al.
Published: (2024) -
FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution
by: Wang, Mengjiao, et al.
Published: (2025)