:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Sheng, Lei, Xu, Shuai-Shuai
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2409.05125
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ALScope: A Unified Toolkit for Deep Active Learning
by: Wu, Chenkai, et al.
Published: (2025)

UniVS: Unified and Universal Video Segmentation with Prompts as Queries
by: Li, Minghan, et al.
Published: (2024)

Cross-view geo-localization, Image retrieval, Multiscale geometric modeling, Frequency domain enhancement
by: Zhang, Hongying, et al.
Published: (2026)

OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
by: Wan, Jianqiang, et al.
Published: (2024)

Self-Supervised Skeleton-Based Action Representation Learning: A Benchmark and Beyond
by: Zhang, Jiahang, et al.
Published: (2024)

Active Learning for Multilingual Fingerspelling Corpora
by: Wang, Shuai, et al.
Published: (2023)

Student Classroom Behavior Recognition Based on Improved YOLOv8s
by: Gao, Xiang, et al.
Published: (2026)

MSCI: Addressing CLIP's Inherent Limitations for Compositional Zero-Shot Learning
by: Wang, Yue, et al.
Published: (2025)

GCT: Graph Co-Training for Semi-Supervised Few-Shot Learning
by: Xu, Rui, et al.
Published: (2022)

CPiRi: Channel Permutation-Invariant Relational Interaction for Multivariate Time Series Forecasting
by: Xu, Jiyuan, et al.
Published: (2026)

Absolute-Unified Multi-Class Anomaly Detection via Class-Agnostic Distribution Alignment
by: Guo, Jia, et al.
Published: (2024)

The Evolution of Video Anomaly Detection: A Unified Framework from DNN to MLLM
by: Gao, Shibo, et al.
Published: (2025)

Post-Processing Mask-Based Table Segmentation for Structural Coordinate Extraction
by: Bandara, Suren
Published: (2025)

Wave-Particle (Continuous-Discrete) Dualistic Visual Tokenization for Unified Understanding and Generation
by: Chen, Yizhu, et al.
Published: (2025)

Financial Table Extraction in Image Documents
by: Watson, William, et al.
Published: (2024)

Forgedit: Text Guided Image Editing via Learning and Forgetting
by: Zhang, Shiwen, et al.
Published: (2023)

EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation
by: Yang, Shiyuan, et al.
Published: (2026)

CASP: Few-Shot Class-Incremental Learning with CLS Token Attention Steering Prompts
by: Huang, Shuai, et al.
Published: (2026)

PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models
by: Zhang, Qiyuan, et al.
Published: (2026)

Unleashing the Power of Pre-trained Encoders for Universal Adversarial Attack Detection
by: Zhang, Yinghe, et al.
Published: (2025)

EduStory: A Unified Framework for Pedagogically-Consistent Multi-Shot STEM Instructional Video Generation
by: Wu, Xinyi, et al.
Published: (2026)

Backbones-Review: Feature Extraction Networks for Deep Learning and Deep Reinforcement Learning Approaches
by: Elharrouss, Omar, et al.
Published: (2022)

EDTalk++: Full Disentanglement for Controllable Talking Head Synthesis
by: Tan, Shuai, et al.
Published: (2025)

UniDWM: Towards a Unified Driving World Model via Multifaceted Representation Learning
by: Liu, Shuai, et al.
Published: (2026)

UniTable: Towards a Unified Framework for Table Recognition via Self-Supervised Pretraining
by: Peng, ShengYun, et al.
Published: (2024)

Slimmable Networks for Contrastive Self-supervised Learning
by: Zhao, Shuai, et al.
Published: (2022)

PROFIT: A Specialized Optimizer for Deep Fine Tuning
by: Chakravarthy, Anirudh S, et al.
Published: (2024)

Random Registers for Cross-Domain Few-Shot Learning
by: Yi, Shuai, et al.
Published: (2025)

Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
by: Wang, Pengfei, et al.
Published: (2024)

Self-Supervised High Dynamic Range Imaging with Multi-Exposure Images in Dynamic Scenes
by: Zhang, Zhilu, et al.
Published: (2023)

One Dinomaly2 Detect Them All: A Unified Framework for Full-Spectrum Unsupervised Anomaly Detection
by: Guo, Jia, et al.
Published: (2025)

Revisiting Continuity of Image Tokens for Cross-domain Few-shot Learning
by: Yi, Shuai, et al.
Published: (2025)

High-Precision Fabric Defect Detection via Adaptive Shape Convolutions and Large Kernel Spatial Modeling
by: Wang, Shuai, et al.
Published: (2025)

Language-based Image Colorization: A Benchmark and Beyond
by: Li, Yifan, et al.
Published: (2025)

Learning a Neural Association Network for Self-supervised Multi-Object Tracking
by: Li, Shuai, et al.
Published: (2024)

UniD-Shift: Towards Unified Semantic Segmentation via Interpretable Share-Private Multimodal Decomposition
by: Zhang, Shuai, et al.
Published: (2026)

Addressing Exacerbated Attention Sink for Source-Free Cross-Domain Few-Shot Learning
by: Yi, Shuai, et al.
Published: (2026)

Multi-scale Semantic Prior Features Guided Deep Neural Network for Urban Street-view Image
by: Zeng, Jianshun, et al.
Published: (2024)

Improving CLIP Adaptation by Breaking Tail Alignment for Source-Free Cross-Domain Few-Shot Learning
by: Yi, Shuai, et al.
Published: (2026)

Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition
by: Li, Yu, et al.
Published: (2025)