Saved in:
| Main Authors: | Sheng, Lei, Xu, Shuai-Shuai |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.05125 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ALScope: A Unified Toolkit for Deep Active Learning
by: Wu, Chenkai, et al.
Published: (2025)
by: Wu, Chenkai, et al.
Published: (2025)
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
by: Li, Minghan, et al.
Published: (2024)
by: Li, Minghan, et al.
Published: (2024)
Cross-view geo-localization, Image retrieval, Multiscale geometric modeling, Frequency domain enhancement
by: Zhang, Hongying, et al.
Published: (2026)
by: Zhang, Hongying, et al.
Published: (2026)
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
by: Wan, Jianqiang, et al.
Published: (2024)
by: Wan, Jianqiang, et al.
Published: (2024)
Self-Supervised Skeleton-Based Action Representation Learning: A Benchmark and Beyond
by: Zhang, Jiahang, et al.
Published: (2024)
by: Zhang, Jiahang, et al.
Published: (2024)
Active Learning for Multilingual Fingerspelling Corpora
by: Wang, Shuai, et al.
Published: (2023)
by: Wang, Shuai, et al.
Published: (2023)
Student Classroom Behavior Recognition Based on Improved YOLOv8s
by: Gao, Xiang, et al.
Published: (2026)
by: Gao, Xiang, et al.
Published: (2026)
MSCI: Addressing CLIP's Inherent Limitations for Compositional Zero-Shot Learning
by: Wang, Yue, et al.
Published: (2025)
by: Wang, Yue, et al.
Published: (2025)
GCT: Graph Co-Training for Semi-Supervised Few-Shot Learning
by: Xu, Rui, et al.
Published: (2022)
by: Xu, Rui, et al.
Published: (2022)
CPiRi: Channel Permutation-Invariant Relational Interaction for Multivariate Time Series Forecasting
by: Xu, Jiyuan, et al.
Published: (2026)
by: Xu, Jiyuan, et al.
Published: (2026)
Absolute-Unified Multi-Class Anomaly Detection via Class-Agnostic Distribution Alignment
by: Guo, Jia, et al.
Published: (2024)
by: Guo, Jia, et al.
Published: (2024)
The Evolution of Video Anomaly Detection: A Unified Framework from DNN to MLLM
by: Gao, Shibo, et al.
Published: (2025)
by: Gao, Shibo, et al.
Published: (2025)
Post-Processing Mask-Based Table Segmentation for Structural Coordinate Extraction
by: Bandara, Suren
Published: (2025)
by: Bandara, Suren
Published: (2025)
Wave-Particle (Continuous-Discrete) Dualistic Visual Tokenization for Unified Understanding and Generation
by: Chen, Yizhu, et al.
Published: (2025)
by: Chen, Yizhu, et al.
Published: (2025)
Financial Table Extraction in Image Documents
by: Watson, William, et al.
Published: (2024)
by: Watson, William, et al.
Published: (2024)
Forgedit: Text Guided Image Editing via Learning and Forgetting
by: Zhang, Shiwen, et al.
Published: (2023)
by: Zhang, Shiwen, et al.
Published: (2023)
EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation
by: Yang, Shiyuan, et al.
Published: (2026)
by: Yang, Shiyuan, et al.
Published: (2026)
CASP: Few-Shot Class-Incremental Learning with CLS Token Attention Steering Prompts
by: Huang, Shuai, et al.
Published: (2026)
by: Huang, Shuai, et al.
Published: (2026)
PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models
by: Zhang, Qiyuan, et al.
Published: (2026)
by: Zhang, Qiyuan, et al.
Published: (2026)
Unleashing the Power of Pre-trained Encoders for Universal Adversarial Attack Detection
by: Zhang, Yinghe, et al.
Published: (2025)
by: Zhang, Yinghe, et al.
Published: (2025)
EduStory: A Unified Framework for Pedagogically-Consistent Multi-Shot STEM Instructional Video Generation
by: Wu, Xinyi, et al.
Published: (2026)
by: Wu, Xinyi, et al.
Published: (2026)
Backbones-Review: Feature Extraction Networks for Deep Learning and Deep Reinforcement Learning Approaches
by: Elharrouss, Omar, et al.
Published: (2022)
by: Elharrouss, Omar, et al.
Published: (2022)
EDTalk++: Full Disentanglement for Controllable Talking Head Synthesis
by: Tan, Shuai, et al.
Published: (2025)
by: Tan, Shuai, et al.
Published: (2025)
UniDWM: Towards a Unified Driving World Model via Multifaceted Representation Learning
by: Liu, Shuai, et al.
Published: (2026)
by: Liu, Shuai, et al.
Published: (2026)
UniTable: Towards a Unified Framework for Table Recognition via Self-Supervised Pretraining
by: Peng, ShengYun, et al.
Published: (2024)
by: Peng, ShengYun, et al.
Published: (2024)
Slimmable Networks for Contrastive Self-supervised Learning
by: Zhao, Shuai, et al.
Published: (2022)
by: Zhao, Shuai, et al.
Published: (2022)
PROFIT: A Specialized Optimizer for Deep Fine Tuning
by: Chakravarthy, Anirudh S, et al.
Published: (2024)
by: Chakravarthy, Anirudh S, et al.
Published: (2024)
Random Registers for Cross-Domain Few-Shot Learning
by: Yi, Shuai, et al.
Published: (2025)
by: Yi, Shuai, et al.
Published: (2025)
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
by: Wang, Pengfei, et al.
Published: (2024)
by: Wang, Pengfei, et al.
Published: (2024)
Self-Supervised High Dynamic Range Imaging with Multi-Exposure Images in Dynamic Scenes
by: Zhang, Zhilu, et al.
Published: (2023)
by: Zhang, Zhilu, et al.
Published: (2023)
One Dinomaly2 Detect Them All: A Unified Framework for Full-Spectrum Unsupervised Anomaly Detection
by: Guo, Jia, et al.
Published: (2025)
by: Guo, Jia, et al.
Published: (2025)
Revisiting Continuity of Image Tokens for Cross-domain Few-shot Learning
by: Yi, Shuai, et al.
Published: (2025)
by: Yi, Shuai, et al.
Published: (2025)
High-Precision Fabric Defect Detection via Adaptive Shape Convolutions and Large Kernel Spatial Modeling
by: Wang, Shuai, et al.
Published: (2025)
by: Wang, Shuai, et al.
Published: (2025)
Language-based Image Colorization: A Benchmark and Beyond
by: Li, Yifan, et al.
Published: (2025)
by: Li, Yifan, et al.
Published: (2025)
Learning a Neural Association Network for Self-supervised Multi-Object Tracking
by: Li, Shuai, et al.
Published: (2024)
by: Li, Shuai, et al.
Published: (2024)
UniD-Shift: Towards Unified Semantic Segmentation via Interpretable Share-Private Multimodal Decomposition
by: Zhang, Shuai, et al.
Published: (2026)
by: Zhang, Shuai, et al.
Published: (2026)
Addressing Exacerbated Attention Sink for Source-Free Cross-Domain Few-Shot Learning
by: Yi, Shuai, et al.
Published: (2026)
by: Yi, Shuai, et al.
Published: (2026)
Multi-scale Semantic Prior Features Guided Deep Neural Network for Urban Street-view Image
by: Zeng, Jianshun, et al.
Published: (2024)
by: Zeng, Jianshun, et al.
Published: (2024)
Improving CLIP Adaptation by Breaking Tail Alignment for Source-Free Cross-Domain Few-Shot Learning
by: Yi, Shuai, et al.
Published: (2026)
by: Yi, Shuai, et al.
Published: (2026)
Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition
by: Li, Yu, et al.
Published: (2025)
by: Li, Yu, et al.
Published: (2025)
Similar Items
-
ALScope: A Unified Toolkit for Deep Active Learning
by: Wu, Chenkai, et al.
Published: (2025) -
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
by: Li, Minghan, et al.
Published: (2024) -
Cross-view geo-localization, Image retrieval, Multiscale geometric modeling, Frequency domain enhancement
by: Zhang, Hongying, et al.
Published: (2026) -
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
by: Wan, Jianqiang, et al.
Published: (2024) -
Self-Supervised Skeleton-Based Action Representation Learning: A Benchmark and Beyond
by: Zhang, Jiahang, et al.
Published: (2024)