Saved in:
| Main Authors: | Qiao, Xiaozhen, Wang, Wenjia, Zhao, Zhiyuan, Sun, Jiacheng, Luo, Ping, Zhang, Hongyuan, Li, Xuelong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.23951 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Class-Aware Prototype Learning with Negative Contrast for Test-Time Adaptation of Vision-Language Models
by: Qiao, Xiaozhen, et al.
Published: (2025)
by: Qiao, Xiaozhen, et al.
Published: (2025)
ViewMask-1-to-3: Multi-View Consistent Image Generation via Multimodal Diffusion Models
by: Zhu, Ruishu, et al.
Published: (2025)
by: Zhu, Ruishu, et al.
Published: (2025)
ComKD-CLIP: Comprehensive Knowledge Distillation for Contrastive Language-Image Pre-traning Model
by: Chen, Yifan, et al.
Published: (2024)
by: Chen, Yifan, et al.
Published: (2024)
Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion
by: Zhang, Zeyu, et al.
Published: (2024)
by: Zhang, Zeyu, et al.
Published: (2024)
MTVCraft: Tokenizing 4D Motion for Arbitrary Character Animation
by: Ding, Yanbo, et al.
Published: (2025)
by: Ding, Yanbo, et al.
Published: (2025)
DRACO: Differentiable Reconstruction for Arbitrary CBCT Orbits
by: Ye, Chengze, et al.
Published: (2024)
by: Ye, Chengze, et al.
Published: (2024)
Multimodal Continual Learning with MLLMs from Multi-scenario Perspectives
by: Jiang, Kai, et al.
Published: (2025)
by: Jiang, Kai, et al.
Published: (2025)
SPAST: Arbitrary Style Transfer with Style Priors via Pre-trained Large-scale Model
by: Zhang, Zhanjie, et al.
Published: (2025)
by: Zhang, Zhanjie, et al.
Published: (2025)
PGAHum: Prior-Guided Geometry and Appearance Learning for High-Fidelity Animatable Human Reconstruction
by: Wang, Hao, et al.
Published: (2024)
by: Wang, Hao, et al.
Published: (2024)
VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding
by: Liang, Yujie, et al.
Published: (2024)
by: Liang, Yujie, et al.
Published: (2024)
Arbitrary Generative Video Interpolation
by: Zhang, Guozhen, et al.
Published: (2025)
by: Zhang, Guozhen, et al.
Published: (2025)
Mitigating Long-Tail Bias in HOI Detection via Adaptive Diversity Cache
by: Jiang, Yuqiu, et al.
Published: (2025)
by: Jiang, Yuqiu, et al.
Published: (2025)
4D Monocular Surgical Reconstruction under Arbitrary Camera Motions
by: Shan, Jiwei, et al.
Published: (2026)
by: Shan, Jiwei, et al.
Published: (2026)
DiffCamera: Arbitrary Refocusing on Images
by: Wang, Yiyang, et al.
Published: (2025)
by: Wang, Yiyang, et al.
Published: (2025)
MARIS: Marine Open-Vocabulary Instance Segmentation with Geometric Enhancement and Semantic Alignment
by: Li, Bingyu, et al.
Published: (2025)
by: Li, Bingyu, et al.
Published: (2025)
AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model
by: Chen, Yutian, et al.
Published: (2026)
by: Chen, Yutian, et al.
Published: (2026)
HuPrior3R: Incorporating Human Priors for Better 3D Dynamic Reconstruction from Monocular Videos
by: Xiong, Weitao, et al.
Published: (2025)
by: Xiong, Weitao, et al.
Published: (2025)
Enhance Vision-Language Alignment with Noise
by: Huang, Sida, et al.
Published: (2024)
by: Huang, Sida, et al.
Published: (2024)
CNN2GNN: How to Bridge CNN with GNN
by: Jiao, Ziheng, et al.
Published: (2024)
by: Jiao, Ziheng, et al.
Published: (2024)
Human Mesh Recovery from Arbitrary Multi-view Images
by: Li, Xiaoben, et al.
Published: (2024)
by: Li, Xiaoben, et al.
Published: (2024)
CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning
by: Huang, Haojian, et al.
Published: (2024)
by: Huang, Haojian, et al.
Published: (2024)
Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors
by: Shang, Wei, et al.
Published: (2024)
by: Shang, Wei, et al.
Published: (2024)
LucidFusion: Reconstructing 3D Gaussians with Arbitrary Unposed Images
by: He, Hao, et al.
Published: (2024)
by: He, Hao, et al.
Published: (2024)
EAGLE: Towards Efficient Arbitrary Referring Visual Prompts Comprehension for Multimodal Large Language Models
by: Zhang, Jiacheng, et al.
Published: (2024)
by: Zhang, Jiacheng, et al.
Published: (2024)
RESAnything: Attribute Prompting for Arbitrary Referring Segmentation
by: Wang, Ruiqi, et al.
Published: (2025)
by: Wang, Ruiqi, et al.
Published: (2025)
Arbitrary Ratio Feature Compression via Next Token Prediction
by: Liu, Yufan, et al.
Published: (2026)
by: Liu, Yufan, et al.
Published: (2026)
Arbitrary-Resolution and Arbitrary-Scale Face Super-Resolution with Implicit Representation Networks
by: Tsai, Yi Ting, et al.
Published: (2025)
by: Tsai, Yi Ting, et al.
Published: (2025)
Robust Phase-Shifting Profilometry for Arbitrary Motion
by: Zhang, Geyou, et al.
Published: (2025)
by: Zhang, Geyou, et al.
Published: (2025)
Universal Segmentation at Arbitrary Granularity with Language Instruction
by: Liu, Yong, et al.
Published: (2023)
by: Liu, Yong, et al.
Published: (2023)
Weak Supervision with Arbitrary Single Frame for Micro- and Macro-expression Spotting
by: Yu, Wang-Wang, et al.
Published: (2024)
by: Yu, Wang-Wang, et al.
Published: (2024)
Rethink Arbitrary Style Transfer with Transformer and Contrastive Learning
by: Zhang, Zhanjie, et al.
Published: (2024)
by: Zhang, Zhanjie, et al.
Published: (2024)
Hierarchical Neural Operator Transformer with Learnable Frequency-aware Loss Prior for Arbitrary-scale Super-resolution
by: Luo, Xihaier, et al.
Published: (2024)
by: Luo, Xihaier, et al.
Published: (2024)
ArtGen: Conditional Generative Modeling of Articulated Objects in Arbitrary Part-Level States
by: Wang, Haowen, et al.
Published: (2025)
by: Wang, Haowen, et al.
Published: (2025)
Styl3R: Instant 3D Stylized Reconstruction for Arbitrary Scenes and Styles
by: Wang, Peng, et al.
Published: (2025)
by: Wang, Peng, et al.
Published: (2025)
Object-AVEdit: An Object-level Audio-Visual Editing Model
by: Fu, Youquan, et al.
Published: (2025)
by: Fu, Youquan, et al.
Published: (2025)
Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning
by: Du, Hang, et al.
Published: (2024)
by: Du, Hang, et al.
Published: (2024)
DiffusionPoser: Real-time Human Motion Reconstruction From Arbitrary Sparse Sensors Using Autoregressive Diffusion
by: Van Wouwe, Tom, et al.
Published: (2023)
by: Van Wouwe, Tom, et al.
Published: (2023)
ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction
by: Huang, Ding-Jiun, et al.
Published: (2024)
by: Huang, Ding-Jiun, et al.
Published: (2024)
OmniScaleSR: Unleashing Scale-Controlled Diffusion Prior for Faithful and Realistic Arbitrary-Scale Image Super-Resolution
by: Chai, Xinning, et al.
Published: (2025)
by: Chai, Xinning, et al.
Published: (2025)
ASTRA: Let Arbitrary Subjects Transform in Video Editing
by: Shen, Fei, et al.
Published: (2025)
by: Shen, Fei, et al.
Published: (2025)
Similar Items
-
Class-Aware Prototype Learning with Negative Contrast for Test-Time Adaptation of Vision-Language Models
by: Qiao, Xiaozhen, et al.
Published: (2025) -
ViewMask-1-to-3: Multi-View Consistent Image Generation via Multimodal Diffusion Models
by: Zhu, Ruishu, et al.
Published: (2025) -
ComKD-CLIP: Comprehensive Knowledge Distillation for Contrastive Language-Image Pre-traning Model
by: Chen, Yifan, et al.
Published: (2024) -
Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion
by: Zhang, Zeyu, et al.
Published: (2024) -
MTVCraft: Tokenizing 4D Motion for Arbitrary Character Animation
by: Ding, Yanbo, et al.
Published: (2025)