Saved in:
| Main Authors: | Gao, Fang, Li, Xuetao, Wang, Jiabao, Ma, Shengheng, Yu, Jun |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.05762 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
by: Li, Kunchang, et al.
Published: (2022)
by: Li, Kunchang, et al.
Published: (2022)
Paying more attention to local contrast: improving infrared small target detection performance via prior knowledge
by: Wang, Peichao, et al.
Published: (2024)
by: Wang, Peichao, et al.
Published: (2024)
DistinctAD: Distinctive Audio Description Generation in Contexts
by: Fang, Bo, et al.
Published: (2024)
by: Fang, Bo, et al.
Published: (2024)
SceneLCM: End-to-End Layout-Guided Interactive Indoor Scene Generation with Latent Consistency Model
by: Lin, Yangkai, et al.
Published: (2025)
by: Lin, Yangkai, et al.
Published: (2025)
Finding Visual Task Vectors
by: Hojel, Alberto, et al.
Published: (2024)
by: Hojel, Alberto, et al.
Published: (2024)
Self-Supervised Selective-Guided Diffusion Model for Old-Photo Face Restoration
by: Li, Wenjie, et al.
Published: (2025)
by: Li, Wenjie, et al.
Published: (2025)
Geometry-Guided Self-Supervision for Ultra-Fine-Grained Recognition with Limited Data
by: Wang, Shijie, et al.
Published: (2026)
by: Wang, Shijie, et al.
Published: (2026)
Render-in-the-Loop: Vector Graphics Generation via Visual Self-Feedback
by: Liang, Guotao, et al.
Published: (2026)
by: Liang, Guotao, et al.
Published: (2026)
GazeCLIP: Gaze-Guided CLIP with Adaptive-Enhanced Fine-Grained Language Prompt for Deepfake Attribution and Detection
by: Zhang, Yaning, et al.
Published: (2026)
by: Zhang, Yaning, et al.
Published: (2026)
Stroke Modeling Enables Vectorized Character Generation with Large Vectorized Glyph Model
by: Zhang, Xinyue, et al.
Published: (2025)
by: Zhang, Xinyue, et al.
Published: (2025)
DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning
by: Wei, Jiabao, et al.
Published: (2024)
by: Wei, Jiabao, et al.
Published: (2024)
ERASOR++: Height Coding Plus Egocentric Ratio Based Dynamic Object Removal for Static Point Cloud Mapping
by: Zhang, Jiabao, et al.
Published: (2024)
by: Zhang, Jiabao, et al.
Published: (2024)
IVGF: The Fusion-Guided Infrared and Visible General Framework
by: Liu, Fangcen, et al.
Published: (2024)
by: Liu, Fangcen, et al.
Published: (2024)
GeoDiffMM: Geometry-Guided Conditional Diffusion for Motion Magnification
by: Liu, Xuedeng, et al.
Published: (2025)
by: Liu, Xuedeng, et al.
Published: (2025)
HoliTracer: Holistic Vectorization of Geographic Objects from Large-Size Remote Sensing Imagery
by: Wang, Yu, et al.
Published: (2025)
by: Wang, Yu, et al.
Published: (2025)
LottieGPT: Tokenizing Vector Animation for Autoregressive Generation
by: Chen, Junhao, et al.
Published: (2026)
by: Chen, Junhao, et al.
Published: (2026)
GCA-ResUNet:Image segmentation in medical images using grouped coordinate attention
by: Ding, Jun, et al.
Published: (2025)
by: Ding, Jun, et al.
Published: (2025)
Epoch-evolving Gaussian Process Guided Learning
by: Cui, Jiabao, et al.
Published: (2020)
by: Cui, Jiabao, et al.
Published: (2020)
Open-Attribute Recognition for Person Retrieval: Finding People Through Distinctive and Novel Attributes
by: Park, Minjeong, et al.
Published: (2025)
by: Park, Minjeong, et al.
Published: (2025)
Skeleton-Guided Instance Separation for Fine-Grained Segmentation in Microscopy
by: Wang, Jun, et al.
Published: (2024)
by: Wang, Jun, et al.
Published: (2024)
Xuanwu: Evolving General Multimodal Models into an Industrial-Grade Foundation for Content Ecosystems
by: Zhang, Zhiqian, et al.
Published: (2026)
by: Zhang, Zhiqian, et al.
Published: (2026)
OmniSVG: A Unified Scalable Vector Graphics Generation Model
by: Yang, Yiying, et al.
Published: (2025)
by: Yang, Yiying, et al.
Published: (2025)
AMSA-UNet: An Asymmetric Multiple Scales U-net Based on Self-attention for Deblurring
by: Wang, Yingying
Published: (2024)
by: Wang, Yingying
Published: (2024)
Self-Guidance: Boosting Flow and Diffusion Generation on Their Own
by: Li, Tiancheng, et al.
Published: (2024)
by: Li, Tiancheng, et al.
Published: (2024)
Frequency Error-Guided Under-sampling Optimization for Multi-Contrast MRI Reconstruction
by: Fang, Xinming, et al.
Published: (2026)
by: Fang, Xinming, et al.
Published: (2026)
SceneCraft: Layout-Guided 3D Scene Generation
by: Yang, Xiuyu, et al.
Published: (2024)
by: Yang, Xiuyu, et al.
Published: (2024)
Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation
by: Zhou, Yupeng, et al.
Published: (2026)
by: Zhou, Yupeng, et al.
Published: (2026)
MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction
by: Liu, Xiaolu, et al.
Published: (2024)
by: Liu, Xiaolu, et al.
Published: (2024)
EIANet: A Novel Domain Adaptation Approach to Maximize Class Distinction with Neural Collapse Principles
by: Pan, Zicheng, et al.
Published: (2024)
by: Pan, Zicheng, et al.
Published: (2024)
Emulating Self-attention with Convolution for Efficient Image Super-Resolution
by: Lee, Dongheon, et al.
Published: (2025)
by: Lee, Dongheon, et al.
Published: (2025)
Guided Real Image Dehazing using YCbCr Color Space
by: Fang, Wenxuan, et al.
Published: (2024)
by: Fang, Wenxuan, et al.
Published: (2024)
UniVector: Unified Vector Extraction via Instance-Geometry Interaction
by: Yan, Yinglong, et al.
Published: (2025)
by: Yan, Yinglong, et al.
Published: (2025)
Fine-Tuning Stable Diffusion XL for Stylistic Icon Generation: A Comparison of Caption Size
by: Sultan, Youssef, et al.
Published: (2024)
by: Sultan, Youssef, et al.
Published: (2024)
SDPose: Tokenized Pose Estimation via Circulation-Guide Self-Distillation
by: Chen, Sichen, et al.
Published: (2024)
by: Chen, Sichen, et al.
Published: (2024)
Domain Generalization for Face Anti-spoofing via Content-aware Composite Prompt Engineering
by: Guo, Jiabao, et al.
Published: (2025)
by: Guo, Jiabao, et al.
Published: (2025)
Exploring Diversity-based Active Learning for 3D Object Detection in Autonomous Driving
by: Lin, Jinpeng, et al.
Published: (2022)
by: Lin, Jinpeng, et al.
Published: (2022)
Uncertainty Guided Refinement for Fine-Grained Salient Object Detection
by: Yuan, Yao, et al.
Published: (2025)
by: Yuan, Yao, et al.
Published: (2025)
Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation
by: Yue, Xiaoyu, et al.
Published: (2025)
by: Yue, Xiaoyu, et al.
Published: (2025)
CAD-Judge: Toward Efficient Morphological Grading and Verification for Text-to-CAD Generation
by: Zhou, Zheyuan, et al.
Published: (2025)
by: Zhou, Zheyuan, et al.
Published: (2025)
Exploring the Distinctiveness and Fidelity of the Descriptions Generated by Large Vision-Language Models
by: Huang, Yuhang, et al.
Published: (2024)
by: Huang, Yuhang, et al.
Published: (2024)
Similar Items
-
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
by: Li, Kunchang, et al.
Published: (2022) -
Paying more attention to local contrast: improving infrared small target detection performance via prior knowledge
by: Wang, Peichao, et al.
Published: (2024) -
DistinctAD: Distinctive Audio Description Generation in Contexts
by: Fang, Bo, et al.
Published: (2024) -
SceneLCM: End-to-End Layout-Guided Interactive Indoor Scene Generation with Latent Consistency Model
by: Lin, Yangkai, et al.
Published: (2025) -
Finding Visual Task Vectors
by: Hojel, Alberto, et al.
Published: (2024)