:: Library Catalog

Bibliographic Details
Main Authors:	Cheng, Ailing; Ye, Jiaojiao; Yang, Fei; Lu, Shufang; Gao, Fei
Format:	Preprint
Published:	2021
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2112.09873

Similar Items

A Chinese Continuous Sign Language Dataset Based on Complex Environments
by: Zhu, Qidan, et al.
Published: (2024)

RT-DETR++ for UAV Object Detection
by: Shufang, Yuan
Published: (2025)

VideoGen-Eval: Agent-based System for Video Generation Evaluation
by: Yang, Yuhang, et al.
Published: (2025)

Characterization of dim light response in DVS pixel: Discontinuity of event triggering time
by: Jiang, Xiao, et al.
Published: (2024)

State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend
by: Cui, Fei, et al.
Published: (2024)

GeoAuxNet: Towards Universal 3D Representation Learning for Multi-sensor Point Clouds
by: Zhang, Shengjun, et al.
Published: (2024)

UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation
by: Guo, Qin, et al.
Published: (2025)

FA-Depth: Toward Fast and Accurate Self-supervised Monocular Depth Estimation
by: Wang, Fei, et al.
Published: (2024)

FingerSplat: Contactless Fingerprint 3D Reconstruction and Generation based on 3D Gaussian Splatting
by: Jia, Yuwei, et al.
Published: (2025)

The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
by: Zeng, Ailing, et al.
Published: (2024)

X-Pose: Detecting Any Keypoints
by: Yang, Jie, et al.
Published: (2023)

AeroDeshadow: Physics-Guided Shadow Synthesis and Penumbra-Aware Deshadowing for Aerospace Imagery
by: Lu, Wei, et al.
Published: (2026)

UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios
by: Ye, Tian, et al.
Published: (2025)

PQD: Post-training Quantization for Efficient Diffusion Models
by: Ye, Jiaojiao, et al.
Published: (2024)

Generalized W-Net: Arbitrary-style Chinese Character Synthesization
by: Jiang, Haochuan, et al.
Published: (2024)

TPCNet: Triple physical constraints for Low-light Image Enhancement
by: Shi, Jing-Yi, et al.
Published: (2025)

SHIFT: Stochastic Hidden-Trajectory Deflection for Removing Diffusion-based Watermark
by: Bao, Rui, et al.
Published: (2026)

Data Extrapolation for Text-to-image Generation on Small Datasets
by: Ye, Senmao, et al.
Published: (2024)

Bridging the Gap between Human Motion and Action Semantics via Kinematic Phrases
by: Liu, Xinpeng, et al.
Published: (2023)

TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction
by: Liu, Yunfei, et al.
Published: (2025)

L-CLIPScore: a Lightweight Embedding-based Captioning Metric for Evaluating and Training
by: Li, Li, et al.
Published: (2025)

Robust Brain Tumor Segmentation with Incomplete MRI Modalities Using Hölder Divergence and Mutual Information-Enhanced Knowledge Transfer
by: Cheng, Runze, et al.
Published: (2025)

Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts
by: Long, Jinqiang, et al.
Published: (2024)

Hierarchical IoU Tracking based on Interval
by: Du, Yunhao, et al.
Published: (2024)

Smartphone-based Circular Plot Sampling for Forest Inventory
by: Sun, Su, et al.
Published: (2026)

Open-World Human-Object Interaction Detection via Multi-modal Prompts
by: Yang, Jie, et al.
Published: (2024)

Knowledge-guided Causal Intervention for Weakly-supervised Object Localization
by: Shao, Feifei, et al.
Published: (2023)

LucidFlux: Caption-Free Photo-Realistic Image Restoration via a Large-Scale Diffusion Transformer
by: Fei, Song, et al.
Published: (2025)

Layered Rendering Diffusion Model for Controllable Zero-Shot Image Synthesis
by: Qi, Zipeng, et al.
Published: (2023)

SegChange-R1: LLM-Augmented Remote Sensing Change Detection
by: Zhou, Fei
Published: (2025)

Multi-scale Unified Network for Image Classification
by: Liu, Wenzhuo, et al.
Published: (2024)

Towards Non-Exemplar Semi-Supervised Class-Incremental Learning
by: Liu, Wenzhuo, et al.
Published: (2024)

YOLO-TLA: An Efficient and Lightweight Small Object Detection Model based on YOLOv5
by: Ji, Chun-Lin, et al.
Published: (2024)

LAVA: Layered Audio-Visual Anti-tampering Watermarking for Robust Deepfake Detection and Localization
by: Zeng, Bokang, et al.
Published: (2026)

ConText: Driving In-context Learning for Text Removal and Segmentation
by: Zhang, Fei, et al.
Published: (2025)

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
by: Yang, Jihan, et al.
Published: (2024)

Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
by: Shen, Fei, et al.
Published: (2023)

Sharpness-aware Dynamic Anchor Selection for Generalized Category Discovery
by: Peng, Zhimao, et al.
Published: (2025)

A Deep Equilibrium Network for Hyperspectral Unmixing
by: Wang, Chentong, et al.
Published: (2026)

Fast Sparse View Guided NeRF Update for Object Reconfigurations
by: Lu, Ziqi, et al.
Published: (2024)