Saved in:
| Main Authors: | Cheng, Ailing, Ye, Jiaojiao, Yang, Fei, Lu, Shufang, Gao, Fei |
|---|---|
| Format: | Preprint |
| Published: |
2021
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2112.09873 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Chinese Continuous Sign Language Dataset Based on Complex Environments
by: Zhu, Qidan, et al.
Published: (2024)
by: Zhu, Qidan, et al.
Published: (2024)
RT-DETR++ for UAV Object Detection
by: Shufang, Yuan
Published: (2025)
by: Shufang, Yuan
Published: (2025)
VideoGen-Eval: Agent-based System for Video Generation Evaluation
by: Yang, Yuhang, et al.
Published: (2025)
by: Yang, Yuhang, et al.
Published: (2025)
Characterization of dim light response in DVS pixel: Discontinuity of event triggering time
by: Jiang, Xiao, et al.
Published: (2024)
by: Jiang, Xiao, et al.
Published: (2024)
State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend
by: Cui, Fei, et al.
Published: (2024)
by: Cui, Fei, et al.
Published: (2024)
GeoAuxNet: Towards Universal 3D Representation Learning for Multi-sensor Point Clouds
by: Zhang, Shengjun, et al.
Published: (2024)
by: Zhang, Shengjun, et al.
Published: (2024)
UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation
by: Guo, Qin, et al.
Published: (2025)
by: Guo, Qin, et al.
Published: (2025)
FA-Depth: Toward Fast and Accurate Self-supervised Monocular Depth Estimation
by: Wang, Fei, et al.
Published: (2024)
by: Wang, Fei, et al.
Published: (2024)
FingerSplat: Contactless Fingerprint 3D Reconstruction and Generation based on 3D Gaussian Splatting
by: Jia, Yuwei, et al.
Published: (2025)
by: Jia, Yuwei, et al.
Published: (2025)
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
by: Zeng, Ailing, et al.
Published: (2024)
by: Zeng, Ailing, et al.
Published: (2024)
X-Pose: Detecting Any Keypoints
by: Yang, Jie, et al.
Published: (2023)
by: Yang, Jie, et al.
Published: (2023)
AeroDeshadow: Physics-Guided Shadow Synthesis and Penumbra-Aware Deshadowing for Aerospace Imagery
by: Lu, Wei, et al.
Published: (2026)
by: Lu, Wei, et al.
Published: (2026)
UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios
by: Ye, Tian, et al.
Published: (2025)
by: Ye, Tian, et al.
Published: (2025)
PQD: Post-training Quantization for Efficient Diffusion Models
by: Ye, Jiaojiao, et al.
Published: (2024)
by: Ye, Jiaojiao, et al.
Published: (2024)
Generalized W-Net: Arbitrary-style Chinese Character Synthesization
by: Jiang, Haochuan, et al.
Published: (2024)
by: Jiang, Haochuan, et al.
Published: (2024)
TPCNet: Triple physical constraints for Low-light Image Enhancement
by: Shi, Jing-Yi, et al.
Published: (2025)
by: Shi, Jing-Yi, et al.
Published: (2025)
SHIFT: Stochastic Hidden-Trajectory Deflection for Removing Diffusion-based Watermark
by: Bao, Rui, et al.
Published: (2026)
by: Bao, Rui, et al.
Published: (2026)
Data Extrapolation for Text-to-image Generation on Small Datasets
by: Ye, Senmao, et al.
Published: (2024)
by: Ye, Senmao, et al.
Published: (2024)
Bridging the Gap between Human Motion and Action Semantics via Kinematic Phrases
by: Liu, Xinpeng, et al.
Published: (2023)
by: Liu, Xinpeng, et al.
Published: (2023)
TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction
by: Liu, Yunfei, et al.
Published: (2025)
by: Liu, Yunfei, et al.
Published: (2025)
L-CLIPScore: a Lightweight Embedding-based Captioning Metric for Evaluating and Training
by: Li, Li, et al.
Published: (2025)
by: Li, Li, et al.
Published: (2025)
Robust Brain Tumor Segmentation with Incomplete MRI Modalities Using Hölder Divergence and Mutual Information-Enhanced Knowledge Transfer
by: Cheng, Runze, et al.
Published: (2025)
by: Cheng, Runze, et al.
Published: (2025)
Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts
by: Long, Jinqiang, et al.
Published: (2024)
by: Long, Jinqiang, et al.
Published: (2024)
Hierarchical IoU Tracking based on Interval
by: Du, Yunhao, et al.
Published: (2024)
by: Du, Yunhao, et al.
Published: (2024)
Smartphone-based Circular Plot Sampling for Forest Inventory
by: Sun, Su, et al.
Published: (2026)
by: Sun, Su, et al.
Published: (2026)
Open-World Human-Object Interaction Detection via Multi-modal Prompts
by: Yang, Jie, et al.
Published: (2024)
by: Yang, Jie, et al.
Published: (2024)
Knowledge-guided Causal Intervention for Weakly-supervised Object Localization
by: Shao, Feifei, et al.
Published: (2023)
by: Shao, Feifei, et al.
Published: (2023)
LucidFlux: Caption-Free Photo-Realistic Image Restoration via a Large-Scale Diffusion Transformer
by: Fei, Song, et al.
Published: (2025)
by: Fei, Song, et al.
Published: (2025)
Layered Rendering Diffusion Model for Controllable Zero-Shot Image Synthesis
by: Qi, Zipeng, et al.
Published: (2023)
by: Qi, Zipeng, et al.
Published: (2023)
SegChange-R1: LLM-Augmented Remote Sensing Change Detection
by: Zhou, Fei
Published: (2025)
by: Zhou, Fei
Published: (2025)
Multi-scale Unified Network for Image Classification
by: Liu, Wenzhuo, et al.
Published: (2024)
by: Liu, Wenzhuo, et al.
Published: (2024)
Towards Non-Exemplar Semi-Supervised Class-Incremental Learning
by: Liu, Wenzhuo, et al.
Published: (2024)
by: Liu, Wenzhuo, et al.
Published: (2024)
YOLO-TLA: An Efficient and Lightweight Small Object Detection Model based on YOLOv5
by: Ji, Chun-Lin, et al.
Published: (2024)
by: Ji, Chun-Lin, et al.
Published: (2024)
LAVA: Layered Audio-Visual Anti-tampering Watermarking for Robust Deepfake Detection and Localization
by: Zeng, Bokang, et al.
Published: (2026)
by: Zeng, Bokang, et al.
Published: (2026)
ConText: Driving In-context Learning for Text Removal and Segmentation
by: Zhang, Fei, et al.
Published: (2025)
by: Zhang, Fei, et al.
Published: (2025)
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
by: Yang, Jihan, et al.
Published: (2024)
by: Yang, Jihan, et al.
Published: (2024)
Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
by: Shen, Fei, et al.
Published: (2023)
by: Shen, Fei, et al.
Published: (2023)
Sharpness-aware Dynamic Anchor Selection for Generalized Category Discovery
by: Peng, Zhimao, et al.
Published: (2025)
by: Peng, Zhimao, et al.
Published: (2025)
A Deep Equilibrium Network for Hyperspectral Unmixing
by: Wang, Chentong, et al.
Published: (2026)
by: Wang, Chentong, et al.
Published: (2026)
Fast Sparse View Guided NeRF Update for Object Reconfigurations
by: Lu, Ziqi, et al.
Published: (2024)
by: Lu, Ziqi, et al.
Published: (2024)
Similar Items
-
A Chinese Continuous Sign Language Dataset Based on Complex Environments
by: Zhu, Qidan, et al.
Published: (2024) -
RT-DETR++ for UAV Object Detection
by: Shufang, Yuan
Published: (2025) -
VideoGen-Eval: Agent-based System for Video Generation Evaluation
by: Yang, Yuhang, et al.
Published: (2025) -
Characterization of dim light response in DVS pixel: Discontinuity of event triggering time
by: Jiang, Xiao, et al.
Published: (2024) -
State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend
by: Cui, Fei, et al.
Published: (2024)