:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lin, Qing, Zhang, Jingfeng, Ong, Yew-Soon, Zhang, Mengmi
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2403.08255
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Lightweight and Accurate Multi-View Stereo with Confidence-Aware Diffusion Model
by: Wang, Fangjinhua, et al.
Published: (2025)

Learning to See Through a Baby's Eyes: Early Visual Diets Enable Robust Visual Intelligence in Humans and Machines
by: Cai, Yusen, et al.
Published: (2025)

Fine-grained Abnormality Prompt Learning for Zero-shot Anomaly Detection
by: Zhu, Jiawen, et al.
Published: (2024)

EmoEdit: Evoking Emotions through Image Manipulation
by: Yang, Jingyuan, et al.
Published: (2024)

Agentic Spatio-Temporal Grounding via Collaborative Reasoning
by: Zhao, Heng, et al.
Published: (2026)

Hierarchically Robust Zero-shot Vision-language Models
by: Dong, Junhao, et al.
Published: (2026)

Prototype Optimization with Neural ODE for Few-Shot Learning
by: Zhang, Baoquan, et al.
Published: (2024)

Few-shot NeRF by Adaptive Rendering Loss Regularization
by: Xu, Qingshan, et al.
Published: (2024)

Deterministic-to-Stochastic Diverse Latent Feature Mapping for Human Motion Synthesis
by: Hua, Yu, et al.
Published: (2025)

SiamNAS: Siamese Surrogate Model for Dominance Relation Prediction in Multi-objective Neural Architecture Search
by: Zhou, Yuyang, et al.
Published: (2025)

Precise-Physics Driven Text-to-3D Generation
by: Xu, Qingshan, et al.
Published: (2024)

Learning to Perceive "Where": Spatial Pretext Tasks for Robust Self-Supervised Learning
by: Shen, Yang, et al.
Published: (2026)

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
by: Xie, Jiahao, et al.
Published: (2023)

Enhancing Adversarial Robustness via Uncertainty-Aware Distributional Adversarial Training
by: Dong, Junhao, et al.
Published: (2024)

Video Set Distillation: Information Diversification and Temporal Densification
by: Zhao, Yinjie, et al.
Published: (2024)

Pushing Rendering Boundaries: Hard Gaussian Splatting
by: Xu, Qingshan, et al.
Published: (2024)

Possibilistic Predictive Uncertainty for Deep Learning
by: Ni, Yao, et al.
Published: (2026)

Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion Perception
by: Han, Shuangpeng, et al.
Published: (2024)

Pose Prior Learner: Unsupervised Categorical Prior Learning for Pose Estimation
by: Wang, Ziyu, et al.
Published: (2024)

PRISM: Progressive Reasoning through Iterative Slot Memory for Vision
by: Wang, Ziyu, et al.
Published: (2026)

Adaptive Visual Scene Understanding: Incremental Scene Graph Generation
by: Khandelwal, Naitik, et al.
Published: (2023)

LLM-to-Phy3D: Physically Conform Online 3D Object Generation with LLMs
by: Wong, Melvin, et al.
Published: (2025)

EmoAttack: Emotion-to-Image Diffusion Models for Emotional Backdoor Generation
by: Wei, Tianyu, et al.
Published: (2024)

Pix2Fact: When Vision Is Not Enough -- Benchmarking Fine-Grained VQA with Web Verification on High-Resolution Real-World Scenes
by: Jiang, Yifan, et al.
Published: (2026)

Generative AI-based Prompt Evolution Engineering Design Optimization With Vision-Language Model
by: Wong, Melvin, et al.
Published: (2024)

Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction
by: Zhang, Zhengquan, et al.
Published: (2025)

A Plug-and-Play Multi-Criteria Guidance for Diverse In-Betweening Human Motion Generation
by: Yu, Hua, et al.
Published: (2025)

Dynamic-Aware Video Distillation: Optimizing Temporal Resolution Based on Video Semantics
by: Zhao, Yinjie, et al.
Published: (2025)

MMT-ARD: Multimodal Multi-Teacher Adversarial Distillation for Robust Vision-Language Models
by: Li, Yuqi, et al.
Published: (2025)

Unforgettable Lessons from Forgettable Images: Intra-Class Memorability Matters in Computer Vision
by: Jing, Jie, et al.
Published: (2024)

EEmo-Logic: A Unified Dataset and Multi-Stage Framework for Comprehensive Image-Evoked Emotion Assessment
by: Gao, Lancheng, et al.
Published: (2026)

Hard-Label Black-Box Attacks on 3D Point Clouds
by: Liu, Daizong, et al.
Published: (2024)

EmoDiffusion: Enhancing Emotional 3D Facial Animation with Latent Diffusion Models
by: Zhang, Yixuan, et al.
Published: (2025)

NeuSpring: Neural Spring Fields for Reconstruction and Simulation of Deformable Objects from Videos
by: Xu, Qingshan, et al.
Published: (2025)

ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers
by: Ma, Yiyang, et al.
Published: (2025)

LLM2TEA: An Agentic AI Designer for Discovery with Generative Evolutionary Multitasking
by: Wong, Melvin, et al.
Published: (2024)

Seeing Through Uncertainty: A Free-Energy Approach for Real-Time Perceptual Adaptation in Robust Visual Navigation
by: Piriyajitakonkij, Maytus, et al.
Published: (2024)

Preserving Image Properties Through Initializations in Diffusion Models
by: Zhang, Jeffrey, et al.
Published: (2024)

Show Me: Unifying Instructional Image and Video Generation with Diffusion Models
by: Pu, Yujiang, et al.
Published: (2025)

Unveiling the Tapestry: the Interplay of Generalization and Forgetting in Continual Learning
by: Shi, Zenglin, et al.
Published: (2022)