Saved in:
| Main Authors: | Lin, Qing, Zhang, Jingfeng, Ong, Yew-Soon, Zhang, Mengmi |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.08255 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Lightweight and Accurate Multi-View Stereo with Confidence-Aware Diffusion Model
by: Wang, Fangjinhua, et al.
Published: (2025)
by: Wang, Fangjinhua, et al.
Published: (2025)
Learning to See Through a Baby's Eyes: Early Visual Diets Enable Robust Visual Intelligence in Humans and Machines
by: Cai, Yusen, et al.
Published: (2025)
by: Cai, Yusen, et al.
Published: (2025)
Fine-grained Abnormality Prompt Learning for Zero-shot Anomaly Detection
by: Zhu, Jiawen, et al.
Published: (2024)
by: Zhu, Jiawen, et al.
Published: (2024)
EmoEdit: Evoking Emotions through Image Manipulation
by: Yang, Jingyuan, et al.
Published: (2024)
by: Yang, Jingyuan, et al.
Published: (2024)
Agentic Spatio-Temporal Grounding via Collaborative Reasoning
by: Zhao, Heng, et al.
Published: (2026)
by: Zhao, Heng, et al.
Published: (2026)
Hierarchically Robust Zero-shot Vision-language Models
by: Dong, Junhao, et al.
Published: (2026)
by: Dong, Junhao, et al.
Published: (2026)
Prototype Optimization with Neural ODE for Few-Shot Learning
by: Zhang, Baoquan, et al.
Published: (2024)
by: Zhang, Baoquan, et al.
Published: (2024)
Few-shot NeRF by Adaptive Rendering Loss Regularization
by: Xu, Qingshan, et al.
Published: (2024)
by: Xu, Qingshan, et al.
Published: (2024)
Deterministic-to-Stochastic Diverse Latent Feature Mapping for Human Motion Synthesis
by: Hua, Yu, et al.
Published: (2025)
by: Hua, Yu, et al.
Published: (2025)
SiamNAS: Siamese Surrogate Model for Dominance Relation Prediction in Multi-objective Neural Architecture Search
by: Zhou, Yuyang, et al.
Published: (2025)
by: Zhou, Yuyang, et al.
Published: (2025)
Precise-Physics Driven Text-to-3D Generation
by: Xu, Qingshan, et al.
Published: (2024)
by: Xu, Qingshan, et al.
Published: (2024)
Learning to Perceive "Where": Spatial Pretext Tasks for Robust Self-Supervised Learning
by: Shen, Yang, et al.
Published: (2026)
by: Shen, Yang, et al.
Published: (2026)
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
by: Xie, Jiahao, et al.
Published: (2023)
by: Xie, Jiahao, et al.
Published: (2023)
Enhancing Adversarial Robustness via Uncertainty-Aware Distributional Adversarial Training
by: Dong, Junhao, et al.
Published: (2024)
by: Dong, Junhao, et al.
Published: (2024)
Video Set Distillation: Information Diversification and Temporal Densification
by: Zhao, Yinjie, et al.
Published: (2024)
by: Zhao, Yinjie, et al.
Published: (2024)
Pushing Rendering Boundaries: Hard Gaussian Splatting
by: Xu, Qingshan, et al.
Published: (2024)
by: Xu, Qingshan, et al.
Published: (2024)
Possibilistic Predictive Uncertainty for Deep Learning
by: Ni, Yao, et al.
Published: (2026)
by: Ni, Yao, et al.
Published: (2026)
Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion Perception
by: Han, Shuangpeng, et al.
Published: (2024)
by: Han, Shuangpeng, et al.
Published: (2024)
Pose Prior Learner: Unsupervised Categorical Prior Learning for Pose Estimation
by: Wang, Ziyu, et al.
Published: (2024)
by: Wang, Ziyu, et al.
Published: (2024)
PRISM: Progressive Reasoning through Iterative Slot Memory for Vision
by: Wang, Ziyu, et al.
Published: (2026)
by: Wang, Ziyu, et al.
Published: (2026)
Adaptive Visual Scene Understanding: Incremental Scene Graph Generation
by: Khandelwal, Naitik, et al.
Published: (2023)
by: Khandelwal, Naitik, et al.
Published: (2023)
LLM-to-Phy3D: Physically Conform Online 3D Object Generation with LLMs
by: Wong, Melvin, et al.
Published: (2025)
by: Wong, Melvin, et al.
Published: (2025)
EmoAttack: Emotion-to-Image Diffusion Models for Emotional Backdoor Generation
by: Wei, Tianyu, et al.
Published: (2024)
by: Wei, Tianyu, et al.
Published: (2024)
Pix2Fact: When Vision Is Not Enough -- Benchmarking Fine-Grained VQA with Web Verification on High-Resolution Real-World Scenes
by: Jiang, Yifan, et al.
Published: (2026)
by: Jiang, Yifan, et al.
Published: (2026)
Generative AI-based Prompt Evolution Engineering Design Optimization With Vision-Language Model
by: Wong, Melvin, et al.
Published: (2024)
by: Wong, Melvin, et al.
Published: (2024)
Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction
by: Zhang, Zhengquan, et al.
Published: (2025)
by: Zhang, Zhengquan, et al.
Published: (2025)
A Plug-and-Play Multi-Criteria Guidance for Diverse In-Betweening Human Motion Generation
by: Yu, Hua, et al.
Published: (2025)
by: Yu, Hua, et al.
Published: (2025)
Dynamic-Aware Video Distillation: Optimizing Temporal Resolution Based on Video Semantics
by: Zhao, Yinjie, et al.
Published: (2025)
by: Zhao, Yinjie, et al.
Published: (2025)
MMT-ARD: Multimodal Multi-Teacher Adversarial Distillation for Robust Vision-Language Models
by: Li, Yuqi, et al.
Published: (2025)
by: Li, Yuqi, et al.
Published: (2025)
Unforgettable Lessons from Forgettable Images: Intra-Class Memorability Matters in Computer Vision
by: Jing, Jie, et al.
Published: (2024)
by: Jing, Jie, et al.
Published: (2024)
EEmo-Logic: A Unified Dataset and Multi-Stage Framework for Comprehensive Image-Evoked Emotion Assessment
by: Gao, Lancheng, et al.
Published: (2026)
by: Gao, Lancheng, et al.
Published: (2026)
Hard-Label Black-Box Attacks on 3D Point Clouds
by: Liu, Daizong, et al.
Published: (2024)
by: Liu, Daizong, et al.
Published: (2024)
EmoDiffusion: Enhancing Emotional 3D Facial Animation with Latent Diffusion Models
by: Zhang, Yixuan, et al.
Published: (2025)
by: Zhang, Yixuan, et al.
Published: (2025)
NeuSpring: Neural Spring Fields for Reconstruction and Simulation of Deformable Objects from Videos
by: Xu, Qingshan, et al.
Published: (2025)
by: Xu, Qingshan, et al.
Published: (2025)
ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers
by: Ma, Yiyang, et al.
Published: (2025)
by: Ma, Yiyang, et al.
Published: (2025)
LLM2TEA: An Agentic AI Designer for Discovery with Generative Evolutionary Multitasking
by: Wong, Melvin, et al.
Published: (2024)
by: Wong, Melvin, et al.
Published: (2024)
Seeing Through Uncertainty: A Free-Energy Approach for Real-Time Perceptual Adaptation in Robust Visual Navigation
by: Piriyajitakonkij, Maytus, et al.
Published: (2024)
by: Piriyajitakonkij, Maytus, et al.
Published: (2024)
Preserving Image Properties Through Initializations in Diffusion Models
by: Zhang, Jeffrey, et al.
Published: (2024)
by: Zhang, Jeffrey, et al.
Published: (2024)
Show Me: Unifying Instructional Image and Video Generation with Diffusion Models
by: Pu, Yujiang, et al.
Published: (2025)
by: Pu, Yujiang, et al.
Published: (2025)
Unveiling the Tapestry: the Interplay of Generalization and Forgetting in Continual Learning
by: Shi, Zenglin, et al.
Published: (2022)
by: Shi, Zenglin, et al.
Published: (2022)
Similar Items
-
Lightweight and Accurate Multi-View Stereo with Confidence-Aware Diffusion Model
by: Wang, Fangjinhua, et al.
Published: (2025) -
Learning to See Through a Baby's Eyes: Early Visual Diets Enable Robust Visual Intelligence in Humans and Machines
by: Cai, Yusen, et al.
Published: (2025) -
Fine-grained Abnormality Prompt Learning for Zero-shot Anomaly Detection
by: Zhu, Jiawen, et al.
Published: (2024) -
EmoEdit: Evoking Emotions through Image Manipulation
by: Yang, Jingyuan, et al.
Published: (2024) -
Agentic Spatio-Temporal Grounding via Collaborative Reasoning
by: Zhao, Heng, et al.
Published: (2026)