:: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wang, YuAn, Li, Xiaofan, Huang, Chi, Zhang, Wenhao, Li, Hao, Wang, Bosheng, Sun, Xun, Wang, Jun
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2511.21113
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Video4Edit: Viewing Image Editing as a Degenerate Temporal Process
by: Li, Xiaofan, et al.
Published: (2025)

NeRF-DetS: Enhanced Adaptive Spatial-wise Sampling and View-wise Fusion Strategies for NeRF-based Indoor Multi-view 3D Object Detection
by: Huang, Chi, et al.
Published: (2024)

GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer
by: Jia, Ding, et al.
Published: (2024)

Learning Pixel-wise Continuous Depth Representation via Clustering for Depth Completion
by: Shenglun, Chen, et al.
Published: (2024)

BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference
by: Kou, Siqi, et al.
Published: (2023)

Not All Pixels Are Equal: Pixel-wise Meta-Learning for Medical Segmentation with Noisy Labels
by: Mu, Chenyu, et al.
Published: (2025)

Guiding Quantitative MRI Reconstruction with Phase-wise Uncertainty
by: Sun, Haozhong, et al.
Published: (2025)

IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion
by: Hu, Wenhao, et al.
Published: (2025)

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding
by: Fan, Weichen, et al.
Published: (2025)

PQTNet: Pixel-wise Quantitative Thermography Neural Network for Estimating Defect Depth in Polylactic Acid Parts by Additive Manufacturing
by: Deng, Lei, et al.
Published: (2026)

Pseudo-View Enhancement via Confidence Fusion for Unposed Sparse-View Reconstruction
by: Zhao, Beizhen, et al.
Published: (2026)

ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation
by: Zhao, Guosheng, et al.
Published: (2025)

Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping
by: Pan, Zijie, et al.
Published: (2023)

Boosting Edge Detection with Pixel-wise Feature Selection: The Extractor-Selector Paradigm
by: Shu, Hao
Published: (2025)

Trustworthy Multimodal Fusion for Sentiment Analysis in Ordinal Sentiment Space
by: Xie, Zhuyang, et al.
Published: (2024)

Ultra-High-Definition Dynamic Multi-Exposure Image Fusion via Infinite Pixel Learning
by: Chen, Xingchi, et al.
Published: (2024)

Let Geometry GUIDE: Layer-wise Unrolling of Geometric Priors in Multimodal LLMs
by: Wang, Chongyu, et al.
Published: (2026)

PCIM: Learning Pixel Attributions via Pixel-wise Channel Isolation Mixing in High Content Imaging
by: Siegismund, Daniel, et al.
Published: (2024)

MultiGO++: Monocular 3D Clothed Human Reconstruction via Geometry-Texture Collaboration
by: Yao, Nanjie, et al.
Published: (2026)

FaithfulFaces: Pose-Faithful Facial Identity Preservation for Text-to-Video Generation
by: Wang, Yuanzhi, et al.
Published: (2026)

Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors
by: Lee, Jae Joong, et al.
Published: (2024)

Enhancing Out-of-Distribution Detection with Multitesting-based Layer-wise Feature Fusion
by: Li, Jiawei, et al.
Published: (2024)

DogWeave: High-Fidelity 3D Canine Reconstruction from a Single Image via Normal Fusion and Conditional Inpainting
by: Sun, Shufan, et al.
Published: (2026)

Towards Pixel-Wise Anomaly Location for High-Resolution PCBA via Self-Supervised Image Reconstruction
by: Liu, Wuyi, et al.
Published: (2025)

MoRL: Reinforced Reasoning for Unified Motion Understanding and Generation
by: Wang, Hongpeng, et al.
Published: (2026)

Beyond Semantic Features: Pixel-level Mapping for Generalized AI-Generated Image Detection
by: Zhou, Chenming, et al.
Published: (2025)

Fine-Grained Controllable Apparel Showcase Image Generation via Garment-Centric Outpainting
by: Zhang, Rong, et al.
Published: (2025)

Task-wise Sampling Convolutions for Arbitrary-Oriented Object Detection in Aerial Images
by: Huang, Zhanchao, et al.
Published: (2022)

InterFusion: Text-Driven Generation of 3D Human-Object Interaction
by: Dai, Sisi, et al.
Published: (2024)

Faithful-MR1: Faithful Multimodal Reasoning via Anchoring and Reinforcing Visual Attention
by: Tian, Changyuan, et al.
Published: (2026)

PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment
by: Huang, Dingbang, et al.
Published: (2025)

Pixel-wise Smoothing for Certified Robustness against Camera Motion Perturbations
by: Hu, Hanjiang, et al.
Published: (2023)

Inter-Image Pixel Shuffling for Multi-focus Image Fusion
by: Lin, Huangxing, et al.
Published: (2026)

FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation
by: Xu, Honghao, et al.
Published: (2024)

U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration
by: Li, Xiaofan, et al.
Published: (2025)

Preserve, Reveal, Expand: Faithful 4D Video Editing with Region-Aware Conditioning
by: Hu, Zhangchi, et al.
Published: (2026)

Breaking Shallow Limits: Task-Driven Pixel Fusion for Gap-free RGBT Tracking
by: Lu, Andong, et al.
Published: (2025)

PixelThink: Towards Efficient Chain-of-Pixel Reasoning
by: Wang, Song, et al.
Published: (2025)

Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis
by: Miao, Boming, et al.
Published: (2024)

Generating Faithful and Salient Text from Multimodal Data
by: Hashem, Tahsina, et al.
Published: (2024)