Saved in:
| Main Authors: | Wang, YuAn, Li, Xiaofan, Huang, Chi, Zhang, Wenhao, Li, Hao, Wang, Bosheng, Sun, Xun, Wang, Jun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.21113 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Video4Edit: Viewing Image Editing as a Degenerate Temporal Process
by: Li, Xiaofan, et al.
Published: (2025)
by: Li, Xiaofan, et al.
Published: (2025)
NeRF-DetS: Enhanced Adaptive Spatial-wise Sampling and View-wise Fusion Strategies for NeRF-based Indoor Multi-view 3D Object Detection
by: Huang, Chi, et al.
Published: (2024)
by: Huang, Chi, et al.
Published: (2024)
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer
by: Jia, Ding, et al.
Published: (2024)
by: Jia, Ding, et al.
Published: (2024)
Learning Pixel-wise Continuous Depth Representation via Clustering for Depth Completion
by: Shenglun, Chen, et al.
Published: (2024)
by: Shenglun, Chen, et al.
Published: (2024)
BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference
by: Kou, Siqi, et al.
Published: (2023)
by: Kou, Siqi, et al.
Published: (2023)
Not All Pixels Are Equal: Pixel-wise Meta-Learning for Medical Segmentation with Noisy Labels
by: Mu, Chenyu, et al.
Published: (2025)
by: Mu, Chenyu, et al.
Published: (2025)
Guiding Quantitative MRI Reconstruction with Phase-wise Uncertainty
by: Sun, Haozhong, et al.
Published: (2025)
by: Sun, Haozhong, et al.
Published: (2025)
IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion
by: Hu, Wenhao, et al.
Published: (2025)
by: Hu, Wenhao, et al.
Published: (2025)
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding
by: Fan, Weichen, et al.
Published: (2025)
by: Fan, Weichen, et al.
Published: (2025)
PQTNet: Pixel-wise Quantitative Thermography Neural Network for Estimating Defect Depth in Polylactic Acid Parts by Additive Manufacturing
by: Deng, Lei, et al.
Published: (2026)
by: Deng, Lei, et al.
Published: (2026)
Pseudo-View Enhancement via Confidence Fusion for Unposed Sparse-View Reconstruction
by: Zhao, Beizhen, et al.
Published: (2026)
by: Zhao, Beizhen, et al.
Published: (2026)
ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation
by: Zhao, Guosheng, et al.
Published: (2025)
by: Zhao, Guosheng, et al.
Published: (2025)
Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping
by: Pan, Zijie, et al.
Published: (2023)
by: Pan, Zijie, et al.
Published: (2023)
Boosting Edge Detection with Pixel-wise Feature Selection: The Extractor-Selector Paradigm
by: Shu, Hao
Published: (2025)
by: Shu, Hao
Published: (2025)
Trustworthy Multimodal Fusion for Sentiment Analysis in Ordinal Sentiment Space
by: Xie, Zhuyang, et al.
Published: (2024)
by: Xie, Zhuyang, et al.
Published: (2024)
Ultra-High-Definition Dynamic Multi-Exposure Image Fusion via Infinite Pixel Learning
by: Chen, Xingchi, et al.
Published: (2024)
by: Chen, Xingchi, et al.
Published: (2024)
Let Geometry GUIDE: Layer-wise Unrolling of Geometric Priors in Multimodal LLMs
by: Wang, Chongyu, et al.
Published: (2026)
by: Wang, Chongyu, et al.
Published: (2026)
PCIM: Learning Pixel Attributions via Pixel-wise Channel Isolation Mixing in High Content Imaging
by: Siegismund, Daniel, et al.
Published: (2024)
by: Siegismund, Daniel, et al.
Published: (2024)
MultiGO++: Monocular 3D Clothed Human Reconstruction via Geometry-Texture Collaboration
by: Yao, Nanjie, et al.
Published: (2026)
by: Yao, Nanjie, et al.
Published: (2026)
FaithfulFaces: Pose-Faithful Facial Identity Preservation for Text-to-Video Generation
by: Wang, Yuanzhi, et al.
Published: (2026)
by: Wang, Yuanzhi, et al.
Published: (2026)
Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors
by: Lee, Jae Joong, et al.
Published: (2024)
by: Lee, Jae Joong, et al.
Published: (2024)
Enhancing Out-of-Distribution Detection with Multitesting-based Layer-wise Feature Fusion
by: Li, Jiawei, et al.
Published: (2024)
by: Li, Jiawei, et al.
Published: (2024)
DogWeave: High-Fidelity 3D Canine Reconstruction from a Single Image via Normal Fusion and Conditional Inpainting
by: Sun, Shufan, et al.
Published: (2026)
by: Sun, Shufan, et al.
Published: (2026)
Towards Pixel-Wise Anomaly Location for High-Resolution PCBA via Self-Supervised Image Reconstruction
by: Liu, Wuyi, et al.
Published: (2025)
by: Liu, Wuyi, et al.
Published: (2025)
MoRL: Reinforced Reasoning for Unified Motion Understanding and Generation
by: Wang, Hongpeng, et al.
Published: (2026)
by: Wang, Hongpeng, et al.
Published: (2026)
Beyond Semantic Features: Pixel-level Mapping for Generalized AI-Generated Image Detection
by: Zhou, Chenming, et al.
Published: (2025)
by: Zhou, Chenming, et al.
Published: (2025)
Fine-Grained Controllable Apparel Showcase Image Generation via Garment-Centric Outpainting
by: Zhang, Rong, et al.
Published: (2025)
by: Zhang, Rong, et al.
Published: (2025)
Task-wise Sampling Convolutions for Arbitrary-Oriented Object Detection in Aerial Images
by: Huang, Zhanchao, et al.
Published: (2022)
by: Huang, Zhanchao, et al.
Published: (2022)
InterFusion: Text-Driven Generation of 3D Human-Object Interaction
by: Dai, Sisi, et al.
Published: (2024)
by: Dai, Sisi, et al.
Published: (2024)
Faithful-MR1: Faithful Multimodal Reasoning via Anchoring and Reinforcing Visual Attention
by: Tian, Changyuan, et al.
Published: (2026)
by: Tian, Changyuan, et al.
Published: (2026)
PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment
by: Huang, Dingbang, et al.
Published: (2025)
by: Huang, Dingbang, et al.
Published: (2025)
Pixel-wise Smoothing for Certified Robustness against Camera Motion Perturbations
by: Hu, Hanjiang, et al.
Published: (2023)
by: Hu, Hanjiang, et al.
Published: (2023)
Inter-Image Pixel Shuffling for Multi-focus Image Fusion
by: Lin, Huangxing, et al.
Published: (2026)
by: Lin, Huangxing, et al.
Published: (2026)
FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation
by: Xu, Honghao, et al.
Published: (2024)
by: Xu, Honghao, et al.
Published: (2024)
U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration
by: Li, Xiaofan, et al.
Published: (2025)
by: Li, Xiaofan, et al.
Published: (2025)
Preserve, Reveal, Expand: Faithful 4D Video Editing with Region-Aware Conditioning
by: Hu, Zhangchi, et al.
Published: (2026)
by: Hu, Zhangchi, et al.
Published: (2026)
Breaking Shallow Limits: Task-Driven Pixel Fusion for Gap-free RGBT Tracking
by: Lu, Andong, et al.
Published: (2025)
by: Lu, Andong, et al.
Published: (2025)
PixelThink: Towards Efficient Chain-of-Pixel Reasoning
by: Wang, Song, et al.
Published: (2025)
by: Wang, Song, et al.
Published: (2025)
Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis
by: Miao, Boming, et al.
Published: (2024)
by: Miao, Boming, et al.
Published: (2024)
Generating Faithful and Salient Text from Multimodal Data
by: Hashem, Tahsina, et al.
Published: (2024)
by: Hashem, Tahsina, et al.
Published: (2024)
Similar Items
-
Video4Edit: Viewing Image Editing as a Degenerate Temporal Process
by: Li, Xiaofan, et al.
Published: (2025) -
NeRF-DetS: Enhanced Adaptive Spatial-wise Sampling and View-wise Fusion Strategies for NeRF-based Indoor Multi-view 3D Object Detection
by: Huang, Chi, et al.
Published: (2024) -
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer
by: Jia, Ding, et al.
Published: (2024) -
Learning Pixel-wise Continuous Depth Representation via Clustering for Depth Completion
by: Shenglun, Chen, et al.
Published: (2024) -
BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference
by: Kou, Siqi, et al.
Published: (2023)