:: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Shih, Chun-Yen, Peng, Li-Xuan, Liao, Jia-Wei, Chu, Ernie, Chou, Cheng-Fu, Chen, Jun-Cheng
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2408.11810
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

M-ErasureBench: A Comprehensive Multimodal Evaluation Benchmark for Concept Erasure in Diffusion Models
by: Weng, Ju-Hsuan, et al.
Published: (2025)

Diffusion-based Aesthetic QR Code Generation via Scanning-Robust Perceptual Guidance
by: Liao, Jia-Wei, et al.
Published: (2024)

PixelLM: Pixel Reasoning with Large Multimodal Model
by: Ren, Zhongwei, et al.
Published: (2023)

PixelDiT: Pixel Diffusion Transformers for Image Generation
by: Yu, Yongsheng, et al.
Published: (2025)

DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement
by: Liao, Jia-Wei, et al.
Published: (2024)

PixelArena: A benchmark for Pixel-Precision Visual Intelligence
by: Liang, Feng, et al.
Published: (2025)

Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers
by: Xu, Gangwei, et al.
Published: (2025)

FrequencyBooster: Full-Frequency Modeling for High-Fidelity Pixel Diffusion
by: Ma, Lichen, et al.
Published: (2026)

PixelGen: Improving Pixel Diffusion with Perceptual Supervision
by: Ma, Zehong, et al.
Published: (2026)

PixelFlow: Pixel-Space Generative Models with Flow
by: Chen, Shoufa, et al.
Published: (2025)

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
by: Zhang, David Junhao, et al.
Published: (2023)

Pixel is a Barrier: Diffusion Models Are More Adversarially Robust Than We Think
by: Xue, Haotian, et al.
Published: (2024)

Not All Pixels Are Equal: Pixel-wise Meta-Learning for Medical Segmentation with Noisy Labels
by: Mu, Chenyu, et al.
Published: (2025)

PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation
by: Liu, Lihua, et al.
Published: (2025)

Beyond Pixels: Semantic-aware Typographic Attack for Geo-Privacy Protection
by: Zhu, Jiayi, et al.
Published: (2025)

DiP: Taming Diffusion Models in Pixel Space
by: Chen, Zhennan, et al.
Published: (2025)

Novel View Synthesis with Pixel-Space Diffusion Models
by: Elata, Noam, et al.
Published: (2024)

PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation
by: Jiang, Liyao, et al.
Published: (2024)

GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing
by: Shabbir, Akashah, et al.
Published: (2025)

Not All Tokens Need 40 Steps: Heterogeneous Step Allocation in Diffusion Transformers for Efficient Video Generation
by: Chu, Ernie, et al.
Published: (2026)

Reading Between the Pixels: An Inscriptive Jailbreak Attack on Text-to-Image Models
by: Ying, Zonghao, et al.
Published: (2026)

Inter-Image Pixel Shuffling for Multi-focus Image Fusion
by: Lin, Huangxing, et al.
Published: (2026)

BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks
by: Cheng, Zhiyuan, et al.
Published: (2024)

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
by: Zhang, Tao, et al.
Published: (2025)

PixNerd: Pixel Neural Field Diffusion
by: Wang, Shuai, et al.
Published: (2025)

Registers Matter for Pixel-Space Diffusion Transformers
by: Starodubcev, Nikita, et al.
Published: (2026)

Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation
by: Du, Ye, et al.
Published: (2024)

JoDiffusion: Jointly Diffusing Image with Pixel-Level Annotations for Semantic Segmentation Promotion
by: Wang, Haoyu, et al.
Published: (2025)

GreedyPixel: Fine-Grained Black-Box Adversarial Attack Via Greedy Algorithm
by: Wang, Hanrui, et al.
Published: (2025)

Pixel-Optimization-Free Patch Attack on Stereo Depth Estimation
by: Liu, Hangcheng, et al.
Published: (2025)

Lattice Boltzmann Model for Learning Real-World Pixel Dynamicity
by: Zheng, Guangze, et al.
Published: (2025)

Motion Artifact Removal in Pixel-Frequency Domain via Alternate Masks and Diffusion Model
by: Xu, Jiahua, et al.
Published: (2024)

PixelThink: Towards Efficient Chain-of-Pixel Reasoning
by: Wang, Song, et al.
Published: (2025)

Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models
by: NVIDIA, et al.
Published: (2024)

PixelVLA: Advancing Pixel-level Understanding in Vision-Language-Action Model
by: Liang, Wenqi, et al.
Published: (2025)

VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models
by: Chou, Sheng-Yen, et al.
Published: (2023)

HyperDiT: Hyper-Connected Transformers for High-Fidelity Pixel-Space Diffusion
by: He, Yu, et al.
Published: (2026)

Pixel-Perfect Visual Geometry Estimation
by: Xu, Gangwei, et al.
Published: (2026)

PixelSmile: Toward Fine-Grained Facial Expression Editing
by: Hua, Jiabin, et al.
Published: (2026)

From Waveforms to Pixels: A Survey on Audio-Visual Segmentation
by: Li, Jia, et al.
Published: (2025)