Saved in:
| Main Authors: | Shih, Chun-Yen, Peng, Li-Xuan, Liao, Jia-Wei, Chu, Ernie, Chou, Cheng-Fu, Chen, Jun-Cheng |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.11810 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
M-ErasureBench: A Comprehensive Multimodal Evaluation Benchmark for Concept Erasure in Diffusion Models
by: Weng, Ju-Hsuan, et al.
Published: (2025)
by: Weng, Ju-Hsuan, et al.
Published: (2025)
Diffusion-based Aesthetic QR Code Generation via Scanning-Robust Perceptual Guidance
by: Liao, Jia-Wei, et al.
Published: (2024)
by: Liao, Jia-Wei, et al.
Published: (2024)
PixelLM: Pixel Reasoning with Large Multimodal Model
by: Ren, Zhongwei, et al.
Published: (2023)
by: Ren, Zhongwei, et al.
Published: (2023)
PixelDiT: Pixel Diffusion Transformers for Image Generation
by: Yu, Yongsheng, et al.
Published: (2025)
by: Yu, Yongsheng, et al.
Published: (2025)
DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement
by: Liao, Jia-Wei, et al.
Published: (2024)
by: Liao, Jia-Wei, et al.
Published: (2024)
PixelArena: A benchmark for Pixel-Precision Visual Intelligence
by: Liang, Feng, et al.
Published: (2025)
by: Liang, Feng, et al.
Published: (2025)
Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers
by: Xu, Gangwei, et al.
Published: (2025)
by: Xu, Gangwei, et al.
Published: (2025)
FrequencyBooster: Full-Frequency Modeling for High-Fidelity Pixel Diffusion
by: Ma, Lichen, et al.
Published: (2026)
by: Ma, Lichen, et al.
Published: (2026)
PixelGen: Improving Pixel Diffusion with Perceptual Supervision
by: Ma, Zehong, et al.
Published: (2026)
by: Ma, Zehong, et al.
Published: (2026)
PixelFlow: Pixel-Space Generative Models with Flow
by: Chen, Shoufa, et al.
Published: (2025)
by: Chen, Shoufa, et al.
Published: (2025)
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
by: Zhang, David Junhao, et al.
Published: (2023)
by: Zhang, David Junhao, et al.
Published: (2023)
Pixel is a Barrier: Diffusion Models Are More Adversarially Robust Than We Think
by: Xue, Haotian, et al.
Published: (2024)
by: Xue, Haotian, et al.
Published: (2024)
Not All Pixels Are Equal: Pixel-wise Meta-Learning for Medical Segmentation with Noisy Labels
by: Mu, Chenyu, et al.
Published: (2025)
by: Mu, Chenyu, et al.
Published: (2025)
PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation
by: Liu, Lihua, et al.
Published: (2025)
by: Liu, Lihua, et al.
Published: (2025)
Beyond Pixels: Semantic-aware Typographic Attack for Geo-Privacy Protection
by: Zhu, Jiayi, et al.
Published: (2025)
by: Zhu, Jiayi, et al.
Published: (2025)
DiP: Taming Diffusion Models in Pixel Space
by: Chen, Zhennan, et al.
Published: (2025)
by: Chen, Zhennan, et al.
Published: (2025)
Novel View Synthesis with Pixel-Space Diffusion Models
by: Elata, Noam, et al.
Published: (2024)
by: Elata, Noam, et al.
Published: (2024)
PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation
by: Jiang, Liyao, et al.
Published: (2024)
by: Jiang, Liyao, et al.
Published: (2024)
GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing
by: Shabbir, Akashah, et al.
Published: (2025)
by: Shabbir, Akashah, et al.
Published: (2025)
Not All Tokens Need 40 Steps: Heterogeneous Step Allocation in Diffusion Transformers for Efficient Video Generation
by: Chu, Ernie, et al.
Published: (2026)
by: Chu, Ernie, et al.
Published: (2026)
Reading Between the Pixels: An Inscriptive Jailbreak Attack on Text-to-Image Models
by: Ying, Zonghao, et al.
Published: (2026)
by: Ying, Zonghao, et al.
Published: (2026)
Inter-Image Pixel Shuffling for Multi-focus Image Fusion
by: Lin, Huangxing, et al.
Published: (2026)
by: Lin, Huangxing, et al.
Published: (2026)
BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks
by: Cheng, Zhiyuan, et al.
Published: (2024)
by: Cheng, Zhiyuan, et al.
Published: (2024)
Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
by: Zhang, Tao, et al.
Published: (2025)
by: Zhang, Tao, et al.
Published: (2025)
PixNerd: Pixel Neural Field Diffusion
by: Wang, Shuai, et al.
Published: (2025)
by: Wang, Shuai, et al.
Published: (2025)
Registers Matter for Pixel-Space Diffusion Transformers
by: Starodubcev, Nikita, et al.
Published: (2026)
by: Starodubcev, Nikita, et al.
Published: (2026)
Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation
by: Du, Ye, et al.
Published: (2024)
by: Du, Ye, et al.
Published: (2024)
JoDiffusion: Jointly Diffusing Image with Pixel-Level Annotations for Semantic Segmentation Promotion
by: Wang, Haoyu, et al.
Published: (2025)
by: Wang, Haoyu, et al.
Published: (2025)
GreedyPixel: Fine-Grained Black-Box Adversarial Attack Via Greedy Algorithm
by: Wang, Hanrui, et al.
Published: (2025)
by: Wang, Hanrui, et al.
Published: (2025)
Pixel-Optimization-Free Patch Attack on Stereo Depth Estimation
by: Liu, Hangcheng, et al.
Published: (2025)
by: Liu, Hangcheng, et al.
Published: (2025)
Lattice Boltzmann Model for Learning Real-World Pixel Dynamicity
by: Zheng, Guangze, et al.
Published: (2025)
by: Zheng, Guangze, et al.
Published: (2025)
Motion Artifact Removal in Pixel-Frequency Domain via Alternate Masks and Diffusion Model
by: Xu, Jiahua, et al.
Published: (2024)
by: Xu, Jiahua, et al.
Published: (2024)
PixelThink: Towards Efficient Chain-of-Pixel Reasoning
by: Wang, Song, et al.
Published: (2025)
by: Wang, Song, et al.
Published: (2025)
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models
by: NVIDIA, et al.
Published: (2024)
by: NVIDIA, et al.
Published: (2024)
PixelVLA: Advancing Pixel-level Understanding in Vision-Language-Action Model
by: Liang, Wenqi, et al.
Published: (2025)
by: Liang, Wenqi, et al.
Published: (2025)
VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models
by: Chou, Sheng-Yen, et al.
Published: (2023)
by: Chou, Sheng-Yen, et al.
Published: (2023)
HyperDiT: Hyper-Connected Transformers for High-Fidelity Pixel-Space Diffusion
by: He, Yu, et al.
Published: (2026)
by: He, Yu, et al.
Published: (2026)
Pixel-Perfect Visual Geometry Estimation
by: Xu, Gangwei, et al.
Published: (2026)
by: Xu, Gangwei, et al.
Published: (2026)
PixelSmile: Toward Fine-Grained Facial Expression Editing
by: Hua, Jiabin, et al.
Published: (2026)
by: Hua, Jiabin, et al.
Published: (2026)
From Waveforms to Pixels: A Survey on Audio-Visual Segmentation
by: Li, Jia, et al.
Published: (2025)
by: Li, Jia, et al.
Published: (2025)
Similar Items
-
M-ErasureBench: A Comprehensive Multimodal Evaluation Benchmark for Concept Erasure in Diffusion Models
by: Weng, Ju-Hsuan, et al.
Published: (2025) -
Diffusion-based Aesthetic QR Code Generation via Scanning-Robust Perceptual Guidance
by: Liao, Jia-Wei, et al.
Published: (2024) -
PixelLM: Pixel Reasoning with Large Multimodal Model
by: Ren, Zhongwei, et al.
Published: (2023) -
PixelDiT: Pixel Diffusion Transformers for Image Generation
by: Yu, Yongsheng, et al.
Published: (2025) -
DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement
by: Liao, Jia-Wei, et al.
Published: (2024)