Saved in:
| Main Authors: | Fu, Tsu-Jui, Wang, Xin Eric, Grafton, Scott, Eckstein, Miguel, Wang, William Yang |
|---|---|
| Format: | Preprint |
| Published: |
2020
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2009.09566 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampling
by: Fu, Tsu-Jui, et al.
Published: (2019)
by: Fu, Tsu-Jui, et al.
Published: (2019)
Guiding Instruction-based Image Editing via Multimodal Large Language Models
by: Fu, Tsu-Jui, et al.
Published: (2023)
by: Fu, Tsu-Jui, et al.
Published: (2023)
From Text to Pixel: Advancing Long-Context Understanding in MLLMs
by: Lu, Yujie, et al.
Published: (2024)
by: Lu, Yujie, et al.
Published: (2024)
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
by: Fu, Tsu-Jui, et al.
Published: (2025)
by: Fu, Tsu-Jui, et al.
Published: (2025)
GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing
by: Qian, Yusu, et al.
Published: (2025)
by: Qian, Yusu, et al.
Published: (2025)
Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners
by: He, Xuehai, et al.
Published: (2023)
by: He, Xuehai, et al.
Published: (2023)
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View
by: Schumann, Raphael, et al.
Published: (2023)
by: Schumann, Raphael, et al.
Published: (2023)
DReX: Pure Vision Fusion of Self-Supervised and Convolutional Representations for Image Complexity Prediction
by: Skaza, Jonathan, et al.
Published: (2025)
by: Skaza, Jonathan, et al.
Published: (2025)
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback
by: Li, Jiachen, et al.
Published: (2024)
by: Li, Jiachen, et al.
Published: (2024)
Counterfactual Image Editing
by: Pan, Yushu, et al.
Published: (2024)
by: Pan, Yushu, et al.
Published: (2024)
MIRA: Multimodal Iterative Reasoning Agent for Image Editing
by: Zeng, Ziyun, et al.
Published: (2025)
by: Zeng, Ziyun, et al.
Published: (2025)
Revealing the Gap in Human and VLM Scene Perception through Counterfactual Semantic Saliency
by: Wen, Ziqi, et al.
Published: (2026)
by: Wen, Ziqi, et al.
Published: (2026)
Iterative Motion Editing with Natural Language
by: Goel, Purvi, et al.
Published: (2023)
by: Goel, Purvi, et al.
Published: (2023)
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models
by: Zhang, Haotian, et al.
Published: (2024)
by: Zhang, Haotian, et al.
Published: (2024)
MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models
by: Zhu, Hongyang, et al.
Published: (2025)
by: Zhu, Hongyang, et al.
Published: (2025)
Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning
by: He, Qingdong, et al.
Published: (2025)
by: He, Qingdong, et al.
Published: (2025)
IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models
by: You, Haoxuan, et al.
Published: (2023)
by: You, Haoxuan, et al.
Published: (2023)
Understanding the Implicit User Intention via Reasoning with Large Language Model for Image Editing
by: Wang, Yijia, et al.
Published: (2025)
by: Wang, Yijia, et al.
Published: (2025)
TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and Image-to-Video Generation
by: Feng, Weixi, et al.
Published: (2024)
by: Feng, Weixi, et al.
Published: (2024)
Taming Outlier Tokens in Diffusion Transformers
by: Wu, Xiaoyu, et al.
Published: (2026)
by: Wu, Xiaoyu, et al.
Published: (2026)
VibeFlow: Versatile Video Chroma-Lux Editing through Self-Supervised Learning
by: Li, Yifan, et al.
Published: (2026)
by: Li, Yifan, et al.
Published: (2026)
Delta-Adapter: Scalable Exemplar-Based Image Editing with Single-Pair Supervision
by: Chen, Jiacheng, et al.
Published: (2026)
by: Chen, Jiacheng, et al.
Published: (2026)
DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation
by: Chen, Chen, et al.
Published: (2025)
by: Chen, Chen, et al.
Published: (2025)
Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training
by: Xie, Ming-Kun, et al.
Published: (2024)
by: Xie, Ming-Kun, et al.
Published: (2024)
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
by: Li, Ming, et al.
Published: (2025)
by: Li, Ming, et al.
Published: (2025)
Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals
by: Stojanov, Stefan, et al.
Published: (2025)
by: Stojanov, Stefan, et al.
Published: (2025)
Doubly Abductive Counterfactual Inference for Text-based Image Editing
by: Song, Xue, et al.
Published: (2024)
by: Song, Xue, et al.
Published: (2024)
Beyond Generation: Unlocking Universal Editing via Self-Supervised Fine-Tuning
by: Chen, Harold Haodong, et al.
Published: (2024)
by: Chen, Harold Haodong, et al.
Published: (2024)
ReasonEdit: Towards Reasoning-Enhanced Image Editing Models
by: Yin, Fukun, et al.
Published: (2025)
by: Yin, Fukun, et al.
Published: (2025)
ProtoFair: Fair Self-Supervised Contrastive Learning via Pseudo-Counterfactual Pairs
by: Halawa, Marah, et al.
Published: (2026)
by: Halawa, Marah, et al.
Published: (2026)
An Interpretable Local Editing Model for Counterfactual Medical Image Generation
by: Min, Hyungi, et al.
Published: (2026)
by: Min, Hyungi, et al.
Published: (2026)
UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying
by: Bai, Chengyu, et al.
Published: (2025)
by: Bai, Chengyu, et al.
Published: (2025)
GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing
by: Liu, Mingxin, et al.
Published: (2026)
by: Liu, Mingxin, et al.
Published: (2026)
Self-Supervised Diffusion MRI Denoising via Iterative and Stable Refinement
by: Wu, Chenxu, et al.
Published: (2025)
by: Wu, Chenxu, et al.
Published: (2025)
UniREditBench: A Unified Reasoning-based Image Editing Benchmark
by: Han, Feng, et al.
Published: (2025)
by: Han, Feng, et al.
Published: (2025)
CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching
by: Chen, Chen, et al.
Published: (2025)
by: Chen, Chen, et al.
Published: (2025)
Single Image Iterative Subject-driven Generation and Editing
by: Shpitzer, Yair, et al.
Published: (2025)
by: Shpitzer, Yair, et al.
Published: (2025)
MedEdit: Counterfactual Diffusion-based Image Editing on Brain MRI
by: Alaya, Malek Ben, et al.
Published: (2024)
by: Alaya, Malek Ben, et al.
Published: (2024)
They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias
by: Magid, Salma Abdel, et al.
Published: (2024)
by: Magid, Salma Abdel, et al.
Published: (2024)
Exploring Iterative Manifold Constraint for Zero-shot Image Editing
by: Li, Maomao, et al.
Published: (2025)
by: Li, Maomao, et al.
Published: (2025)
Similar Items
-
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampling
by: Fu, Tsu-Jui, et al.
Published: (2019) -
Guiding Instruction-based Image Editing via Multimodal Large Language Models
by: Fu, Tsu-Jui, et al.
Published: (2023) -
From Text to Pixel: Advancing Long-Context Understanding in MLLMs
by: Lu, Yujie, et al.
Published: (2024) -
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
by: Fu, Tsu-Jui, et al.
Published: (2025) -
GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing
by: Qian, Yusu, et al.
Published: (2025)