| Main Authors: | Tang, Kenan, Arunshankar, Praveen, Hua, Andong, Yang, Anthony, Qin, Yao |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2604.03400 |
Similar Items
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
by: Ma, Xingjun, et al.
Published: (2026)
SPICE: A Synergistic, Precise, Iterative, and Customizable Image Editing Workflow
by: Tang, Kenan, et al.
Published: (2025)
GlyphBanana: Advancing Precise Text Rendering Through Agentic Workflows
by: Yan, Zexuan, et al.
Published: (2026)
Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets
by: Zuo, Jialong, et al.
Published: (2025)
Can Nano Banana 2 Replace Traditional Image Restoration Models? An Evaluation of Its Performance on Image Restoration Tasks
by: Sun, Weixiong, et al.
Published: (2026)
Agent Banana: High-Fidelity Image Editing with Agentic Thinking and Tooling
by: Ye, Ruijie, et al.
Published: (2026)
Recognizing Pneumonia in Real-World Chest X-rays with a Classifier Trained with Images Synthetically Generated by Nano Banana
by: Peng, Jiachuan, et al.
Published: (2025)
Iterative Optimization Annotation Pipeline and ALSS-YOLO-Seg for Efficient Banana Plantation Segmentation in UAV Imagery
by: He, Ang, et al.
Published: (2024)
MultiBanana: A Challenging Benchmark for Multi-Reference Text-to-Image Generation
by: Oshima, Yuta, et al.
Published: (2025)
LMM-IQA: Image Quality Assessment for Low-Dose CT Imaging
by: Celik, Kagan, et al.
Published: (2025)
DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild
by: Fu, Honghao, et al.
Published: (2024)
PaperBanana: Automating Academic Illustration for AI Scientists
by: Zhu, Dawei, et al.
Published: (2026)
Refine-IQA: Multi-Stage Reinforcement Finetuning for Perceptual Image Quality Assessment
by: Jia, Ziheng, et al.
Published: (2025)
Cross-IQA: Unsupervised Learning for Image Quality Assessment
by: Zhang, Zhen
Published: (2024)
ExpressEdit: Fast Editing of Stylized Facial Expressions with Diffusion Models in Photoshop
by: Tang, Kenan, et al.
Published: (2026)
SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards
by: Gao, Yuan, et al.
Published: (2025)
Life-IQA: Boosting Blind Image Quality Assessment through GCN-enhanced Layer Interaction and MoE-based Feature Decoupling
by: Tang, Long, et al.
Published: (2025)
MedIQA: A Scalable Foundation Model for Prompt-Driven Medical Image Quality Assessment
by: Xun, Siyi, et al.
Published: (2025)
Dog-IQA: Standard-guided Zero-shot MLLM for Mix-grained Image Quality Assessment
by: Liu, Kai, et al.
Published: (2024)
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing
by: Qian, Yusu, et al.
Published: (2025)
Robustness as Architecture: Designing IQA Models to Withstand Adversarial Perturbations
by: Meleshin, Igor, et al.
Published: (2025)
ClimateIQA: A New Dataset and Benchmark to Advance Vision-Language Models in Meteorology Anomalies Analysis
by: Chen, Jian, et al.
Published: (2024)
IQA-Adapter: Exploring Knowledge Transfer from Image Quality Assessment to Diffusion-based Generative Models
by: Abud, Khaled, et al.
Published: (2024)
Parameter-Efficient Adaptation of mPLUG-Owl2 via Pixel-Level Visual Prompts for NR-IQA
by: Benmahane, Yahya, et al.
Published: (2025)
Exploring Bias in over 100 Text-to-Image Generative Models
by: Vice, Jordan, et al.
Published: (2025)
Improving Adversarial Transferability in MLLMs via Dynamic Vision-Language Alignment Attack
by: Gu, Chenhe, et al.
Published: (2025)
Med-Banana-50K: A Cross-modality Large-Scale Dataset for Text-guided Medical Image Editing
by: Chen, Zhihui, et al.
Published: (2025)
From Global to Granular: Revealing IQA Model Performance via Correlation Surface
by: Chen, Baoliang, et al.
Published: (2026)
MieDB-100k: A Comprehensive Dataset for Medical Image Editing
by: Lai, Yongfan, et al.
Published: (2026)
Towards Robust Optical-SAR Object Detection under Missing Modalities: A Dynamic Quality-Aware Fusion Framework
by: Zhao, Zhicheng, et al.
Published: (2025)
PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts
by: Chen, Zewen, et al.
Published: (2024)
1%>100%: High-Efficiency Visual Adapter with Complex Linear Projection Optimization
by: Yin, Dongshuo, et al.
Published: (2026)
Initialization Matters for Adversarial Transfer Learning
by: Hua, Andong, et al.
Published: (2023)
Zero-Training Task-Specific Model Synthesis for Few-Shot Medical Image Classification
by: Qin, Yao, et al.
Published: (2025)
Improved GUI Grounding via Iterative Narrowing
by: Nguyen, Anthony
Published: (2024)
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks
by: Yin, Dongshuo, et al.
Published: (2024)
ME-IQA: Memory-Enhanced Image Quality Assessment via Re-Ranking
by: Fan, Kanglong, et al.
Published: (2026)
StreamPro: From Reactive Perception to Proactive Decision-Making in Streaming Video
by: Li, Ao, et al.
Published: (2026)
Dual-IPO: Dual-Iterative Preference Optimization for Text-to-Video Generation
by: Yang, Xiaomeng, et al.
Published: (2025)
Breaking Shallow Limits: Task-Driven Pixel Fusion for Gap-free RGBT Tracking
by: Lu, Andong, et al.
Published: (2025)