Saved in:
| Main Authors: | Qi, Daiqing, Zhao, Handong, Shi, Jing, Jenni, Simon, Fan, Yifei, Dernoncourt, Franck, Cohen, Scott, Li, Sheng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.18582 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Reminding Multimodal Large Language Models of Object-aware Knowledge with Retrieved Tags
by: Qi, Daiqing, et al.
Published: (2024)
by: Qi, Daiqing, et al.
Published: (2024)
Better Generative Replay for Continual Federated Learning
by: Qi, Daiqing, et al.
Published: (2023)
by: Qi, Daiqing, et al.
Published: (2023)
PhotoAgent: A Robotic Photographer with Spatial and Aesthetic Understanding
by: Che, Lirong, et al.
Published: (2026)
by: Che, Lirong, et al.
Published: (2026)
Seeing Through Words: Controlling Visual Retrieval Quality with Language Models
by: Lu, Jianglin, et al.
Published: (2026)
by: Lu, Jianglin, et al.
Published: (2026)
More Than the Final Answer: Improving Visual Extraction and Logical Consistency in Vision-Language Models
by: Just, Hoang Anh, et al.
Published: (2025)
by: Just, Hoang Anh, et al.
Published: (2025)
Image Photograph
by: Lafia, Marc
Published: (2019)
by: Lafia, Marc
Published: (2019)
RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward
by: Wu, Qiucheng, et al.
Published: (2026)
by: Wu, Qiucheng, et al.
Published: (2026)
MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities
by: Khosla, Savya, et al.
Published: (2025)
by: Khosla, Savya, et al.
Published: (2025)
A Novel Ophthalmic Benchmark for Evaluating Multimodal Large Language Models with Fundus Photographs and OCT Images
by: Liang, Xiaoyi, et al.
Published: (2025)
by: Liang, Xiaoyi, et al.
Published: (2025)
Group Photograph
by:
Published: (2002)
by:
Published: (2002)
Group Photograph
by:
Published: (2005)
by:
Published: (2005)
Group Photographs
by:
Published: (2003)
by:
Published: (2003)
Group Photograph
by:
Published: (2001)
by:
Published: (2001)
Photographic records
by: NA
Published: (1954)
by: NA
Published: (1954)
Group Photograph
by:
Published: (2006)
by:
Published: (2006)
Group Photograph
by:
Published: (2002)
by:
Published: (2002)
The Photographic Fix
by: Court, Justin
Published: (2026)
by: Court, Justin
Published: (2026)
Cover Photograph
Published: (2024)
Published: (2024)
Cover Photograph
Published: (2024)
Published: (2024)
Cover Photograph
Published: (2025)
Published: (2025)
Cover Photograph
Published: (2024)
Published: (2024)
Cover Photograph
Published: (2025)
Published: (2025)
Cover Photograph
Published: (2024)
Published: (2024)
Cover Photograph
Published: (2024)
Published: (2024)
Cover Photograph
Published: (2024)
Published: (2024)
Cover Photograph
Published: (2024)
Published: (2024)
Group Photograph
by:
Published: (2003)
by:
Published: (2003)
Group Photograph
by:
Published: (2006)
by:
Published: (2006)
Cover Photograph
Published: (2026)
Published: (2026)
Cover Photograph
Published: (2025)
Published: (2025)
Cover Photograph
Published: (2026)
Published: (2026)
Cover Photograph
Published: (2026)
Published: (2026)
Similar Items
-
Reminding Multimodal Large Language Models of Object-aware Knowledge with Retrieved Tags
by: Qi, Daiqing, et al.
Published: (2024) -
Better Generative Replay for Continual Federated Learning
by: Qi, Daiqing, et al.
Published: (2023) -
PhotoAgent: A Robotic Photographer with Spatial and Aesthetic Understanding
by: Che, Lirong, et al.
Published: (2026) -
Seeing Through Words: Controlling Visual Retrieval Quality with Language Models
by: Lu, Jianglin, et al.
Published: (2026) -
More Than the Final Answer: Improving Visual Extraction and Logical Consistency in Vision-Language Models
by: Just, Hoang Anh, et al.
Published: (2025)