:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kim, Jae Myung, Alaniz, Stephan, Schmid, Cordelia, Akata, Zeynep
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2505.11181
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance
by: Kim, Jae Myung, et al.
Published: (2025)

DataDream: Few-shot Guided Dataset Generation
by: Kim, Jae Myung, et al.
Published: (2024)

Discovering Chunks in Neural Embeddings for Interpretability
by: Wu, Shuchen, et al.
Published: (2025)

FLAIR: VLM with Fine-grained Language-informed Image Representations
by: Xiao, Rui, et al.
Published: (2024)

SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions
by: Bader, Jessica, et al.
Published: (2025)

FINER: MLLMs Hallucinate under Fine-grained Negative Queries
by: Xiao, Rui, et al.
Published: (2026)

COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
by: Kim, Sanghwan, et al.
Published: (2024)

Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models
by: Singhi, Nishad, et al.
Published: (2024)

Concept-Guided Interpretability via Neural Chunking
by: Wu, Shuchen, et al.
Published: (2025)

Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data
by: Liu, Yiwen, et al.
Published: (2025)

From Drop-off to Recovery: A Mechanistic Analysis of Segmentation in MLLMs
by: Wu, Boyong, et al.
Published: (2026)

Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models
by: Pach, Mateusz, et al.
Published: (2025)

Explaining CLIP Zero-shot Predictions Through Concepts
by: Ozdemir, Onat, et al.
Published: (2026)

A Large Scale Analysis of Gender Biases in Text-to-Image Generative Models
by: Girrbach, Leander, et al.
Published: (2025)

SOTAlign: Semi-Supervised Alignment of Unimodal Vision and Language Models via Optimal Transport
by: Roschmann, Simon, et al.
Published: (2026)

Are Reasoning LLMs Robust to Interventions on Their Chain-of-Thought?
by: von Recum, Alexander, et al.
Published: (2026)

SemioLLM: Evaluating Large Language Models for Diagnostic Reasoning from Unstructured Clinical Narratives in Epilepsy
by: Dani, Meghal, et al.
Published: (2024)

Benchmarking Open-Source Large Language Models for Persian in Zero-Shot and Few-Shot Learning
by: Cherakhloo, Mahdi, et al.
Published: (2025)

Learning Primitive Relations for Compositional Zero-Shot Learning
by: Lee, Insu, et al.
Published: (2025)

Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs)
by: Girrbach, Leander, et al.
Published: (2024)

Do LLMs Experience an Internal Polylogue? Investigating Reasoning through the Lens of Personas
by: Herrmann, Nils A., et al.
Published: (2026)

Training-free Uncertainty Guidance for Complex Visual Tasks with MLLMs
by: Kim, Sanghwan, et al.
Published: (2025)

Gondola: Grounded Vision Language Planning for Generalizable Robotic Manipulation
by: Chen, Shizhe, et al.
Published: (2025)

Structured Cognitive Loop for Behavioral Intelligence in Large Language Model Agents
by: Kim, Myung Ho
Published: (2025)

Continual Learning in Vision-Language Models via Aligned Model Merging
by: Sokar, Ghada, et al.
Published: (2025)

Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models
by: Girrbach, Leander, et al.
Published: (2025)

Memory-Free Continual Learning with Null Space Adaptation for Zero-Shot Vision-Language Models
by: Jo, Yujin, et al.
Published: (2025)

CaptionFormer: Unified Segmentation, Tracking, and Captioning for Spatio-Temporal Objects
by: Fiastre, Gabriel, et al.
Published: (2025)

A Guideline-Aware AI Agent for Zero-Shot Target Volume Auto-Delineation
by: Kim, Yoon Jo, et al.
Published: (2026)

Unlocking Transfer Learning for Open-World Few-Shot Recognition
by: Kim, Byeonggeun, et al.
Published: (2024)

Dreaming of Many Worlds: Learning Contextual World Models Aids Zero-Shot Generalization
by: Prasanna, Sai, et al.
Published: (2024)

Self-Prompting Large Language Models for Zero-Shot Open-Domain QA
by: Li, Junlong, et al.
Published: (2022)

What Drives Compositional Generalization? The Importance of Continuous Training Objectives in Visual Generative Models
by: Farid, Karim, et al.
Published: (2025)

Building, Reusing, and Generalizing Abstract Representations from Concrete Sequences
by: Wu, Shuchen, et al.
Published: (2024)

Visual Lexicon: Rich Image Features in Language Space
by: Wang, XuDong, et al.
Published: (2024)

Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning
by: Rocamonde, Juan, et al.
Published: (2023)

Prompt-Based Continual Compositional Zero-Shot Learning
by: Maryam, Sauda, et al.
Published: (2025)

Zero-Shot Robustification of Zero-Shot Models
by: Adila, Dyah, et al.
Published: (2023)

Jointly Learning Predicates and Actions Enables Zero-Shot Skill Composition
by: Quartey, Benedict, et al.
Published: (2026)

Zero-Shot Goal Recognition with Large Language Models
by: Gusmão, Kin Max Piamolini, et al.
Published: (2026)