Saved in:
| Main Authors: | Kim, Jae Myung, Alaniz, Stephan, Schmid, Cordelia, Akata, Zeynep |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.11181 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance
by: Kim, Jae Myung, et al.
Published: (2025)
by: Kim, Jae Myung, et al.
Published: (2025)
DataDream: Few-shot Guided Dataset Generation
by: Kim, Jae Myung, et al.
Published: (2024)
by: Kim, Jae Myung, et al.
Published: (2024)
Discovering Chunks in Neural Embeddings for Interpretability
by: Wu, Shuchen, et al.
Published: (2025)
by: Wu, Shuchen, et al.
Published: (2025)
FLAIR: VLM with Fine-grained Language-informed Image Representations
by: Xiao, Rui, et al.
Published: (2024)
by: Xiao, Rui, et al.
Published: (2024)
SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions
by: Bader, Jessica, et al.
Published: (2025)
by: Bader, Jessica, et al.
Published: (2025)
FINER: MLLMs Hallucinate under Fine-grained Negative Queries
by: Xiao, Rui, et al.
Published: (2026)
by: Xiao, Rui, et al.
Published: (2026)
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
by: Kim, Sanghwan, et al.
Published: (2024)
by: Kim, Sanghwan, et al.
Published: (2024)
Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models
by: Singhi, Nishad, et al.
Published: (2024)
by: Singhi, Nishad, et al.
Published: (2024)
Concept-Guided Interpretability via Neural Chunking
by: Wu, Shuchen, et al.
Published: (2025)
by: Wu, Shuchen, et al.
Published: (2025)
Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data
by: Liu, Yiwen, et al.
Published: (2025)
by: Liu, Yiwen, et al.
Published: (2025)
From Drop-off to Recovery: A Mechanistic Analysis of Segmentation in MLLMs
by: Wu, Boyong, et al.
Published: (2026)
by: Wu, Boyong, et al.
Published: (2026)
Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models
by: Pach, Mateusz, et al.
Published: (2025)
by: Pach, Mateusz, et al.
Published: (2025)
Explaining CLIP Zero-shot Predictions Through Concepts
by: Ozdemir, Onat, et al.
Published: (2026)
by: Ozdemir, Onat, et al.
Published: (2026)
A Large Scale Analysis of Gender Biases in Text-to-Image Generative Models
by: Girrbach, Leander, et al.
Published: (2025)
by: Girrbach, Leander, et al.
Published: (2025)
SOTAlign: Semi-Supervised Alignment of Unimodal Vision and Language Models via Optimal Transport
by: Roschmann, Simon, et al.
Published: (2026)
by: Roschmann, Simon, et al.
Published: (2026)
Are Reasoning LLMs Robust to Interventions on Their Chain-of-Thought?
by: von Recum, Alexander, et al.
Published: (2026)
by: von Recum, Alexander, et al.
Published: (2026)
SemioLLM: Evaluating Large Language Models for Diagnostic Reasoning from Unstructured Clinical Narratives in Epilepsy
by: Dani, Meghal, et al.
Published: (2024)
by: Dani, Meghal, et al.
Published: (2024)
Benchmarking Open-Source Large Language Models for Persian in Zero-Shot and Few-Shot Learning
by: Cherakhloo, Mahdi, et al.
Published: (2025)
by: Cherakhloo, Mahdi, et al.
Published: (2025)
Learning Primitive Relations for Compositional Zero-Shot Learning
by: Lee, Insu, et al.
Published: (2025)
by: Lee, Insu, et al.
Published: (2025)
Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs)
by: Girrbach, Leander, et al.
Published: (2024)
by: Girrbach, Leander, et al.
Published: (2024)
Do LLMs Experience an Internal Polylogue? Investigating Reasoning through the Lens of Personas
by: Herrmann, Nils A., et al.
Published: (2026)
by: Herrmann, Nils A., et al.
Published: (2026)
Training-free Uncertainty Guidance for Complex Visual Tasks with MLLMs
by: Kim, Sanghwan, et al.
Published: (2025)
by: Kim, Sanghwan, et al.
Published: (2025)
Gondola: Grounded Vision Language Planning for Generalizable Robotic Manipulation
by: Chen, Shizhe, et al.
Published: (2025)
by: Chen, Shizhe, et al.
Published: (2025)
Structured Cognitive Loop for Behavioral Intelligence in Large Language Model Agents
by: Kim, Myung Ho
Published: (2025)
by: Kim, Myung Ho
Published: (2025)
Continual Learning in Vision-Language Models via Aligned Model Merging
by: Sokar, Ghada, et al.
Published: (2025)
by: Sokar, Ghada, et al.
Published: (2025)
Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models
by: Girrbach, Leander, et al.
Published: (2025)
by: Girrbach, Leander, et al.
Published: (2025)
Memory-Free Continual Learning with Null Space Adaptation for Zero-Shot Vision-Language Models
by: Jo, Yujin, et al.
Published: (2025)
by: Jo, Yujin, et al.
Published: (2025)
CaptionFormer: Unified Segmentation, Tracking, and Captioning for Spatio-Temporal Objects
by: Fiastre, Gabriel, et al.
Published: (2025)
by: Fiastre, Gabriel, et al.
Published: (2025)
A Guideline-Aware AI Agent for Zero-Shot Target Volume Auto-Delineation
by: Kim, Yoon Jo, et al.
Published: (2026)
by: Kim, Yoon Jo, et al.
Published: (2026)
Unlocking Transfer Learning for Open-World Few-Shot Recognition
by: Kim, Byeonggeun, et al.
Published: (2024)
by: Kim, Byeonggeun, et al.
Published: (2024)
Dreaming of Many Worlds: Learning Contextual World Models Aids Zero-Shot Generalization
by: Prasanna, Sai, et al.
Published: (2024)
by: Prasanna, Sai, et al.
Published: (2024)
Self-Prompting Large Language Models for Zero-Shot Open-Domain QA
by: Li, Junlong, et al.
Published: (2022)
by: Li, Junlong, et al.
Published: (2022)
What Drives Compositional Generalization? The Importance of Continuous Training Objectives in Visual Generative Models
by: Farid, Karim, et al.
Published: (2025)
by: Farid, Karim, et al.
Published: (2025)
Building, Reusing, and Generalizing Abstract Representations from Concrete Sequences
by: Wu, Shuchen, et al.
Published: (2024)
by: Wu, Shuchen, et al.
Published: (2024)
Visual Lexicon: Rich Image Features in Language Space
by: Wang, XuDong, et al.
Published: (2024)
by: Wang, XuDong, et al.
Published: (2024)
Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning
by: Rocamonde, Juan, et al.
Published: (2023)
by: Rocamonde, Juan, et al.
Published: (2023)
Prompt-Based Continual Compositional Zero-Shot Learning
by: Maryam, Sauda, et al.
Published: (2025)
by: Maryam, Sauda, et al.
Published: (2025)
Zero-Shot Robustification of Zero-Shot Models
by: Adila, Dyah, et al.
Published: (2023)
by: Adila, Dyah, et al.
Published: (2023)
Jointly Learning Predicates and Actions Enables Zero-Shot Skill Composition
by: Quartey, Benedict, et al.
Published: (2026)
by: Quartey, Benedict, et al.
Published: (2026)
Zero-Shot Goal Recognition with Large Language Models
by: Gusmão, Kin Max Piamolini, et al.
Published: (2026)
by: Gusmão, Kin Max Piamolini, et al.
Published: (2026)
Similar Items
-
LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance
by: Kim, Jae Myung, et al.
Published: (2025) -
DataDream: Few-shot Guided Dataset Generation
by: Kim, Jae Myung, et al.
Published: (2024) -
Discovering Chunks in Neural Embeddings for Interpretability
by: Wu, Shuchen, et al.
Published: (2025) -
FLAIR: VLM with Fine-grained Language-informed Image Representations
by: Xiao, Rui, et al.
Published: (2024) -
SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions
by: Bader, Jessica, et al.
Published: (2025)