:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Burghouts, Gertjan, Schaaphok, Marianne, van Bekkum, Michael, Meijer, Wouter, Hillerström, Fieke, van Mil, Jelle
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2407.13368
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Which objects help me to act effectively? Reasoning about physically-grounded affordances
by: Kemmeren, Anne, et al.
Published: (2024)

Towards Probabilistic Inductive Logic Programming with Neurosymbolic Inference and Relaxation
by: Hillerstrom, Fieke, et al.
Published: (2024)

Open-World Visual Reasoning by a Neuro-Symbolic Program of Zero-Shot Symbols
by: Burghouts, Gertjan, et al.
Published: (2024)

Textual Inversion for Efficient Adaptation of Open-Vocabulary Object Detectors Without Forgetting
by: Ruis, Frank, et al.
Published: (2025)

Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation
by: Hoftijzer, Dennis, et al.
Published: (2024)

Adaptive Prompt Tuning: Vision Guided Prompt Tuning with Cross-Attention for Fine-Grained Few-Shot Learning
by: Brouwer, Eric, et al.
Published: (2024)

Guided SAM: Label-Efficient Part Segmentation
by: van Rooij, S. B., et al.
Published: (2025)

Self-Supervised Partial Cycle-Consistency for Multi-View Matching
by: Taggenbrock, Fedor, et al.
Published: (2025)

Neurosymbolic Inference On Foundation Models For Remote Sensing Text-to-image Retrieval With Complex Queries
by: Mezzi, Emanuele, et al.
Published: (2025)

Anticipating Future Object Compositions without Forgetting
by: Zahran, Youssef, et al.
Published: (2024)

Better Language Models Exhibit Higher Visual Alignment
by: Ruthardt, Jona, et al.
Published: (2024)

OASIC: Occlusion-Agnostic and Severity-Informed Classification
by: Gijzen, Kay, et al.
Published: (2026)

Incremental Learning of Affordances using Markov Logic Networks
by: Potter, George, et al.
Published: (2024)

Near, far: Patch-ordering enhances vision foundation models' scene understanding
by: Pariza, Valentinos, et al.
Published: (2024)

Occlusion Robustness of CLIP for Military Vehicle Classification
by: van Woerden, Jan Erik, et al.
Published: (2025)

AffordanceLLM: Grounding Affordance from Vision Language Models
by: Qian, Shengyi, et al.
Published: (2024)

Lightweight Uncertainty Quantification with Simplex Semantic Segmentation for Terrain Traversability
by: Dijk, Judith, et al.
Published: (2024)

Scaling 3D Reasoning with LMMs to Large Robot Mission Environments Using Datagraphs
by: Meijer, W. J., et al.
Published: (2024)

Beyond Perception Errors: Semantic Fixation in Large Vision-Language Models
by: Alam, Md Tanvirul
Published: (2026)

Can Large Vision-Language Models Correct Semantic Grounding Errors By Themselves?
by: Liao, Yuan-Hong, et al.
Published: (2024)

Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding
by: Gao, Xianqiang, et al.
Published: (2024)

StateVLM: A State-Aware Vision-Language Model for Robotic Affordance Reasoning
by: Sun, Xiaowen, et al.
Published: (2026)

Object Referring-Guided Scanpath Prediction with Perception-Enhanced Vision-Language Models
by: Quan, Rong, et al.
Published: (2026)

PAVLM: Advancing Point Cloud based Affordance Understanding Via Vision-Language Model
by: Liu, Shang-Ching, et al.
Published: (2024)

ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling
by: Özsoy, Ege, et al.
Published: (2024)

RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics
by: Yuan, Wentao, et al.
Published: (2024)

Beam-Guided Knowledge Replay for Knowledge-Rich Image Captioning using Vision-Language Model
by: AlJunaid, Reem, et al.
Published: (2025)

Affordance-R1: Reinforcement Learning for Generalizable Affordance Reasoning in Multimodal Large Language Model
by: Wang, Hanqing, et al.
Published: (2025)

Attention Guided Alignment in Efficient Vision-Language Models
by: Mahajan, Shweta, et al.
Published: (2025)

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning
by: Liu, Yang, et al.
Published: (2025)

Probing and Bridging Geometry-Interaction Cues for Affordance Reasoning in Vision Foundation Models
by: Zhang, Qing, et al.
Published: (2026)

HemBLIP: A Vision-Language Model for Interpretable Leukemia Cell Morphology Analysis
by: van Logtestijn, Julie, et al.
Published: (2026)

NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual Manipulation
by: Xu, Ran, et al.
Published: (2024)

On Error Propagation of Diffusion Models
by: Li, Yangming, et al.
Published: (2023)

Affordance-Guided Diffusion Prior for 3D Hand Reconstruction
by: Suzuki, Naru, et al.
Published: (2025)

GeneralVLA: Generalizable Vision-Language-Action Models with Knowledge-Guided Trajectory Planning
by: Ma, Guoqing, et al.
Published: (2026)

$Δ$VLA: Prior-Guided Vision-Language-Action Models via World Knowledge Variation
by: Zhu, Yijie, et al.
Published: (2026)

Towards Accurate UAV Image Perception: Guiding Vision-Language Models with Stronger Task Prompts
by: Guo, Mingning, et al.
Published: (2025)

Same or Not? Enhancing Visual Perception in Vision-Language Models
by: Marsili, Damiano, et al.
Published: (2025)

Resource Efficient Perception for Vision Systems
by: Subramanyam, A V, et al.
Published: (2024)