| Main Authors: | Burghouts, Gertjan, Schaaphok, Marianne, van Bekkum, Michael, Meijer, Wouter, Hillerström, Fieke, van Mil, Jelle |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2407.13368 |
Similar Items
Which objects help me to act effectively? Reasoning about physically-grounded affordances
by: Kemmeren, Anne, et al.
Published: (2024)
Towards Probabilistic Inductive Logic Programming with Neurosymbolic Inference and Relaxation
by: Hillerstrom, Fieke, et al.
Published: (2024)
Open-World Visual Reasoning by a Neuro-Symbolic Program of Zero-Shot Symbols
by: Burghouts, Gertjan, et al.
Published: (2024)
Textual Inversion for Efficient Adaptation of Open-Vocabulary Object Detectors Without Forgetting
by: Ruis, Frank, et al.
Published: (2025)
Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation
by: Hoftijzer, Dennis, et al.
Published: (2024)
Adaptive Prompt Tuning: Vision Guided Prompt Tuning with Cross-Attention for Fine-Grained Few-Shot Learning
by: Brouwer, Eric, et al.
Published: (2024)
Guided SAM: Label-Efficient Part Segmentation
by: van Rooij, S. B., et al.
Published: (2025)
Self-Supervised Partial Cycle-Consistency for Multi-View Matching
by: Taggenbrock, Fedor, et al.
Published: (2025)
Neurosymbolic Inference On Foundation Models For Remote Sensing Text-to-image Retrieval With Complex Queries
by: Mezzi, Emanuele, et al.
Published: (2025)
Anticipating Future Object Compositions without Forgetting
by: Zahran, Youssef, et al.
Published: (2024)
Better Language Models Exhibit Higher Visual Alignment
by: Ruthardt, Jona, et al.
Published: (2024)
OASIC: Occlusion-Agnostic and Severity-Informed Classification
by: Gijzen, Kay, et al.
Published: (2026)
Incremental Learning of Affordances using Markov Logic Networks
by: Potter, George, et al.
Published: (2024)
Near, far: Patch-ordering enhances vision foundation models' scene understanding
by: Pariza, Valentinos, et al.
Published: (2024)
Occlusion Robustness of CLIP for Military Vehicle Classification
by: van Woerden, Jan Erik, et al.
Published: (2025)
AffordanceLLM: Grounding Affordance from Vision Language Models
by: Qian, Shengyi, et al.
Published: (2024)
Lightweight Uncertainty Quantification with Simplex Semantic Segmentation for Terrain Traversability
by: Dijk, Judith, et al.
Published: (2024)
Scaling 3D Reasoning with LMMs to Large Robot Mission Environments Using Datagraphs
by: Meijer, W. J., et al.
Published: (2024)
Beyond Perception Errors: Semantic Fixation in Large Vision-Language Models
by: Alam, Md Tanvirul
Published: (2026)
Can Large Vision-Language Models Correct Semantic Grounding Errors By Themselves?
by: Liao, Yuan-Hong, et al.
Published: (2024)
Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding
by: Gao, Xianqiang, et al.
Published: (2024)
StateVLM: A State-Aware Vision-Language Model for Robotic Affordance Reasoning
by: Sun, Xiaowen, et al.
Published: (2026)
Object Referring-Guided Scanpath Prediction with Perception-Enhanced Vision-Language Models
by: Quan, Rong, et al.
Published: (2026)
PAVLM: Advancing Point Cloud based Affordance Understanding Via Vision-Language Model
by: Liu, Shang-Ching, et al.
Published: (2024)
ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling
by: Özsoy, Ege, et al.
Published: (2024)
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics
by: Yuan, Wentao, et al.
Published: (2024)
Beam-Guided Knowledge Replay for Knowledge-Rich Image Captioning using Vision-Language Model
by: AlJunaid, Reem, et al.
Published: (2025)
Affordance-R1: Reinforcement Learning for Generalizable Affordance Reasoning in Multimodal Large Language Model
by: Wang, Hanqing, et al.
Published: (2025)
Attention Guided Alignment in Efficient Vision-Language Models
by: Mahajan, Shweta, et al.
Published: (2025)
SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning
by: Liu, Yang, et al.
Published: (2025)
Probing and Bridging Geometry-Interaction Cues for Affordance Reasoning in Vision Foundation Models
by: Zhang, Qing, et al.
Published: (2026)
HemBLIP: A Vision-Language Model for Interpretable Leukemia Cell Morphology Analysis
by: van Logtestijn, Julie, et al.
Published: (2026)
NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual Manipulation
by: Xu, Ran, et al.
Published: (2024)
On Error Propagation of Diffusion Models
by: Li, Yangming, et al.
Published: (2023)
Affordance-Guided Diffusion Prior for 3D Hand Reconstruction
by: Suzuki, Naru, et al.
Published: (2025)
GeneralVLA: Generalizable Vision-Language-Action Models with Knowledge-Guided Trajectory Planning
by: Ma, Guoqing, et al.
Published: (2026)
$Δ$VLA: Prior-Guided Vision-Language-Action Models via World Knowledge Variation
by: Zhu, Yijie, et al.
Published: (2026)
Towards Accurate UAV Image Perception: Guiding Vision-Language Models with Stronger Task Prompts
by: Guo, Mingning, et al.
Published: (2025)
Same or Not? Enhancing Visual Perception in Vision-Language Models
by: Marsili, Damiano, et al.
Published: (2025)
Resource Efficient Perception for Vision Systems
by: Subramanyam, A V, et al.
Published: (2024)