Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Premsri, Tanawan, Kordjamshidi, Parisa
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2502.17775
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917101119209472
author	Premsri, Tanawan Kordjamshidi, Parisa
author_facet	Premsri, Tanawan Kordjamshidi, Parisa
contents	Spatial reasoning is a fundamental aspect of human intelligence. One key concept in spatial cognition is the Frame of Reference, which identifies the perspective of spatial expressions. Despite its significance, FoR has received limited attention in AI models that need spatial intelligence. There is a lack of dedicated benchmarks and in-depth evaluation of large language models (LLMs) in this area. To address this issue, we introduce the Frame of Reference Evaluation in Spatial Reasoning Tasks (FoREST) benchmark, designed to assess FoR comprehension in LLMs. We evaluate LLMs on answering questions that require FoR comprehension and layout generation in text-to-image models using FoREST. Our results reveal a notable performance gap across different FoR classes in various LLMs, affecting their ability to generate accurate layouts for text-to-image generation. This highlights critical shortcomings in FoR comprehension. To improve FoR understanding, we propose Spatial-Guided prompting, which improves LLMs ability to extract essential spatial concepts. Our proposed method improves overall performance across spatial reasoning tasks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_17775
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks Premsri, Tanawan Kordjamshidi, Parisa Computation and Language Spatial reasoning is a fundamental aspect of human intelligence. One key concept in spatial cognition is the Frame of Reference, which identifies the perspective of spatial expressions. Despite its significance, FoR has received limited attention in AI models that need spatial intelligence. There is a lack of dedicated benchmarks and in-depth evaluation of large language models (LLMs) in this area. To address this issue, we introduce the Frame of Reference Evaluation in Spatial Reasoning Tasks (FoREST) benchmark, designed to assess FoR comprehension in LLMs. We evaluate LLMs on answering questions that require FoR comprehension and layout generation in text-to-image models using FoREST. Our results reveal a notable performance gap across different FoR classes in various LLMs, affecting their ability to generate accurate layouts for text-to-image generation. This highlights critical shortcomings in FoR comprehension. To improve FoR understanding, we propose Spatial-Guided prompting, which improves LLMs ability to extract essential spatial concepts. Our proposed method improves overall performance across spatial reasoning tasks.
title	FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks
topic	Computation and Language
url	https://arxiv.org/abs/2502.17775

Similar Items