Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Bogdanova, Liliia, Sun, Shiran, Han, Lifeng, Lefort, Natalia Amat, Plaza-del-Arco, Flor Miriam
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2603.01910
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914363317682176
author	Bogdanova, Liliia Sun, Shiran Han, Lifeng Lefort, Natalia Amat Plaza-del-Arco, Flor Miriam
author_facet	Bogdanova, Liliia Sun, Shiran Han, Lifeng Lefort, Natalia Amat Plaza-del-Arco, Flor Miriam
contents	This system paper describes our participation in the SemEval-2025 Task-7 ``Everyday Knowledge Across Diverse Languages and Cultures''. We attended two subtasks, i.e., Track 1: Short Answer Questions (SAQ), and Track 2: Multiple-Choice Questions (MCQ). The methods we used are retrieval augmented generation (RAGs) with open-sourced smaller LLMs (OS-sLLMs). To better adapt to this shared task, we created our own culturally aware knowledge base (CulKBs) by extracting Wikipedia content using keyword lists we prepared. We extracted both culturally-aware wiki-text and country-specific wiki-summary. In addition to the local CulKBs, we also have one system integrating live online search output via DuckDuckGo. Towards better privacy and sustainability, we aimed to deploy smaller LLMs (sLLMs) that are open-sourced on the Ollama platform. We share the prompts we developed using refinement techniques and report the learning curve of such prompts. The tested languages are English, Spanish, and Chinese for both tracks. Our resources and codes are shared via https://github.com/aaronlifenghan/FLANS-2026
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_01910
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	FLANS at SemEval-2026 Task 7: RAG with Open-Sourced Smaller LLMs for Everyday Knowledge Across Diverse Languages and Cultures Bogdanova, Liliia Sun, Shiran Han, Lifeng Lefort, Natalia Amat Plaza-del-Arco, Flor Miriam Computation and Language Artificial Intelligence This system paper describes our participation in the SemEval-2025 Task-7 ``Everyday Knowledge Across Diverse Languages and Cultures''. We attended two subtasks, i.e., Track 1: Short Answer Questions (SAQ), and Track 2: Multiple-Choice Questions (MCQ). The methods we used are retrieval augmented generation (RAGs) with open-sourced smaller LLMs (OS-sLLMs). To better adapt to this shared task, we created our own culturally aware knowledge base (CulKBs) by extracting Wikipedia content using keyword lists we prepared. We extracted both culturally-aware wiki-text and country-specific wiki-summary. In addition to the local CulKBs, we also have one system integrating live online search output via DuckDuckGo. Towards better privacy and sustainability, we aimed to deploy smaller LLMs (sLLMs) that are open-sourced on the Ollama platform. We share the prompts we developed using refinement techniques and report the learning curve of such prompts. The tested languages are English, Spanish, and Chinese for both tracks. Our resources and codes are shared via https://github.com/aaronlifenghan/FLANS-2026
title	FLANS at SemEval-2026 Task 7: RAG with Open-Sourced Smaller LLMs for Everyday Knowledge Across Diverse Languages and Cultures
topic	Computation and Language Artificial Intelligence
url	https://arxiv.org/abs/2603.01910

Similar Items