Saved in:
Bibliographic Details
Main Authors: Gao, Yingqiang, Matoshi, Veton, Rolshoven, Luca, Ellendorff, Tilia, Binder, Judith, Jann, Jeremy Austin, Schneider, Gerold, Stürmer, Matthias
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.22513
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908908540395520
author Gao, Yingqiang
Matoshi, Veton
Rolshoven, Luca
Ellendorff, Tilia
Binder, Judith
Jann, Jeremy Austin
Schneider, Gerold
Stürmer, Matthias
author_facet Gao, Yingqiang
Matoshi, Veton
Rolshoven, Luca
Ellendorff, Tilia
Binder, Judith
Jann, Jeremy Austin
Schneider, Gerold
Stürmer, Matthias
contents Public procurement refers to the process by which public sector institutions, such as governments, municipalities, and publicly funded bodies, acquire goods and services. Swiss law requires the integration of ecological, social, and economic sustainability requirements into tender evaluations in the format of criteria that have to be fulfilled by a bidder. However, translating high-level sustainability regulations into concrete, verifiable, and sector-specific procurement criteria (such as selection criteria, award criteria, and technical specifications) remains a labor-intensive and error-prone manual task, requiring substantial domain expertise in several groups of goods and services and considerable manual effort. This paper presents a configurable, LLM-assisted pipeline that is presented as a software supporting the systematic generation and evaluation of sustainability-oriented procurement criteria catalogs for Switzerland. The system integrates in-context prompting, interchangeable LLM backends, and automated output validation to enable auditable criteria generation across different procurement sectors. As a proof of concept, we instantiate the pipeline using official sustainability guidelines published by the Swiss government and the European Commission, which are ingested as structured reference documents. We evaluate the system through a combination of automated quality checks, including an LLM-based evaluation component, and expert comparison against a manually curated gold standard. Our results demonstrate that the proposed pipeline can substantially reduce manual drafting effort while producing criteria catalogs that are consistent with official guidelines. We further discuss system limitations, failure modes, and design trade-offs observed during deployment, highlighting key considerations for integrating generative AI into public sector software workflows.
format Preprint
id arxiv_https___arxiv_org_abs_2603_22513
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Generating and Evaluating Sustainable Procurement Criteria for the Swiss Public Sector using In-Context Prompting with Large Language Models
Gao, Yingqiang
Matoshi, Veton
Rolshoven, Luca
Ellendorff, Tilia
Binder, Judith
Jann, Jeremy Austin
Schneider, Gerold
Stürmer, Matthias
Software Engineering
Computation and Language
Public procurement refers to the process by which public sector institutions, such as governments, municipalities, and publicly funded bodies, acquire goods and services. Swiss law requires the integration of ecological, social, and economic sustainability requirements into tender evaluations in the format of criteria that have to be fulfilled by a bidder. However, translating high-level sustainability regulations into concrete, verifiable, and sector-specific procurement criteria (such as selection criteria, award criteria, and technical specifications) remains a labor-intensive and error-prone manual task, requiring substantial domain expertise in several groups of goods and services and considerable manual effort. This paper presents a configurable, LLM-assisted pipeline that is presented as a software supporting the systematic generation and evaluation of sustainability-oriented procurement criteria catalogs for Switzerland. The system integrates in-context prompting, interchangeable LLM backends, and automated output validation to enable auditable criteria generation across different procurement sectors. As a proof of concept, we instantiate the pipeline using official sustainability guidelines published by the Swiss government and the European Commission, which are ingested as structured reference documents. We evaluate the system through a combination of automated quality checks, including an LLM-based evaluation component, and expert comparison against a manually curated gold standard. Our results demonstrate that the proposed pipeline can substantially reduce manual drafting effort while producing criteria catalogs that are consistent with official guidelines. We further discuss system limitations, failure modes, and design trade-offs observed during deployment, highlighting key considerations for integrating generative AI into public sector software workflows.
title Generating and Evaluating Sustainable Procurement Criteria for the Swiss Public Sector using In-Context Prompting with Large Language Models
topic Software Engineering
Computation and Language
url https://arxiv.org/abs/2603.22513