Saved in:
Bibliographic Details
Main Authors: Boudoua, Bahdja, Guiffant, Nadia, Roche, Mathieu, Teisseire, Maguelonne, Tran, Annelise
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.13353
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • This document, based on feedback from UMR TETIS members and the scientific literature, provides a generic methodology for creating annotation guidelines and annotated textual datasets (corpora). It covers methodological aspects, as well as storage, sharing, and valorization of the data. It includes definitions and examples to clearly illustrate each step of the process, thus providing a comprehensive framework to support the creation and use of corpora in various research contexts.