Saved in:
Bibliographic Details
Main Authors: Chabin, Jacques, Halfeld-Ferrari, Mirian, Hiot, Nicolas
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.09441
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909526359277568
author Chabin, Jacques
Halfeld-Ferrari, Mirian
Hiot, Nicolas
author_facet Chabin, Jacques
Halfeld-Ferrari, Mirian
Hiot, Nicolas
contents We present a general methodology for structuring textual data, represented as syntax trees enriched with semantic information, guided by a meta-model G defined as an attribute grammar. The method involves an evolution process where both the instance and its grammar evolve, with instance transformations guided by rewriting rules and a similarity measure. Each new instance generates a corresponding grammar, culminating in a target grammar GT that satisfies G. This methodology is applied to build a database populated from textual data. The process generates both a database schema and its instance, independent of specific database models. We demonstrate the approach using clinical medical cases, where trees represent database instances and grammars act as database schemas. Key contributions include the proposal of a general attribute grammar G, a formalization of grammar evolution, and a proof-of-concept implementation for database structuring.
format Preprint
id arxiv_https___arxiv_org_abs_2410_09441
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle From Text to Databases: attribute grammar as database meta-model
Chabin, Jacques
Halfeld-Ferrari, Mirian
Hiot, Nicolas
Databases
We present a general methodology for structuring textual data, represented as syntax trees enriched with semantic information, guided by a meta-model G defined as an attribute grammar. The method involves an evolution process where both the instance and its grammar evolve, with instance transformations guided by rewriting rules and a similarity measure. Each new instance generates a corresponding grammar, culminating in a target grammar GT that satisfies G. This methodology is applied to build a database populated from textual data. The process generates both a database schema and its instance, independent of specific database models. We demonstrate the approach using clinical medical cases, where trees represent database instances and grammars act as database schemas. Key contributions include the proposal of a general attribute grammar G, a formalization of grammar evolution, and a proof-of-concept implementation for database structuring.
title From Text to Databases: attribute grammar as database meta-model
topic Databases
url https://arxiv.org/abs/2410.09441