Enregistré dans:
Détails bibliographiques
Auteurs principaux: Fu, Silvery D., Wang, David, Zhang, Wen, Ge, Kathleen
Format: Preprint
Publié: 2024
Sujets:
Accès en ligne:https://arxiv.org/abs/2406.11255
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866917696062357504
author Fu, Silvery D.
Wang, David
Zhang, Wen
Ge, Kathleen
author_facet Fu, Silvery D.
Wang, David
Zhang, Wen
Ge, Kathleen
contents Entity matching (EM), the task of identifying whether two descriptions refer to the same entity, is essential in data management. Traditional methods have evolved from rule-based to AI-driven approaches, yet current techniques using large language models (LLMs) often fall short due to their reliance on static knowledge and rigid, predefined prompts. In this paper, we introduce Libem, a compound AI system designed to address these limitations by incorporating a flexible, tool-oriented approach. Libem supports entity matching through dynamic tool use, self-refinement, and optimization, allowing it to adapt and refine its process based on the dataset and performance metrics. Unlike traditional solo-AI EM systems, which often suffer from a lack of modularity that hinders iterative design improvements and system optimization, Libem offers a composable and reusable toolchain. This approach aims to contribute to ongoing discussions and developments in AI-driven data management.
format Preprint
id arxiv_https___arxiv_org_abs_2406_11255
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Liberal Entity Matching as a Compound AI Toolchain
Fu, Silvery D.
Wang, David
Zhang, Wen
Ge, Kathleen
Databases
Artificial Intelligence
Software Engineering
Entity matching (EM), the task of identifying whether two descriptions refer to the same entity, is essential in data management. Traditional methods have evolved from rule-based to AI-driven approaches, yet current techniques using large language models (LLMs) often fall short due to their reliance on static knowledge and rigid, predefined prompts. In this paper, we introduce Libem, a compound AI system designed to address these limitations by incorporating a flexible, tool-oriented approach. Libem supports entity matching through dynamic tool use, self-refinement, and optimization, allowing it to adapt and refine its process based on the dataset and performance metrics. Unlike traditional solo-AI EM systems, which often suffer from a lack of modularity that hinders iterative design improvements and system optimization, Libem offers a composable and reusable toolchain. This approach aims to contribute to ongoing discussions and developments in AI-driven data management.
title Liberal Entity Matching as a Compound AI Toolchain
topic Databases
Artificial Intelligence
Software Engineering
url https://arxiv.org/abs/2406.11255