Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wu, Jiayi, Wu, Zhengyu, Li, Ronghua, Qin, Hongchao, Wang, Guoren
Format:	Preprint
Published:	2024
Subjects:	Databases
Online Access:	https://arxiv.org/abs/2402.00292
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910313879699456
author	Wu, Jiayi Wu, Zhengyu Li, Ronghua Qin, Hongchao Wang, Guoren
author_facet	Wu, Jiayi Wu, Zhengyu Li, Ronghua Qin, Hongchao Wang, Guoren
contents	Graph database engines play a pivotal role in efficiently storing and managing graph data across various domains, including bioinformatics, knowledge graphs, and recommender systems. Ensuring data accuracy within graph database engines is paramount, as inaccuracies can yield unreliable analytical outcomes. Current bug-detection approaches are confined to specific graph query languages, limiting their applicabilities when handling graph database engines that use various graph query languages across various domains. Moreover, they require extensive prior knowledge to generate queries for detecting bugs. To address these challenges, we introduces DGDB, a novel paradigm harnessing large language models(LLM), such as ChatGPT, for comprehensive bug detection in graph database engines. DGDB leverages ChatGPT to generate high-quality queries for different graph query languages. It subsequently employs differential testing to identify bugs in graph database engines. We applied this paradigm to graph database engines using the Gremlin query language and those using the Cypher query language, generating approximately 4,000 queries each. In the latest versions of Neo4j, Agensgraph, and JanusGraph databases, we detected 2, 5, and 3 wrong-result bugs, respectively.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_00292
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Effective Bug Detection in Graph Database Engines: An LLM-based Approach Wu, Jiayi Wu, Zhengyu Li, Ronghua Qin, Hongchao Wang, Guoren Databases Graph database engines play a pivotal role in efficiently storing and managing graph data across various domains, including bioinformatics, knowledge graphs, and recommender systems. Ensuring data accuracy within graph database engines is paramount, as inaccuracies can yield unreliable analytical outcomes. Current bug-detection approaches are confined to specific graph query languages, limiting their applicabilities when handling graph database engines that use various graph query languages across various domains. Moreover, they require extensive prior knowledge to generate queries for detecting bugs. To address these challenges, we introduces DGDB, a novel paradigm harnessing large language models(LLM), such as ChatGPT, for comprehensive bug detection in graph database engines. DGDB leverages ChatGPT to generate high-quality queries for different graph query languages. It subsequently employs differential testing to identify bugs in graph database engines. We applied this paradigm to graph database engines using the Gremlin query language and those using the Cypher query language, generating approximately 4,000 queries each. In the latest versions of Neo4j, Agensgraph, and JanusGraph databases, we detected 2, 5, and 3 wrong-result bugs, respectively.
title	Effective Bug Detection in Graph Database Engines: An LLM-based Approach
topic	Databases
url	https://arxiv.org/abs/2402.00292

Similar Items