Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Li, Zhuofeng, Gou, Zixing, Zhang, Xiangnan, Liu, Zhongyuan, Li, Sirui, Hu, Yuntong, Ling, Chen, Zhang, Zheng, Zhao, Liang
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2406.10310
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866910711768154112
author Li, Zhuofeng
Gou, Zixing
Zhang, Xiangnan
Liu, Zhongyuan
Li, Sirui
Hu, Yuntong
Ling, Chen
Zhang, Zheng
Zhao, Liang
author_facet Li, Zhuofeng
Gou, Zixing
Zhang, Xiangnan
Liu, Zhongyuan
Li, Sirui
Hu, Yuntong
Ling, Chen
Zhang, Zheng
Zhao, Liang
contents Text-Attributed Graphs (TAGs) augment graph structures with natural language descriptions, facilitating detailed depictions of data and their interconnections across various real-world settings. However, existing TAG datasets predominantly feature textual information only at the nodes, with edges typically represented by mere binary or categorical attributes. This lack of rich textual edge annotations significantly limits the exploration of contextual relationships between entities, hindering deeper insights into graph-structured data. To address this gap, we introduce Textual-Edge Graphs Datasets and Benchmark (TEG-DB), a comprehensive and diverse collection of benchmark textual-edge datasets featuring rich textual descriptions on nodes and edges. The TEG-DB datasets are large-scale and encompass a wide range of domains, from citation networks to social networks. In addition, we conduct extensive benchmark experiments on TEG-DB to assess the extent to which current techniques, including pre-trained language models, graph neural networks, and their combinations, can utilize textual node and edge information. Our goal is to elicit advancements in textual-edge graph research, specifically in developing methodologies that exploit rich textual node and edge descriptions to enhance graph analysis and provide deeper insights into complex real-world networks. The entire TEG-DB project is publicly accessible as an open-source repository on Github, accessible at https://github.com/Zhuofeng-Li/TEG-Benchmark.
format Preprint
id arxiv_https___arxiv_org_abs_2406_10310
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle TEG-DB: A Comprehensive Dataset and Benchmark of Textual-Edge Graphs
Li, Zhuofeng
Gou, Zixing
Zhang, Xiangnan
Liu, Zhongyuan
Li, Sirui
Hu, Yuntong
Ling, Chen
Zhang, Zheng
Zhao, Liang
Computation and Language
Artificial Intelligence
Text-Attributed Graphs (TAGs) augment graph structures with natural language descriptions, facilitating detailed depictions of data and their interconnections across various real-world settings. However, existing TAG datasets predominantly feature textual information only at the nodes, with edges typically represented by mere binary or categorical attributes. This lack of rich textual edge annotations significantly limits the exploration of contextual relationships between entities, hindering deeper insights into graph-structured data. To address this gap, we introduce Textual-Edge Graphs Datasets and Benchmark (TEG-DB), a comprehensive and diverse collection of benchmark textual-edge datasets featuring rich textual descriptions on nodes and edges. The TEG-DB datasets are large-scale and encompass a wide range of domains, from citation networks to social networks. In addition, we conduct extensive benchmark experiments on TEG-DB to assess the extent to which current techniques, including pre-trained language models, graph neural networks, and their combinations, can utilize textual node and edge information. Our goal is to elicit advancements in textual-edge graph research, specifically in developing methodologies that exploit rich textual node and edge descriptions to enhance graph analysis and provide deeper insights into complex real-world networks. The entire TEG-DB project is publicly accessible as an open-source repository on Github, accessible at https://github.com/Zhuofeng-Li/TEG-Benchmark.
title TEG-DB: A Comprehensive Dataset and Benchmark of Textual-Edge Graphs
topic Computation and Language
Artificial Intelligence
url https://arxiv.org/abs/2406.10310