Saved in:
Bibliographic Details
Main Authors: Xie, Jiaqing, Chi, Ziheng
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2405.08334
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913892870914048
author Xie, Jiaqing
Chi, Ziheng
author_facet Xie, Jiaqing
Chi, Ziheng
contents Pretrained language models (LMs) showcase significant capabilities in processing molecular text, while concurrently, message passing neural networks (MPNNs) demonstrate resilience and versatility in the domain of molecular science. Despite these advancements, we find there are limited studies investigating the bidirectional interactions between molecular structures and their corresponding textual representations. Therefore, in this paper, we propose two strategies to evaluate whether an information integration can enhance the performance: contrast learning, which involves utilizing an MPNN to supervise the training of the LM, and fusion, which exploits information from both models. Our empirical analysis reveals that the integration approaches exhibit superior performance compared to baselines when applied to smaller molecular graphs, while these integration approaches do not yield performance enhancements on large scale graphs.
format Preprint
id arxiv_https___arxiv_org_abs_2405_08334
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Could Chemical LLMs benefit from Message Passing
Xie, Jiaqing
Chi, Ziheng
Machine Learning
Artificial Intelligence
Pretrained language models (LMs) showcase significant capabilities in processing molecular text, while concurrently, message passing neural networks (MPNNs) demonstrate resilience and versatility in the domain of molecular science. Despite these advancements, we find there are limited studies investigating the bidirectional interactions between molecular structures and their corresponding textual representations. Therefore, in this paper, we propose two strategies to evaluate whether an information integration can enhance the performance: contrast learning, which involves utilizing an MPNN to supervise the training of the LM, and fusion, which exploits information from both models. Our empirical analysis reveals that the integration approaches exhibit superior performance compared to baselines when applied to smaller molecular graphs, while these integration approaches do not yield performance enhancements on large scale graphs.
title Could Chemical LLMs benefit from Message Passing
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2405.08334