Saved in:
Bibliographic Details
Main Authors: Ergul, Halil Ibrahim, Balcisoy, Selim, Bozkaya, Burcin
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.15724
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913703055589376
author Ergul, Halil Ibrahim
Balcisoy, Selim
Bozkaya, Burcin
author_facet Ergul, Halil Ibrahim
Balcisoy, Selim
Bozkaya, Burcin
contents In this study, the performance of various predictive models, including probabilistic baseline, CNN, LSTM, and finetuned LLMs, in forecasting merchant categories from financial transaction data have been evaluated. Utilizing datasets from Bank A for training and Bank B for testing, the superior predictive capabilities of the fine-tuned Mistral Instruct model, which was trained using customer data converted into natural language format have been demonstrated. The methodology of this study involves instruction fine-tuning Mistral via LoRA (LowRank Adaptation of Large Language Models) to adapt its vast pre-trained knowledge to the specific domain of financial transactions. The Mistral model significantly outperforms traditional sequential models, achieving higher F1 scores in the three key merchant categories of bank transaction data (grocery, clothing, and gas stations) that is crucial for targeted marketing campaigns. This performance is attributed to the model's enhanced semantic understanding and adaptability which enables it to better manage minority classes and predict transaction categories with greater accuracy. These findings highlight the potential of LLMs in predicting human behavior.
format Preprint
id arxiv_https___arxiv_org_abs_2502_15724
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Instruction-Based Fine-tuning of Open-Source LLMs for Predicting Customer Purchase Behaviors
Ergul, Halil Ibrahim
Balcisoy, Selim
Bozkaya, Burcin
Information Retrieval
Artificial Intelligence
In this study, the performance of various predictive models, including probabilistic baseline, CNN, LSTM, and finetuned LLMs, in forecasting merchant categories from financial transaction data have been evaluated. Utilizing datasets from Bank A for training and Bank B for testing, the superior predictive capabilities of the fine-tuned Mistral Instruct model, which was trained using customer data converted into natural language format have been demonstrated. The methodology of this study involves instruction fine-tuning Mistral via LoRA (LowRank Adaptation of Large Language Models) to adapt its vast pre-trained knowledge to the specific domain of financial transactions. The Mistral model significantly outperforms traditional sequential models, achieving higher F1 scores in the three key merchant categories of bank transaction data (grocery, clothing, and gas stations) that is crucial for targeted marketing campaigns. This performance is attributed to the model's enhanced semantic understanding and adaptability which enables it to better manage minority classes and predict transaction categories with greater accuracy. These findings highlight the potential of LLMs in predicting human behavior.
title Instruction-Based Fine-tuning of Open-Source LLMs for Predicting Customer Purchase Behaviors
topic Information Retrieval
Artificial Intelligence
url https://arxiv.org/abs/2502.15724