Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Hou, Shilong, Shang, Ruilin, Long, Zi, Fu, Xianghua, Chen, Yin
Format:	Preprint
Published:	2025
Subjects:	Cryptography and Security Computation and Language
Online Access:	https://arxiv.org/abs/2502.15233
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916624372596736
author	Hou, Shilong Shang, Ruilin Long, Zi Fu, Xianghua Chen, Yin
author_facet	Hou, Shilong Shang, Ruilin Long, Zi Fu, Xianghua Chen, Yin
contents	An increasing number of companies have begun providing services that leverage cloud-based large language models (LLMs), such as ChatGPT. However, this development raises substantial privacy concerns, as users' prompts are transmitted to and processed by the model providers. Among the various privacy protection methods for LLMs, those implemented during the pre-training and fine-tuning phrases fail to mitigate the privacy risks associated with the remote use of cloud-based LLMs by users. On the other hand, methods applied during the inference phrase are primarily effective in scenarios where the LLM's inference does not rely on privacy-sensitive information. In this paper, we outline the process of remote user interaction with LLMs and, for the first time, propose a detailed definition of a general pseudonymization framework applicable to cloud-based LLMs. The experimental results demonstrate that the proposed framework strikes an optimal balance between privacy protection and utility. The code for our method is available to the public at https://github.com/Mebymeby/Pseudonymization-Framework.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_15233
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	A General Pseudonymization Framework for Cloud-Based LLMs: Replacing Privacy Information in Controlled Text Generation Hou, Shilong Shang, Ruilin Long, Zi Fu, Xianghua Chen, Yin Cryptography and Security Computation and Language An increasing number of companies have begun providing services that leverage cloud-based large language models (LLMs), such as ChatGPT. However, this development raises substantial privacy concerns, as users' prompts are transmitted to and processed by the model providers. Among the various privacy protection methods for LLMs, those implemented during the pre-training and fine-tuning phrases fail to mitigate the privacy risks associated with the remote use of cloud-based LLMs by users. On the other hand, methods applied during the inference phrase are primarily effective in scenarios where the LLM's inference does not rely on privacy-sensitive information. In this paper, we outline the process of remote user interaction with LLMs and, for the first time, propose a detailed definition of a general pseudonymization framework applicable to cloud-based LLMs. The experimental results demonstrate that the proposed framework strikes an optimal balance between privacy protection and utility. The code for our method is available to the public at https://github.com/Mebymeby/Pseudonymization-Framework.
title	A General Pseudonymization Framework for Cloud-Based LLMs: Replacing Privacy Information in Controlled Text Generation
topic	Cryptography and Security Computation and Language
url	https://arxiv.org/abs/2502.15233

Similar Items