Key points are not available for this paper at this time.
It is a major challenge to maintain Differential Privacy (DP) when fine-tuning a Large Language Model (LLM) while also preserving the increased functionality that make fine-tuned LLMs appealing. In this paper, we explore the utilization of an LLM that has been modified for the purpose of encoded message transmission using text output as a medium, a task which falls under the classification of Linguistic Steganography. By examining the impact that DP preserving fine-tuning has on an LLM intended for such a specific and technical functionality, we evaluate what performance cost is imparted. Our experimentation focuses on using a modified implementation of Differentially Private Stochastic Gradient Descent, while fine-tuning a LLM on curated data taken from the ConvoKit Reddit dataset. We were able to securely fine-tune the LLM while maintaining a relatively strict DP privacy budget, and still benefit from the domain specific increased performance that LLM fine-tuning provides.
Coffey et al. (Fri,) studied this question.
Synapse has enriched 2 closely related papers on similar clinical questions. Consider them for comparative context: