Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation

Xinyu Tang; Richard Shin; Huseyin Inan; Andre Manoel; Fatemehsadat Mireshghallah; Zinan Lin; Sivakanth Gopi; Janardhan (Jana) Kulkarni; Robert Sim

Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation

Xinyu Tang ,
Richard Shin ,
Huseyin Inan ,
Andre Manoel ,
Fatemehsadat Mireshghallah ,
Zinan Lin ,
Sivakanth Gopi ,
Janardhan (Jana) Kulkarni ,
Robert Sim

ICLR 2024 | May 2024

下载 BibTex

We study the problem of in-context learning (ICL) with large language models (LLMs) on private datasets. This scenario poses privacy risks, as LLMs may leak or regurgitate the private examples demonstrated in the prompt. We propose a novel algorithm that generates synthetic few-shot demonstrations from the private dataset with formal differential privacy (DP) guarantees, and show empirically that it can achieve effective ICL. We conduct extensive experiments on standard benchmarks and compare our algorithm with non-private ICL and zero-shot solutions. Our results demonstrate that our algorithm can achieve competitive performance with strong privacy levels. These results open up new possibilities for ICL with privacy protection for a broad range of applications.

论文与出版物下载

Privacy-preserving in-context learning with differentially private few-shot generation

20 9 月, 2024

This is a codebase to perform privacy-preserving in-context learning with differentially private few-shot generation.

下载数据