Attacks on Third-Party APIs of Large Language Models

doi:10.48550/arXiv.2404.16891

Attacks on Third-Party APIs of Large Language Models

Large language model (LLM) services have recently begun offering a plugin ecosystem to interact with third-party API services. This innovation enhances the capabilities of LLMs, but it also introduces risks, as these plugins developed by various third parties cannot be easily trusted. This paper proposes a new attacking framework to examine security and safety vulnerabilities within LLM platforms that incorporate third-party services. Applying our framework specifically to widely used LLMs, we identify real-world malicious attacks across various domains on third-party APIs that can imperceptibly modify LLM outputs. The paper discusses the unique challenges posed by third-party API integration and offers strategic possibilities to improve the security and safety of LLM ecosystems moving forward. Our code is released at https://github.com/vk0812/Third-Party-Attacks-on-LLMs.

Publication:

arXiv e-prints

Pub Date:

April 2024

DOI:

10.48550/arXiv.2404.16891

arXiv:

arXiv:2404.16891

Bibcode:

2024arXiv240416891Z

Keywords:

Computer Science - Cryptography and Security;
Computer Science - Artificial Intelligence;
Computer Science - Computation and Language;
Computer Science - Computers and Society

E-Print:

ICLR 2024 Workshop on Secure and Trustworthy Large Language Models

NASA/ADS

Attacks on Third-Party APIs of Large Language Models

Abstract