Performance-lossless Black-box Model Watermarking

doi:10.48550/arXiv.2312.06488

With the development of deep learning, high-value and high-cost models have become valuable assets, and technologies for protecting their intellectual property have become a hot topic. However, existing model watermarking work in black-box scenarios mainly originates from training-based backdoor methods, which may degrade primary-task performance. To address this, we propose a branch backdoor-based model watermarking protocol to protect model intellectual property, in which a construction based on a message authentication scheme is adopted as the branch indicator after a comparative analysis with secure cryptographic primitives. We prove by reduction that the protocol is performance-lossless. In addition, we analyze the potential threats to the protocol and provide a secure and feasible watermarking instance for language models.
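The branch idea in the abstract can be illustrated with a minimal sketch. This is not the paper's construction: it assumes HMAC-SHA256 as the message authentication scheme, byte-string queries, and a trigger formed by appending the tag to the message. The names `make_trigger`, `is_trigger`, `primary_model`, `watermark_branch`, and `watermarked_model` are hypothetical, and the two model functions are stand-ins rather than real networks.

```python
import hashlib
import hmac

TAG_LEN = 32  # length in bytes of an HMAC-SHA256 tag


def make_trigger(key: bytes, message: bytes) -> bytes:
    """Owner side (assumed format): append an HMAC-SHA256 tag to the message."""
    tag = hmac.new(key, message, hashlib.sha256).digest()
    return message + tag


def is_trigger(key: bytes, query: bytes) -> bool:
    """Branch indicator: True only if the query ends with a valid tag."""
    if len(query) <= TAG_LEN:
        return False
    message, tag = query[:-TAG_LEN], query[-TAG_LEN:]
    expected = hmac.new(key, message, hashlib.sha256).digest()
    # Constant-time comparison avoids leaking how many tag bytes matched.
    return hmac.compare_digest(tag, expected)


def primary_model(query: bytes) -> str:
    """Stand-in for the original, unmodified model (hypothetical)."""
    return "primary:" + str(len(query))


def watermark_branch(query: bytes) -> str:
    """Stand-in for the branch that emits the ownership evidence (hypothetical)."""
    return "WATERMARK"


def watermarked_model(key: bytes, query: bytes) -> str:
    """Route valid triggers to the watermark branch, everything else to the primary model."""
    if is_trigger(key, query):
        return watermark_branch(query)
    return primary_model(query)
```

In this sketch the primary model is never retrained, so an ordinary query receives exactly the output it would have received without the watermark unless it happens to carry a valid tag. That is the intuition one would expect behind a reduction-style argument: a performance difference on normal inputs would require producing a valid tag without the key.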

Publication: arXiv e-prints

Pub Date: December 2023

DOI: 10.48550/arXiv.2312.06488

arXiv: arXiv:2312.06488

Bibcode: 2023arXiv231206488Z

Keywords: Computer Science - Cryptography and Security
