RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model

Shi, Zhuan; Yan, Jing; Tang, Xiaoli; Lyu, Lingjuan; Faltings, Boi

Computer Science > Computers and Society

arXiv:2408.16634 (cs)

[Submitted on 29 Aug 2024 (v1), last revised 2 Sep 2024 (this version, v2)]

Title:RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model

Authors:Zhuan Shi, Jing Yan, Xiaoli Tang, Lingjuan Lyu, Boi Faltings

View PDF HTML (experimental)

Abstract:The increasing sophistication of text-to-image generative models has led to complex challenges in defining and enforcing copyright infringement criteria and protection. Existing methods, such as watermarking and dataset deduplication, fail to provide comprehensive solutions due to the lack of standardized metrics and the inherent complexity of addressing copyright infringement in diffusion models. To deal with these challenges, we propose a Reinforcement Learning-based Copyright Protection(RLCP) method for Text-to-Image Diffusion Model, which minimizes the generation of copyright-infringing content while maintaining the quality of the model-generated dataset. Our approach begins with the introduction of a novel copyright metric grounded in copyright law and court precedents on infringement. We then utilize the Denoising Diffusion Policy Optimization (DDPO) framework to guide the model through a multi-step decision-making process, optimizing it using a reward function that incorporates our proposed copyright metric. Additionally, we employ KL divergence as a regularization term to mitigate some failure modes and stabilize RL fine-tuning. Experiments conducted on 3 mixed datasets of copyright and non-copyright images demonstrate that our approach significantly reduces copyright infringement risk while maintaining image quality.

Comments:	arXiv admin note: text overlap with arXiv:2403.12052 by other authors
Subjects:	Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
Cite as:	arXiv:2408.16634 [cs.CY]
	(or arXiv:2408.16634v2 [cs.CY] for this version)
	https://doi.org/10.48550/arXiv.2408.16634

Submission history

From: Zhuan Shi [view email]
[v1] Thu, 29 Aug 2024 15:39:33 UTC (8,971 KB)
[v2] Mon, 2 Sep 2024 12:15:16 UTC (8,971 KB)

Computer Science > Computers and Society

Title:RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computers and Society

Title:RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators