Generative AI second call for evidence:
Purpose limitation in the generative AI
lifecycle
This consultation sets out the ICO’s emerging thinking on generative AI
development and use. It should not be interpreted as an indication that any
particular form of data processing discussed below is legally compliant.
This post is part of the ICO’s consultation series on generative AI. This second call
focuses on how the data protection principle of purpose limitation should be applied at
different stages in the generative AI lifecycle. We provide a summary of the analysis
we have undertaken and the policy position we want to consult on.
You can respond to this call for evidence using the survey, or by emailing us
at [Link]@[Link].
The background
A specified, explicit and legitimate purpose
The purpose limitation principle in data protection law requires organisations to be
clear and open about why they are processing personal data, and to ensure that what
they intend to do with it is in line with individuals’ reasonable expectations.
Purpose limitation requires organisations to have a clear purpose for processing any
personal data before they start processing it. If they are not clear about why personal
data is processed, it follows that they will not be able to be clear with individuals.
This purpose must be legitimate, meaning that:
1. there must be a lawful basis for processing it; 1 and
2. the purpose is not in breach of other laws, such as intellectual property or contract
laws.
The purpose must also be specified and explicit: organisations need to be clear about
why they are processing the personal data, both in their internal documentation and
governance structures and with the people to whom the personal data relates.
Different stages, different purposes
The generative AI model lifecycle involves several stages. Each stage may involve
processing different types of personal data and for different purposes. Data protection
considerations are relevant for any activity that involves processing personal data.
For example, the purpose of training a core model will require training data and test
data, while the purpose of adapting the core model may require fine-tuning data from
a third party developing its own application.
Different organisations may have control over these different purposes (eg whether a
model will be fine-tuned to power an application), which helps delineate the
boundaries of those purposes.
Why is purpose limitation important?
Having a specified purpose for each stage will allow an organisation to properly
understand the scope of each processing activity, evaluate its compliance with data
protection law, and evidence that compliance.
For example, a developer may collect training data and also train a generative AI
model on that data. After model training, the developer may decide to develop an
application with which to deploy the model to serve some business objective. It is
essential that the organisation doing the model development and deployment
understands and documents those two purposes separately.
Without appropriate separation of purposes, the developer cannot assess how they
meet the other data protection principles, including whether:
the data is necessary for the purpose (minimisation principle);
the use of the data for that purpose is lawful (lawfulness principle);
the purpose has been explained to the people the data relates to (transparency
principle);
the purpose falls within people’s reasonable expectations or it can be explained why
any unexpected processing is justified (fairness principle); and
the stated purpose aligns with the scope of the processing activity and the
organisation’s ability to determine that scope.
In more detail – ICO guidance on purpose limitation
Principle (b): Purpose limitation
Records of processing and lawful basis
Our analysis
The compatibility of reusing training data
Training data can be expensive and difficult to collate, so developers may want to
return to the same or an enriched training dataset and use it many times. If training
data is reused in this way, for example to train two or more different models, the
developers who are reusing the training data must consider whether the purpose of
training a new model is compatible with the original purpose of collecting the training
data.
A key factor to consider is what the individual whose data is being reused reasonably
expected, at the time their data was first collected, about how it would be used. If the
further processing is not compatible with the original purpose, the controller will need
to establish a new, separate purpose.
This compatibility assessment may be easier for organisations that have a direct
relationship with the individuals whose personal data the generative AI encodes during
training.
Where the developer has no direct relationship with that individual, public messaging
campaigns and prominent privacy information may help to increase individuals’
awareness, and safeguards (such as anonymisation or the use of privacy-enhancing
technologies) may mitigate possible negative consequences for individuals.
In more detail – ICO guidance on compatibility assessments
What else do we need to consider?
One model, many purposes
A variety of generative AI-powered applications such as chatbots, image generators
and virtual assistants can all rely on an underlying model that acts as their foundation.
After the initial training of the generative AI model, an application is built on it, or on
a fine-tuned version of it, enabling its deployment in the real world.
This means that one core model can give rise to many different applications. For
example, the same large language model could be used to produce an application to
help with ideation, an application that answers customer emails, an application that
generates legal contracts or even a general-purpose application that can ultimately be
used for any of those tasks.
At the time the initial generative AI model is trained, the developer may already have
the specific application or applications they want to build in mind. Alternatively, and in
particular if the developer and the deployer are different organisations, the application
may be specified only afterwards, once the core model is already in existence.
The two processing activities may be carried out by the same organisation, or by
different ones. We understand that common industry practice includes the following
scenarios:
One organisation develops both the generative AI model, and the application built
on top of it; 2
One organisation develops the generative AI model, then provides it, or a fine-
tuned version of it, to another organisation, which may then develop an application
that embeds it to serve its own business objectives; and
One organisation develops the generative AI model, then develops an application
based on the model for another organisation, following their instructions about the
intended purpose for the product.
We consider that developing a generative AI model and developing an application
based on such a model (fine-tuned or not) constitute different purposes under data
protection law. This is in addition to the initial separate purpose that an organisation
may pursue when collating repositories of web-scraped data.
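The separation described above can be illustrated with a minimal sketch. The stage names and purpose statements below are hypothetical examples, not wording from the ICO; the point is simply that each lifecycle stage carries its own documented purpose rather than one umbrella purpose.

```python
# Hypothetical illustration: one documented purpose per lifecycle stage,
# rather than a single broad purpose covering all stages at once.
lifecycle_purposes = {
    "data collection": "collate a repository of web-scraped text for model training",
    "model development": "train a general-purpose large language model",
    "application development": "fine-tune and deploy the model as a customer-email assistant",
}

# A broad umbrella purpose like this (the example of an overly broad purpose
# given later in this post) would cover all stages at once and make it hard
# to explain which data each activity needs:
umbrella_purpose = "processing data for the purpose of developing a generative AI model"

# Each stage has its own, distinct purpose statement.
assert len(set(lifecycle_purposes.values())) == len(lifecycle_purposes)
for stage, purpose in lifecycle_purposes.items():
    print(f"{stage}: {purpose}")
```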
Defining a purpose
The purpose must be detailed and specific enough so that all relevant parties have a
clear understanding of why and how the personal data is used. These parties include:
(i) the organisation developing the model;
(ii) the people whose data is used to train it;
(iii) the people whose data is used during the deployment; and
(iv) the ICO.
Developers who rely on very broad purposes (eg “processing data for the purpose of
developing a generative AI model”) are likely to have difficulties in explaining both
internally and externally the specific processing activities that purpose covers. This is
because, without a precise explanation of what the purposes are, it will be hard for the
developer to demonstrate why particular types of personal data are needed, or how
any legitimate interests balancing test is passed.
Defining a specific and clear purpose for each distinct processing activity is key to a
data protection by design and by default approach.
Developers and deployers who consider the entire lifecycle of generative AI can assess
what the purpose of each of the stages of the lifecycle is, and then go on to establish
what personal data (if any) is needed for that purpose. A clearly defined purpose will
also help developers and deployers to allocate controller and processor responsibility
for the different stages of the lifecycle and explain that allocation of responsibility to
people whose data is being processed.
We understand that purposes in the earlier stages of the generative AI lifecycle, such
as the initial data collection, may be harder to define precisely than those closer to
the deployment end. The development of many generative AI models is open-ended,
with a business goal of building multi-functional, general-purpose models that
companies can scale across verticals. Nevertheless, defining a purpose at the
initial stages of the generative AI lifecycle involves considering what types of
deployments the model could result in, and what functionality the model will have.
When developing an application based on the model, it will be easier to specify the
purpose of that processing in more detail. Organisations developing applications based
on generative AI models should consider what these applications will be used for, and
what personal data processing is needed to develop them (eg fine-tuning to ensure
the model is trained for the task in the specific context in which it will be deployed).
Conclusion
The power of generative AI models is partly due to the broad way in which they can
be used. Despite the open-ended ambition of these models, developers need to give
careful consideration to the purpose limitation requirements of data protection, to
ensure that before they start processing, they can:
Set out sufficiently specific, explicit and clear purposes of each different stage of
the lifecycle; and
Explain what personal data is processed in each stage, and why it is needed to
meet the stated purpose.
Organisations will be better able to comply with data protection law and maintain
public trust if they give careful consideration to the difference between developing a
generative AI model and developing an application based on it, and are clear about
what types of data are used, and how, in each case.
1 See the first call for evidence for more detail on the lawful basis: Generative AI first
call for evidence: The lawful basis for web scraping to train generative AI models
2 The application through which the model is deployed can then be made available to
other parties or made accessible through an API, as discussed in our first Call for
Evidence.