Update: The “Government Copyright and AI Consultation” invites stakeholders from the technology and creative sectors to submit their opinions on the future of AI regulation

By Ben Travers, Paolo Sbuttoni, Hannah Duke, Oliver Toomey

15 Jan 2025 | 8 minute read

This week, the UK government have unveiled the far-reaching 'AI Opportunities Action Plan' to unlock the productivity benefits of AI across the UK economy. Whilst we welcome the 50-point action plan and are monitoring it closely, it's worth noting that most of the policies – opening up NHS data, building AI infrastructure, developing skills – are long-term in scope. The wider commercial and legal implications of the proposals may not become apparent for some time.

However, right now, another government initiative is looking to potentially overhaul one of the key regulatory battlegrounds in the context of the AI economy: the rules on AI training and copyright. On 17^th December 2024 (and separately to this week's Action Plan), the government published a 'Consultation on Copyright and AI', which suggests that a shake-up of the law could be on the cards.

At present, the restrictions imposed by legislation on the use of copyright content does not reflect how AI models interact with that consent in the real world, nor how the public has come to expect that interaction to take place. This can lead to disagreement and disputes - a significant problem given AI's power and 'intelligence' is predicated on the large scale copying of data available publicly on the internet, much of which comprises copyrighted works. This is known as "data mining".

Rights-holders are understandably concerned about how they can be remunerated for this copying and, more broadly, that they are not sharing in the value generated by AI. The AI consultation document accepts that "as things stand, the [copyright] framework does not meet the needs of the UK's creative industries or AI sectors". Submissions are open until the end of February, and we advise that rights-holders and AI developers take this opportunity to engage with the law-making process and consider how the proposed approach (see explanation below) could impact their businesses.

Regulatory landscape

In 2023, following the Vallance review on pro-innovation regulation for digital technologies, the UK Intellectual Property Office (IPO) convened a working group of AI firms and rights-holders to try and agree a 'code of conduct' governing the interaction between copyright and AI. Big tech businesses (Microsoft, IBM and Google Deepmind) hashed out the issue with some of the UK's key media industry bodies, including the BBC, the Premier League, Equity and UK Music (amongst others). It was proposed that AI firms who committed to the code could expect to be offered a reasonable licence by rights-holders in return for accessing and using their content to train AI models.

However, in February 2024, the government confirmed that the working group were not able to agree to a voluntary code, and the initiative was abandoned.

It is in this context that the UK government has now decided to launch a consultation which will, most likely, precede the tricky task of legislating in a way which protects (and incentivises) rights-holders whilst attracting innovative AI businesses to set up shop in UK. Indeed, point 24 of the AI Opportunities Action Plan acknowledges that the uncertainty around IP and AI is hindering innovation, and "needs to urgently be resolved".

It will be a significant legal and technical challenge to achieve the right balance: the value of copyright as a legal concept is to reward creators for their original efforts, and prevent others from using their works without permission. However, if you put up barriers to training AI in the UK, the government will deter key businesses and stifle AI innovation.

The regulatory development comes alongside litigation in several jurisdictions, including the Getty Images v Stability AI case in the UK High Court in which Stability argues it was within its rights to copy Getty's images for training. They maintain that their actions were (i) within existing exceptions in UK copyright law and (ii) the copying took place on a server outside the UK. It could however take several years for these issues to be resolved in case law, which is why the government is now pushing for a statutory solution.

Policy options

The consultation sets out three possible policy approaches. The first option would mean that AI models could only be trained on copyright works in the UK if they have an express licence to do so. Firms providing services in the UK would not be able to get around this requirement by training the AI models in other countries. They would need to obtain a licence in all cases where an AI model is trained on copyrighted works, which is an extremely pro-rights-holder position and unlikely to be pursued.

The second possibility is introducing a broader Text and Data Mining ("TDM") exception which currently exists in s. 29A Copyright, Design and Patents Act 1988 ("CDPA 1988"). This would allow data mining on all copyrighted works for AI training without permission and for commercial purposes (at present, the TDM exception is applicable for non-commercial data mining). This is an extremely pro-developer position.

Proposal: Expanding the Text and Data Mining exception with the option for a rights reservation

The third approach, and the one favoured by the Government, is to implement an updated data TDM exception which would enable AI developers to train models on data where rights-holders have not expressly reserved their rights. Rights-holders would therefore be able to prevent use of their works where no licence is agreed (for example, a news publisher might reserve rights in its publications).

This approach appears to balance the rights-holders' need for remuneration with the AI developers' requirement for good quality data sets. This, in turn, would encourage a market for licensing, as well as greater transparency measures on behalf of the AI developers.

Technical challenges

Technical standards

The proposal suggests rights reservation in a standardised "machine-readable format" (similar to the robot.txt standard), but it is not clear how this could be made technically available for all UK rights-holders. Furthermore, copies of the same works which themselves do not have rights reserved may exist elsewhere on the internet. A similar approach in the EU has shown that it is not always clear what constitutes a valid rights reservation under this model. The existing robot.txt standard cannot provide the granular control that rights-holders seek, as it only recognises reservations associated at site level and not with individual works.

Other suggestions including associating metadata with the work itself, or implementing a kind of notification regime whereby AI firms offer rights-holders a standardised process for direct notification that works cannot be used for training AI. It's also important to be aware that AI models have already consumed significant volumes of copyright content. There is therefore a risk that any measures will need to apply to future use of copyright content but that they will also need to address such use as may have occurred in the past. This may need to include mechanisms for extracting information which has previously been used without permission and for compensating rights holders where this cannot be done.

The consultation calls on "AI companies and creative industries to come together and create new technical systems to deliver the desired outcome of greater control and licensing of IP". We encourage stakeholders to consider these different types of standardisation for rights reservation protocols, and how they would impact your business.

Transparency

At present, it is often difficult for rights-holders to determine whether their works are being used to train AI models. The implementation of a rights reservation approach to copyright and AI would be dependent on minimum transparency obligations, and the consultation acknowledges that regulation may be needed to ensure this happens. This follows Article 53 of the EU's recently introduced AI Act, which requires AI providers to make publicly available a "sufficient detailed summary" of training content. It is too early to tell how this will impact the sector, but it will undoubtedly present practical challenges for small businesses and new entrants in AI development, given such large quantities of work are used in the training process. Again, stakeholders are invited to submit their opinions on how AI developers should disclose the sources of training material.

Further issues raised in the consultation

The consultation raises a number of other pertinent issues in the context of AI and copyright including:

Ownership of AI Outputs – it is currently unclear how copyright protections for computer generated works ("CGWs") (s. 178 CDPA 1988) apply to AI generated works. The CGW has been criticised for being potentially contradictory over the fact such works need to meet the 'originality' test, but that the test in case law (put simply) is very much associated with human qualities. Currently, the amount of creative effort invested at a human level may have an impact on whether copyright subsists in an AI created work. This position may not satisfy both creators and AI technology owners. Furthermore, as AI systems become more sophisticated, direct human input into the finished product may start to reduce, which could create a higher barrier for copyright ownership The consultation queries whether the CGW right should be clarified or simply removed.

Contracts and Licensing – there may also need to be an adjustment to industry standard contracts between (i) creators and (ii) publishing firms or collective management organisations across the sector. This is to ensure that contracts support good practice where works are licensed for AI training. This would be a significant change to the current position, which often sees businesses handing over the ownership of content to platform providers as a result of the platform terms and conditions.

Labelling – the consultation invites submissions on whether AI generated works should be labelled as such.

Engaging with the consultation

This is a good opportunity for stakeholders across sectors to share views on the proposals. Responses can be submitted via Citizen Space or to [email protected]. The consultation will run for 10 weeks, closing at midnight on 25 February 2025.

Please do get in touch with one of our specialist technology and IP lawyers if you have any queries.

Ben Travers

Partner

Head of Intellectual Property | Commercial, Tech & Data | International

01392685280

Email Ben Travers

View Profile

Paolo Sbuttoni

Partner

Commercial, Tech & Data

01174038980

Email Paolo Sbuttoni

View Profile

Hannah Duke

Senior Associate

Commercial, Tech & Data

01752676973

Email Hannah Duke

View Profile

Oliver Toomey

Associate

Commercial, Tech & Data

0117403919

Email Oliver Toomey

View Profile

Articles

11th February 2025

Too close for comfort – the Court of Appeal decision in Thatchers v Aldi

Find out more

Is the Sky the Limit? – The Supreme Court’s ruling in Skykick v Sky highlights the importance of genuine use in trade mark registration featured image

Articles

5th February 2025

Is the Sky the Limit? – The Supreme Court’s ruling in Skykick v Sky highlights the importance of genuine use in trade mark registration

Find out more

Articles

9th December 2024

Protect your brand in the EU: how to combat counterfeit goods and strengthen your IP rights

Find out more

Your exits are here: how to prepare the IP and data in your business as private equity takes flight (again) featured image

Articles

2nd December 2024

Your exits are here: how to prepare the IP and data in your business as private equity takes flight (again)

Find out more

Articles

21st November 2024

Farming businesses need to take back the power of brand

Find out more

Articles

8th October 2024

5 key points to consider when creating and protecting a new brand name

Find out more

A step forward for international designs/Hague design registrations featured image

Articles

17th September 2024

A step forward for international designs/Hague design registrations

Find out more

Apple’s iPhone trade mark Challenges Struck Out in UKIPO featured image

Articles

17th September 2024

Apple’s iPhone trade mark Challenges Struck Out in UKIPO

Find out more

An update on the UK IPO’s IP Audit Scheme featured image

Articles

3rd September 2024

An update on the UK IPO’s IP Audit Scheme

Find out more

The STEPS needed to properly plead a case: the critical role of evidence in trade mark oppositions featured image

Articles

28th August 2024

The STEPS needed to properly plead a case: the critical role of evidence in trade mark oppositions

Find out more

The virtual and real world intersect at the EUIPO featured image

Articles

22nd August 2024

The virtual and real world intersect at the EUIPO

Find out more

Trade mark protection in Gibraltar featured image

Articles

22nd August 2024

Trade mark protection in Gibraltar

Find out more

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
bcookie	2 years	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
bscookie	2 years	LinkedIn sets this cookie to store performed actions on the website.
lang	session	LinkedIn sets this cookie to remember a user's language setting.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
UserMatchHistory	1 month	LinkedIn sets this cookie for LinkedIn Ads ID syncing.

Cookie	Duration	Description
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_8FBMLQP5H9	2 years	This cookie is installed by Google Analytics.
_gat_myTracker	1 minute	A Google Analytics tracking cookie that sets a unique visitor ID, the date and time of the first visit, the start time of the active visit and the number of visits made by a visitor to the site.
_gcl_au	3 months	Provided by Google Tag Manager to experiment advertisement efficiency of websites using their services.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
_hjAbsoluteSessionInProgress	30 minutes	Hotjar sets this cookie to detect the first pageview session of a user. This is a True/False flag set by the cookie.
_hjFirstSeen	30 minutes	Hotjar sets this cookie to identify a new user’s first session. It stores a true/false value, indicating whether it was the first time Hotjar saw this user.
_hjIncludedInPageviewSample	2 minutes	Hotjar sets this cookie to know whether a user is included in the data sampling defined by the site's pageview limit.
_hjIncludedInSessionSample	2 minutes	Hotjar sets this cookie to know whether a user is included in the data sampling defined by the site's daily session limit.
_hjTLDTest	session	To determine the most generic cookie path that has to be used instead of the page hostname, Hotjar sets the _hjTLDTest cookie to store different URL substring alternatives until it fails.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
cusid	30 minutes	ClickDimensions sets this cookie to establish and continue a user session with the site.
cuvid	2 years	This cookie, set by ClickDimensions, is written to the browser upon the first visit to the site from that web browser.
cuvon	30 minutes	ClickDimensions sets this cookie to store the last time a visitor viewed a page.

Cookie	Duration	Description
_fbp	3 months	This cookie is set by Facebook to display advertisements when either on Facebook or on a digital platform powered by Facebook advertising, after visiting the website.
fr	3 months	Facebook sets this cookie to show relevant advertisements to users by tracking user behaviour across the web, on sites that have Facebook pixel or Facebook social plugin.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.

Cookie	Duration	Description
_hjSession_1667891	30 minutes	No description
_hjSessionUser_1667891	1 year	No description
AnalyticsSyncHistory	1 month	No description
JoP_SGM-bYTR	1 day	No description
jUspEKvx	1 day	No description
li_gc	2 years	No description
VK-bxnJ	1 day	No description
zkxqriGyJLZ	1 day	No description