Eduardo Alonso (City U London) and Nicola Lucchi (Universitat Pompeu Fabra Law) have posted “AI And Copyright “Hallucinations”: Does the Text and Data Mining Exception Really Support Generative AI Training?” (European Intellectual Property Review, 2025, volume 47, issue 9, pp. 515-526) on SSRN. Here is the abstract:
This article critically challenges the widespread – and, it is argued, conceptually flawed – assumption that arts 3 and 4 of the CDSM Directive provide a lawful basis for training generative AI systems on copyright-protected content. The article describes this misinterpretation as a form of legal “hallucination”, underscoring its disconnect from the Directive’s textual, technical, and normative foundations. Designed to enable automated analytical extraction for scientific or informational purposes, the TDM exceptions do not encompass the large-scale reproduction, internalisation, and expressive re-use of works characteristic of GenAI training. Article 3 is limited to non-commercial research; Art.4’s opt-out mechanism, based on non-standardised signals, exacerbates uncertainty without ensuring transparency or fair compensation. This misclassification not only undermines core copyright incentives but also distorts the scope of EU exceptions, placing the framework in tension with the three-step test and international norms. The article argues that applying TDM rules to GenAI training introduces structural imbalances, both doctrinal and distributive, that risk entrenching platform asymmetries, weakening authorial agency, and threatening cultural diversity. Rather than relying on strained legal interpretations, a forward-looking response requires bespoke legal reforms that preserve normative coherence while addressing the specific challenges posed by synthetic content creation.
