Illustration: Alex Castro / The Verge
The current AI boom, from Bing and Midjourney, relies on free access to training data, much of it scraped from the web and often protected by copyright. The use of this data has led to both criticism and lawsuits, particularly in the art world, with rights owners arguing that their work is being exploited without their permission.
Some of the AI world’s biggest players, like OpenAI, have avoided scrutiny by simply refusing to detail the data used to create their software. But legislation proposed in the EU to regulate AI (the long-building and far-reaching AI Act) could force companies to disclose this information, according to reports from Reuters and Euractiv.
Reuters…