As the government races to invest in AI research, federally funded researchers stand to encounter a troubling problem: datasets tainted with dangerous, and even illegal, content. AI models are often ...
LAION, the German research org that created the data used to train Stable Diffusion, among other generative AI models, has released a new dataset that it claims has been “thoroughly cleaned of known ...
LAION-5B is an open dataset released in March 2022 by the German non-profit organization Large-scale Artificial Intelligence Open Network (LAION), and consists of 5.85 billion image and text ...
In the recent LAION vs. Kneschke case, the Hamburg District Court addressed the application of Germany’s text and data mining (TDM) exceptions under the Copyright ...
Researchers have found child sexual abuse material in LAION-5B, an open-source artificial intelligence training dataset used to build image generation models. The discovery was made by the Stanford ...
Photos of Brazilian kids—sometimes spanning their entire childhood—have been used without their consent to power AI tools, including popular image generators like Stable Diffusion, Human Rights Watch ...
The Large-scale Artificial Intelligence Open Network (LAION) released LAION-5B, an AI training dataset containing over five billion image-text pairs. LAION-5B contains images and captions scraped from ...
The pause in developing large-scale AI models called for in an Open Letter is stirring up tempers and opposition, for example from open-source advocates. The non-profit Large-Scale Artificial ...
In building AI, not only the algorithm but also the training dataset is important, and the quality of the dataset greatly affects the accuracy of AI. Stable Diffusion, which is a hot topic as a ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results