A new web crawler launched by Meta last month is quietly scraping the web for AI training data

lemme in@lemm.ee · 2 months ago

A new web crawler launched by Meta last month is quietly scraping the web for AI training data

Aniki 🌱🌿@lemmings.world · 2 months ago

Mate we have absurdly restrictive robots.txt including a custom WordPress plugin that automatically generates the file and the bots don’t give a fuck.

GarrulousBrevity@lemmy.world · 2 months ago

But meta’s will, and Alta Vista. I’m not angry at them when a script kitty makes a bad crawler

A new web crawler launched by Meta last month is quietly scraping the web for AI training data

A new web crawler launched by Meta last month is quietly scraping the web for AI training data

ERROR: The request could not be satisfied