A new web crawler launched by Meta last month is quietly scraping the web for AI training data

lemme in@lemm.ee · 2 months ago

edited 10 days ago

Removed by mod

GarrulousBrevity@lemmy.world · 2 months ago

Oh, no, that wasn’t excusing Meta in general. I’m just giving them a pass because, to my knowledge, they have a history of respecting robots.txt, which makes this piece of software better than outright malware. Launching it secretly, without first giving site hosts a chance to make sure their privacy was configured the way they liked, was a shady-as-hell move, no argument there.
