@ophiocephalic@kolektiva.social
Rather than scraping from sites directly, many of the addresses on Metaβs leaked list belong to Content Delivery Networks (CDNs) that are used by websites to cache and store information to improve site performance.This is a critical point. An instance or website can defend itself in numerous different ways, including actively adversarial strategies, and still succumb to extraction - if they're using Cloudflare
cc: @subMedia@kolektiva.social
@ophiocephalic@kolektiva.social
@FediPact@cyberpunk.lol
Another sickening consideration here. If they're scraping Cloudflare and CDNs rather than directly, it's possible or likely they're not just extracting public posts, but all posts, including DMs
@subMedia@kolektiva.social