Brutkey

1.3.6.1.4.1.61513
@xssfox@cloudisland.nz
I'm out of the loop, ai

So I see often any Ai bots scraping websites often at high rates. What's the actual go here, are they training without storing the scrapped data? Or is this after training when people are using them (requests to supplement a query, eg tool/mcp usage). Or are people inflating scraping the site 7 times a second to mean the whole site when it's just one page

I'm not doubting the extra load, I'm just curious about how it's not a scrape once and then they are done kind of deal

networking wizard catgirl
@pearl@fedi.rrr.sh
I'm out of the loop, ai

@xssfox@cloudisland.nz a lot of them will scrape a page, then scrape it a few hours later in case it changed in the last few hours, for every page on a website, including ones that are expensive for the server to handle


0x4d6165 (Mae or Julie)
@0x4d6165@wanderingwires.net
I'm out of the loop, ai

@pearl@rrr.sh @xssfox@cloudisland.nz i wonder, how does the internet archive manage to not harm websites when they basically do the same thing?

1.3.6.1.4.1.61513
@xssfox@cloudisland.nz
I'm out of the loop, ai

@0x4d6165@wanderingwires.net @pearl@fedi.rrr.sh presumably only scraping every couple of days