They can put a robots.txt file in their root structure which can tell robots (AI scrapers) to ignore that website. However that only works on robots which follow that rule, it’s self enforced so it’s a crap shoot of it’ll be followed. Otherwise to be honest there isn’t a lot a public facing website can do to avoid being scraped. Maybe put up a captcha on every page?
As an overall metric, but some things have definitely gotten worse. The planet is on fire for instance. That’s getting worse and we haven’t even gotten into the really bad part.
They can put a robots.txt file in their root structure which can tell robots (AI scrapers) to ignore that website. However that only works on robots which follow that rule, it’s self enforced so it’s a crap shoot of it’ll be followed. Otherwise to be honest there isn’t a lot a public facing website can do to avoid being scraped. Maybe put up a captcha on every page?