Reddit blocking all major search engines, except Google

gedaliyah@lemmy.world · 4 months ago

Reddit blocking all major search engines, except Google

leopold@lemmy.kde.social · 4 months ago

this is just going to cause indexers to ignore robots.txt

gedaliyah@lemmy.world · 4 months ago

“We always obey the robots.txt”

A bunch of corporations that have no accountability and plenty of incentive to just ignore it and have all been caught training AI on off-limits data.

Kairos@lemmy.today · 4 months ago

They’re likely blocking user agents too, which I think also doesn’t have legal enforcement (as in DuckDuckGo can just use “Google” unless they said otherwise.

Natanael@slrpnk.net · edit-2 4 months ago

LinkedIn tried blocking scraping that way but as long as the scraping isn’t burdensome it’s basically legal but you can still be bound by TOS and civil claims

https://natlawreview.com/article/hiq-and-linkedin-reach-proposed-settlement-landmark-scraping-case

capital@lemmy.world · 4 months ago

Rate limiting could “fix” that unfortunately.