Lee Duna@lemmy.nz to Fediverse@lemmy.worldEnglish · 11 months agoMastodon founder touts Threads' federation, saying it makes his X rival 'a far more attractive option'techcrunch.comexternal-linkmessage-square167fedilinkarrow-up1372arrow-down115
arrow-up1357arrow-down1external-linkMastodon founder touts Threads' federation, saying it makes his X rival 'a far more attractive option'techcrunch.comLee Duna@lemmy.nz to Fediverse@lemmy.worldEnglish · 11 months agomessage-square167fedilink
minus-squareAustralianSimon@lemmy.worldlinkfedilinkEnglisharrow-up5·11 months agoYou can scrape Lemmy instances for training data without even running an instance.
minus-squarejeffhykin@lemm.eelinkfedilinkEnglisharrow-up1arrow-down1·edit-211 months agoYeah, sorry if I’m not great at communicating. That’s exactly what I’m trying to point out when I said: Even if we don’t federate with them, Meta can still harvest the data so we should add these protections regardless.
minus-squareAustralianSimon@lemmy.worldlinkfedilinkEnglisharrow-up1·11 months agoThat’s the thing, anything public is fair game. This is why Reddit is ruining their API.
minus-squarejeffhykin@lemm.eelinkfedilinkEnglisharrow-up1·11 months agoIt’s not fair game for for-profit bussinesses training LLM’s. That’s part of why Reddit made the move; so that companies would need to pay Reddit for access to the data for legally training models
You can scrape Lemmy instances for training data without even running an instance.
Yeah, sorry if I’m not great at communicating. That’s exactly what I’m trying to point out when I said:
That’s the thing, anything public is fair game. This is why Reddit is ruining their API.
It’s not fair game for for-profit bussinesses training LLM’s. That’s part of why Reddit made the move; so that companies would need to pay Reddit for access to the data for legally training models