@Wearwolf@kind.social
@ErikJonker@mastodon.social @david_chisnall@infosec.exchange @chris@mstdn.chrisalemany.ca it's not personal, non-commercial use though
LLM work by learning what words are associated with what context and then spitting those words out again when prompted with a similar context
They are regurgitation engines. They can only spit out, produce a copy of, the content that went into them.
It would be illegal to sell collages of people's Instagram posts without their permission. LLMs are that but with people's words
@Wearwolf@kind.social
@ErikJonker@mastodon.social @david_chisnall@infosec.exchange @chris@mstdn.chrisalemany.ca the argument you would use is actually that it's a transformative work. The process of the content going through the machine makes it unique.
The problem there is that LLMs can't have original ideas. They can't rephrase or paraphrase. All of that transformation comes from combining your words with other people's words
So the defence is that you have stolen from so many people that it's not obvious what was stolen from whom