• 2 Posts
  • 23 Comments
Joined 3 years ago
cake
Cake day: June 11th, 2023

help-circle





  • So now the output of both programs is “illegimate” in your eyes, despite one of them never even getting direct access to the original text.

    Now lets say one of them just writes a story in the style of Twain, still plagiarism? Because I don’t know if you can copyright a style.

    The first painter painted on cave walls with his fingers. Was the brush a parrot tool? A utility to plagiarize? You could use it for plagiarism, yes, and by your logic, it shouldn’t be used. And any work created using it is not “legitimate”.


  • Sure, AI is not doing anything creative, but neither is my pen, its the tool im using to be creative. Lets think about this more with some scenarios:

    Lets say software developer “A” comes along, and they’re pretty fucking smart. They sit down, read through all of Mark Twains novels, and over the course of the next 5 years, create a piece of software that generates works in Twain’s style. Its so good that people begin using it to write real books. It doesn’t copy anything specifically from Twain, it just mimics his writing style.

    We also have developer “B”. While Dev A is working on his project, Dev B is working on a very similar project, but with one difference: Dev B writes an LLM to read the books for him, and develop a writing style similar to Twain’s based off of that. The final product is more or less the same as Dev A’s product, but he saves himself the time of needing to read through every work on his own, he just reads a couple to get an idea of what the output might look like.

    Is the work from Dev A’s software legitimate? Why or why not?

    Is the work from Dev B’s software legitimate? Why or why not?

    Assume both of these developers own copies of the works they used as training data, what is honestly the difference here? This is what I am struggling with so much.


  • Sure, but what I’m asking is: what do you think is a reasonable rate?

    We are talking data sets that have millions of written works in them. If it costs hundreds or thousands per work, this venture almost doesn’t make sense anymore. If its $1 per work, or cents per work, then is it even worth it for each individual contributor to get $1 when it adds millions in operating costs?

    In my opinion, this needs to be handled a lot more carefully than what is being proposed. We are potentially going to make AI datasets wayyyy too expensive for anyone to use aside from the largest companies in the market, and even then this will cause huge delays to that progress.

    If AI is just blatantly copy and pasting what it read, then yes, I see that as a huge issue. But reading and learning from what it reads, no matter how rudimentary that “learning” may be, is much different than just copying works.


  • Okay, given that AI models need to look over hundreds of thousands if not millions of documents to get to a decent level of usefulness, how much should the author of each individual work get paid out?

    Even if we say we are going to pay out a measly dollar for every work it looks over, you’re immediately talking millions of dollars in operating costs. Doesn’t this just box out anyone who can’t afford to spend tens or even hundreds of millions of dollars on AI development? Maybe good if you’ve always wanted big companies like Google and Microsoft to be the only ones able to develop these world-altering tools.

    Another issue, who decides which works are more valuable, or how? Is a Shel Silverstein book worth less than a Mark Twain novel because it contains less words? If I self publish a book, is it worth as much as Mark Twains? Sure his is more popular but maybe mine is longer and contains more content, whats my payout in this scenario?









  • Okay, I can understand that. But why is that being turned into “the creator of any work an AI looks at needs to be compensated” instead of holding AI companies accountable for plagiarized works?

    I totally understand fining an AI company that produces a copy of Starry Night. But if it makes a painting similar in style to Starry Night that wouldn’t normally be considered a plagiarized work if a human did it, do we still consider that an issue?



  • Hey man, Threads is that way if you want to use it, but I came here because I very specifically do not have any interest. You can sign up there, use it all you want! But don’t fuck the rest of us over because you want to use both platforms at the same time.

    And by the way, your cloning analogy changes nothing. The original dog is not in the house, he’s fucking off somewhere else. The bear is actually here, trying to come in, and you’re proposing we use him to defend against a dog that may or may not return. Ill take my chances with the dog.



  • They might be bots, but I think there’s a good chunk of people who just don’t think about it, so they don’t care. Writing them off as bots won’t change that, but maybe we can help them look a few steps ahead and change some of their minds.

    What is more likely? An army of bots has been deployed to astroturf Lemmy already, or people are just ignorant to some of these issues? Probably a mixture of both. But more of Column B I would guess.


  • Let me put it this way: advocating for Meta being federated with us is like asking to keep a bear in the same room as the family chihuahua.

    Here I am saying “gee, I don’t think we should keep a bear and a chihuahua in the same room together. This seems like a really bad idea. Bears are pretty violent and this dog has no way to defend itself”.

    Your reply is “Well bears eat salmon, not dogs, so i’m not worried about it. If the bear didn’t have good intentions, why would he be getting in the room with the dog? Besides, if he does start getting violent, we can just take the bear out of the room and separate the two.”

    Nah man, i just like the dog, and I don’t need him getting fucked up. If you want the bear you can hang out with him outside, and have the same experience you would have had if he was inside. I get he keeps trying to come in, but I don’t see how its worth the risk to actually let him in.