Basically a deer with a human face. Despite probably being some sort of magical nature spirit, his interests are primarily in technology and politics and science fiction.

Spent many years on Reddit before joining the Threadiverse as well.

  • 0 Posts
  • 193 Comments
Joined 2 years ago
cake
Cake day: March 3rd, 2024

help-circle







  • You went digging through my Reddit comments to find a two-month-old thread, that must have taken a lot of effort. But I’m afraid I don’t see what the relevance of it is, aside from a general “it’s about AI”. The bulk of the comments I wrote there were about water usage.

    I’m genuinely puzzled. Are you saying that deduplicating data is “hiding unethical behaviour?” It’s actually intended for improving the model’s performance, having a model spit out exact copies of its training data means you’ve produced a hugely expensive and wasteful re-implementation of copy-and-paste rather than a generative AI. The whole point of generative AI is to produce novel outputs.


  • Thanks. This has actually been a thing that bothered me many years before AI was ever a thing, there are open source programs I’ve installed that pop open a clickwrap “agree with the GPL before you can install this” step and it shows a misunderstanding of how these licenses fundamentally work. They’re not EULAs.

    As for whether I’m a “concern troll”, AI happens to be an area of significant interest to me right now and so I’ve been commenting a lot on it. My opinion on it also happens to be unpopular. I don’t like the idea of closed social media bubbles where only groupthink is allowed, so I just go ahead and speak my mind even knowing it’ll likely get hit with a lot of downvotes. I’m finding the Fediverse to be a lot more insular than Reddit is in this regard, I suspect because the population in general is a lot smaller, but at least downvotes don’t tend to “bury” comments.

    If anyone can’t stand reading my comments, I recommend blocking me. It’s the ultimate downvote.






  • It may be that such a license can’t exist. The way these viral copyleft licenses work is that they offer things to people who accept them that copyright otherwise doesn’t permit. The usual example: you can distribute copies of this work (a thing that copyright prohibits you from doing by default) but in exchange you must release any derivative works you make under the same licence.

    The problem is that you actually can reject that licence. You can download the software (that’s allowed because the person distributing the software agreed to the copyleft license) and then decide you’re not going to accept the license that came with it. At that point you’re restricted by ordinary copyright and can only do the things you’d normally do with it.

    There have already been court cases in the US that have ruled that training an AI is fair use, and the resulting model is not a derivative work covered by the copyright of the original. So you can just go ahead and train the AI at that point.