> Do you double check that the LLM hasn't magically recreated someone else's copyrighted code?
I frankly do not care, and I expect LLMs to become such ubiquitous table-stakes that I don't think anyone will really care in the long run.
> and I expect LLMs to become such ubiquitous table-stakes
Unless they develop entirely new technology they're stuck with linear growth of output capability for input costs. This will take a very long time. I expect it to be abandoned in favor of better ideas and computing interfaces. "AI" always seems to bloom right before a major shift in computing device capability and mobility and then gets left behind. I don't see anything special about this iteration.
> that I don't think anyone will really care in the long run.
There are trillions of dollars at stake and access to even the basics of this technology is far from egalitarian or well distributed. Until it is I would expect people who's futures and personal wealth depends on it to care quite a bit. In the meanwhile you might just accelerate yourself into a lawsuit.
That's really a non-issue. Anything copyrightable is non-trivial in length and complexity to the point that an LLM is not going to verbatim output that.
Funny, gemma will do this all day long.
>>> Please write a hello world app in java
```java
public class HelloWorld {
public static void main(String[] args) {
System.out.println("Hello, World!");
}
}
```
https://docs.oracle.com/javase/tutorial/getStarted/applicati...cat /bin/true and /bin/false if you are on a Solaris etc... as an example too.
Note this paper that will be presented at ICSE in a couple of weeks too.
https://arxiv.org/abs/2408.02487v3
The point being is that this is very much a very real and yet unsolved problem with LLMs right now.
> I frankly do not care
I just heard a thousand expensive IP lawyers sigh orgasmically.
IP lawyers would have a field day if they had access to the code base of any large corporation. Fortunately, they do not.