←back to thread

448 points lastdong | 4 comments | | HN request time: 0s | source
Show context
Meneth ◴[] No.45116481[source]
Open-source, eh? Where's the training data, then?
replies(1): >>45117919 #
Joel_Mckay ◴[] No.45117919[source]
Most scraped data is often full of copyright, usage agreement, and privacy law violations.

Making it "open" would be unwise for a commercial entity. =3

replies(1): >>45120324 #
zoobab ◴[] No.45120324[source]
Open source is being abused to not provide the actual source. Stop this.
replies(1): >>45120491 #
1. Joel_Mckay ◴[] No.45120491[source]
A lot of code has multiple FOSS licenses that are not contaminating like GPL. GPL violations do occur on code, but have nothing to do with the training Data.

For example, many academic data sets are not public domain, and can't be used in a commercial context. A GPL claim on that data is often an argument of which thief showed up first.

Rule #24: A lawyers Strategic Truth is to never lie, but also avoid voluntarily disclosing information that may help opponents.

Thus, a business will never disclose they paid a fool to break laws for them... =3

replies(2): >>45121109 #>>45121115 #
2. ◴[] No.45121109[source]
3. nullc ◴[] No.45121115[source]
Perhaps, but it is not Open Source in the traditional sense if they do not provide the preferred form for modifications.
replies(1): >>45121447 #
4. Joel_Mckay ◴[] No.45121447[source]
There are also some weird OSS license rules that only trip the disclosure obligation when distributing the build to end users.

Indeed, these adversarial behaviors do not follow the spirit of FOSS community standards. If a project started as FOSS, than FOSS it should remain. =3