688 points crescit_eundo | 32 comments
codeflo ◴[] No.42145710[source]
At this point, we have to assume anything that becomes a published benchmark is specifically targeted during training. That's not something specific to LLMs or OpenAI. Compiler companies have done the same thing for decades, specifically detecting common benchmark programs and inserting hand-crafted optimizations. Similarly, the shader compilers in GPU drivers have special cases for common games and benchmarks.
replies(3): >>42146244 #>>42146391 #>>42151266 #
darkerside ◴[] No.42146244[source]
VW got in a lot of trouble for this
replies(10): >>42146543 #>>42146550 #>>42146553 #>>42146556 #>>42146560 #>>42147093 #>>42147124 #>>42147353 #>>42147357 #>>42148300 #
sigmoid10 ◴[] No.42146560[source]
Apples and oranges. VW actually cheated on regulatory testing to bypass legal requirements. So to be comparable, the government would first need to pass laws where e.g. only compilers that pass a certain benchmark are allowed to be used for purchasable products and then the developers would need to manipulate behaviour during those benchmarks.
replies(3): >>42146749 #>>42147885 #>>42150309 #
1. 0xFF0123 ◴[] No.42146749[source]
The only difference is the legality. From an integrity point of view it's basically the same
replies(7): >>42146884 #>>42146984 #>>42147072 #>>42147078 #>>42147443 #>>42147742 #>>42147978 #
2. Thorrez ◴[] No.42146884[source]
I think breaking a law is more unethical than not breaking a law.

Also, legality isn't the only difference in the VW case. With VW, they had a "good emissions" mode. They enabled the good emissions mode during the test, but disabled it during regular driving. It would have worked during regular driving, but they disabled it during regular driving. With compilers, there's no "good performance" mode that would work during regular usage that they're disabling during regular usage.

replies(4): >>42146959 #>>42147070 #>>42147439 #>>42147666 #
3. Lalabadie ◴[] No.42146959[source]
> I think breaking a law is more unethical than not breaking a law.

It sounds like a mismatch of definition, but I doubt you're ambivalent about a behavior right until the moment it becomes illegal, after which you think it unethical. Law is the codification and enforcement of a social contract, not the creation of it.

replies(3): >>42147314 #>>42147369 #>>42148090 #
4. UniverseHacker ◴[] No.42146984[source]
I disagree- presumably if an algorithm or hardware is optimized for a certain class of problem it really is good at it and always will be- which is still useful if you are actually using it for that. It’s just “studying for the test”- something I would expect to happen even if it is a bit misleading.

VW cheated such that the low emissions were only active during the test- it’s not that it was optimized for low emissions under the conditions they test for, but that you could not get those low emissions under any conditions in the real world. That's "cheating on the test" not "studying for the test."

5. Winse ◴[] No.42147070[source]
unless following an unethical law would in itself be unethical, then breaking the unethical law would be the only ethical choice. In this case cheating emissions, which I see as unethical, but also advantageous for the consumer, should have been done openly if VW saw following the law as unethical. Ethics and morality are subjective to understanding, and law only a crude approximation of divinity. Though I would argue that each person on the earth through a shared common experience has a rough and general idea of right from wrong...though I'm not always certain they pay attention to it.
6. the_af ◴[] No.42147072[source]
> The only difference is the legality. From an integrity point of view it's basically the same

I think cheating about harming the environment is another important difference.

7. Swenrekcah ◴[] No.42147078[source]
That is not true. Even ChatGPT understands how they are different, I won’t paste the whole response but here are the differences it highlights:

Key differences:

1. Intent and harm: VW’s actions directly violated laws and had environmental and health consequences. Optimizing LLMs for chess benchmarks, while arguably misleading, doesn’t have immediate real-world harms.
2. Scope: Chess-specific optimization is generally a transparent choice within AI research. It’s not a hidden “defeat device” but rather an explicit design goal.
3. Broader impact: LLMs fine-tuned for benchmarks often still retain general-purpose capabilities. They aren’t necessarily “broken” outside chess, whereas VW cars fundamentally failed to meet emissions standards.

8. Thorrez ◴[] No.42147314{3}[source]
>I doubt you're ambivalent about a behavior right until the moment it becomes illegal, after which you think it unethical.

There are many cases where I think that. Examples:

* Underage drinking. If it's legal for someone to drink, I think it's in general ethical. If it's illegal, I think it's in general unethical.

* Tax avoidance strategies. If the IRS says a strategy is allowed, I think it's ethical. If the IRS says a strategy is not allowed, I think it's unethical.

* Right on red. If the government says right on red is allowed, I think it's ethical. If the government (e.g. NYC) says right on red is not allowed, I think it's unethical.

The VW case was emissions regulations. I think they have an ethical obligation to obey emissions regulations. In the absence of regulations, it's not an obvious ethical problem to prioritize fuel efficiency instead of emissions (that's I believe what VW was doing).

replies(3): >>42147570 #>>42148734 #>>42156023 #
9. emn13 ◴[] No.42147369{3}[source]
Also, while laws ideally are inspired by an ethical social contract, the codification process is long, complex and far from perfect. And then for rules concerning permissible behavior even in the best of cases, it's enforced extremely sparingly simply because it's not possible nor desirable to detect and deal with all infractions. Nor is it applied blindly and equally. As actually applied, a law is definitely not even close to some ethical ideal; sometimes it's outright opposed to it, even.

Law and ethics are barely related, in practice.

For example in the vehicle emissions context, it's worth noting that even well before VW was caught the actions of likely all carmakers affected by the regulations (not necessarily to the same extent) were clearly unethical. The rules had been subject to intense clearly unethical lobbying for years, and so even the legal lab results bore little resemblance to practical on-the-road results through systematic (yet legal) abuse. I wouldn't be surprised to learn that even what was measured intentionally diverged from what is harmful in a profitable way. It's a good thing VW was made an example of - but clearly it's not like that resolved the general problem of harmful vehicle emissions. Optimistically, it might have signaled to the rest of the industry and VW in particular to stretch the rules less in the future.

10. hansworst ◴[] No.42147439[source]
Overfitting on test data absolutely does mean that the model would perform better in benchmarks than it would in real life use cases.
replies(1): >>42158947 #
11. currymj ◴[] No.42147443[source]
VW was breaking the law in a way that harmed society but arguably helped the individual driver of the VW car, who gets better performance yet still passes the emissions test.
replies(2): >>42147637 #>>42149872 #
12. chefandy ◴[] No.42147570{4}[source]
Drinking and right turns are unethical if they’re negligent. They’re not unethical if they’re not negligent. The government is trying to reduce negligence by enacting preventative measures to stop ALL right turns and ALL drinking in certain contexts that are more likely to yield negligence, or where the negligence would be particularly harmful, but that doesn’t change whether or not the behavior itself is negligent.

You might consider disregarding the government’s preventative measures unethical, and doing those things might be the way someone disregards the government’s protective guidelines, but that doesn’t make those actions unethical any more than governments explicitly legalizing something makes it ethical.

To use a clearer example, the ethicality of abortion— regardless of what you think of it— is not changed by its legal status. You might consider violating the law unethical, so breaking abortion laws would constitute the same ethical violation as underage drinking, but those laws don’t change the ethics of abortion itself. People who consider it unethical still consider it unethical where it’s legal, and those that consider it ethical still consider it ethical where it’s not legal.

replies(4): >>42147856 #>>42148191 #>>42148730 #>>42157977 #
13. jimmaswell ◴[] No.42147637[source]
And afaik the emissions were still miles ahead of a car from 20 years prior, just not quite as extremely stringent as requested.
replies(1): >>42148188 #
14. Retr0id ◴[] No.42147666[source]
ethics should inform law, not the reverse
replies(1): >>42158917 #
15. boringg ◴[] No.42147742[source]
How so? VW intentionally changed the operation of the vehicle so that its emissions met the test requirements during the test and then went back to typical operation conditions afterwards.
16. adgjlsfhk1 ◴[] No.42147856{5}[source]
the right on red example is interesting because in that case, the law changes how other drivers and pedestrians will behave in ways that make it pretty much always unsafe
replies(1): >>42148048 #
17. TimTheTinker ◴[] No.42147978[source]
Right - in either case it's lying, which is crossing a moral line (which is far more important to avoid than a legal line).
18. chefandy ◴[] No.42148048{6}[source]
That just changes the parameters of negligence. On a country road in the middle of a bunch of farm land where you can see for miles, it doesn’t change a thing.
19. mbrock ◴[] No.42148090{3}[source]
But following the law is itself a load bearing aspect of the social contract. Violating building codes, for example, might not cause immediate harm if it's competent but unusual, yet it's important that people follow it just because you don't want arbitrariness in matters of safety. The objective ruleset itself is a value beyond the rules themselves, if the rules are sensible and in accordance with deeper values, which of course they sometimes aren't, in which case we value civil disobedience and activism.
20. slowmotiony ◴[] No.42148188{3}[source]
"not quite as extremely stringent as requested" is a funny way to say they were emitting 40 times more toxic fumes than permitted by law.
replies(1): >>42201013 #
21. mbrock ◴[] No.42148191{5}[source]
It's not so simple. An analogy is the Rust formatter that has no options so everyone just uses the same style. It's minimally "unethical" to use idiosyncratic Rust style just because it goes against the convention so people will wonder why you're so special, etc.

If the rules themselves are bad and go against deeper morality, then it's a different situation; violating laws out of civil disobedience, emergent need, or with a principled stance is different from wanton, arbitrary, selfish cheating.

If a law is particularly unjust, violating the law might itself be virtuous. If the law is adequate and sensible, violating it is usually wrong even if the violating action could be legal in another sensible jurisdiction.

22. ClumsyPilot ◴[] No.42148730{5}[source]
> but that doesn’t make those actions unethical any more than governments explicitly legalizing something makes it ethical

That is, sometimes, sufficient.

If government says ‘seller of a house must disclose issues’ then I rely on the law being followed; if you sell and leave the country, you have defrauded me.

However if I live in a ‘buyer beware’ jurisdiction, then I know I cannot trust the seller and I hire a surveyor and take insurance.

There is a degree of setting expectations- if there is a rule, even if it’s a terrible rule, I as individual can at least take some countermeasures.

You can’t take countermeasures against all forms of illegal behaviour, because there is an infinite number of them. And a truly insane person is not predictable at all.

23. banannaise ◴[] No.42148734{4}[source]
Outsourcing your morality to politicians past and present is not a particularly useful framework.
replies(2): >>42150043 #>>42158851 #
24. int_19h ◴[] No.42149872[source]
It might sound funny in retrospect, but some of us actually bought VW cars on the assumption that, if biodiesel-powered, it would be more green.
25. anonymouskimmer ◴[] No.42150043{5}[source]
Ethics are only morality if you spend your entire time in human social contexts. Otherwise morality is a bit larger, and ethics are a special case of group recognized good and bad behaviors.
26. darkerside ◴[] No.42156023{4}[source]
Lawful good. Or perhaps even lawful neutral?

What if I make sure to have a drink once a week for the summer with my 18 year old before they go to college because I want them to understand what it's like before they go binge with friends? Is that not ethical?

Speeding to the hospital in an emergency? Lying to Nazis to save a Jew?

Law and ethics are more correlated than some are saying here, but the map is not the territory, and it never will be.

replies(1): >>42158884 #
27. Thorrez ◴[] No.42157977{5}[source]
I agree if they're negligent they're unethical. But I also think if they're illegal they're generally unethical. In situations where some other right is more important than the law, underage drinking or illegal right on red would be ethical, such as if alcohol is needed as an emergency pain reliever, or a small amount for religious worship, or if you need to drive to the hospital fast in an emergency.

Abortion opponents view it as killing an innocent person. So that's unethical regardless of whether it's legal. I'm not contesting in any way that legal things can be unethical. Abortion supporters view it as a human right, and that right is more important than the law.

Right on red, underage drinking, and increasing car emissions aren't human rights. So outside of extenuating circumstances, if they're illegal, I see them as unethical.

28. Thorrez ◴[] No.42158851{5}[source]
I'm not outsourcing my morality. There are plenty of actions that are legal that are immoral.

I don't think the government's job is to enforce morality. The government's job is to set up a framework for society to help people get along.

29. Thorrez ◴[] No.42158884{5}[source]
There can be situations where someone's rights are more important than the law. In that case it's ethical to break the law. Speeding to the hospital and lying to Nazis are cases of that. The drinking with your 18 year old, I'm not sure, maybe.

My point though, is that in general, when there's not a right that outweighs the law, it's unethical to break the law.

30. Thorrez ◴[] No.42158917{3}[source]
I agree that ethics should inform law. But I live in a society, and have an ethical duty to respect other members of society. And part of that duty is following the laws of society.
31. Thorrez ◴[] No.42158947{3}[source]
I think you're talking about something different from what sigmoid10 was talking about. sigmoid10 said "manipulate behaviour during those benchmarks". I interpreted that to mean the compiler detects if a benchmark is going on and alters its behavior only then. So this wouldn't impact real life use cases.
32. linksnapzz ◴[] No.42201013{4}[source]
40x infinitesimal is still...infinitesimal.