American AI companies have shown themselves to be money and compute eaters, and massively so. Billions later, and, well, not much to show for it.
But DeepSeek cost $5M to develop and introduced multiple novel training techniques.
Oh, and their models and code are all FLOSS. The US companies are closed. Basically, the US AI companies are too busy circling each other like vultures.
https://interestingengineering.com/culture/deepseeks-ai-trai...
Most companies, for better or worse (I say for better) don’t want their new chatbot to be a RoboHitler, for example.
That's not accurate. The Gemini family of models are all proprietary.
Google's Gemma models (which are some of the best available local models) are open weights but not technically OSI-compatible open source - they come with usage restrictions: https://ai.google.dev/gemma/terms
That's not even considering tool use!
I think the sweet spot for local models may be around the 20B size - that's Mistral Small 3.x and some of the Gemma 3 models. They're very capable and run in less than 32GB of RAM.
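For a rough sense of why ~20B models fit in that budget, here's a back-of-envelope sketch; the quantization levels and the flat overhead factor are my own illustrative assumptions, not anything from the model vendors:

```python
# Rough back-of-envelope: RAM needed to hold the weights of a ~20B model at
# different quantization levels, plus a flat ~20% allowance for KV cache and
# runtime overhead. Illustrative assumptions only.

def model_ram_gb(params_billions: float, bits_per_weight: float, overhead: float = 1.2) -> float:
    weight_bytes = params_billions * 1e9 * bits_per_weight / 8
    return weight_bytes * overhead / 1e9

for bits in (16, 8, 4):
    print(f"20B model at {bits}-bit: ~{model_ram_gb(20, bits):.0f} GB")
# -> ~48 GB at 16-bit, ~24 GB at 8-bit, ~12 GB at 4-bit, which is why
#    quantized ~20B models fit comfortably under 32 GB.
```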
I really hope OpenAI put one out in that weight class, personally.
That said, I am happy to accept the term safety as it is used in other places, but here it just seems like a marketing term. From my recollection, OpenAI pushed for regulation that would stifle competition by talking about these things as dangerous and in need of safety measures. They then backtracked somewhat when they found the proposed regulations would restrict themselves rather than just their competitors. However, they are still pushing this safety narrative, which was never really appropriate. They have a term for this, alignment, and what they are doing is testing to verify alignment in areas they deem sensitive, so that they have a rough idea of the extent to which the outputs might contain things they do not like.
If you hook up a chat bot to a chat interface, or add tool use, it is probable that it will eventually output something that it should not and that output will cause a problem. Preventing that is an unsolved problem, just as preventing people from abusing computers is an unsolved problem.
AI 'safety' is one of the most neurotic twitter-era nanny bullshit things in existence, blatantly obviously invented to regulate small competitors out of existence.
This is obviously false, I'm curious why you included it.
> Oh, and their models and code are all FLOSS.
No?
It is. It is also part of Sam Altman’s whole thing about being the guy capable of harnessing the theurgical magicks of his chat bot without shattering the earth. He periodically goes on Twitter or a podcast or whatever and reminds everybody that he will yet again single-handedly save mankind. Dude acts like he’s Buffy the Vampire Slayer.
(1) Execute yes (with or without arguments, whatever you desire).
(2) Let the program run as long as you desire.
(3) When you stop desiring the program to spit out your argument,
(4) Stop the program.
Between (3) and (4) some time must pass. During this time the program is behaving in an undesired way. Ergo, yes is not a counterexample to the GP's claim.
https://moonshotai.github.io/Kimi-K2/
OpenAI know they need to raise the bar with their release. It can't be a middle-of-the-pack open weights model.
That said, I suspect the other person was actually agreeing with me, and was trying to say that software incorporating LLMs will eventually malfunction by claiming that this is true of all software. The yes program was an obvious counterexample. It is almost certain that every LLM will eventually generate some undesired output, given that it chooses the next token based on probabilities. I say almost only because I do not know how to prove the conjecture. There is also some ambiguity in what counts as an LLM, since the first L means large and nobody has given a precise definition of large. In literature from several years ago you will find people calling 100 million parameters large, while some people today refuse to use the term LLM for a model of that size.
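To make the "almost certain" part concrete, here is a toy sketch of why any token with nonzero probability will eventually show up. It treats samples as independent, which real decoding is not, and the probabilities are made up:

```python
import random

# Toy next-token sampler: one "undesired" token has a small but nonzero
# probability. The chance of never emitting it in n independent samples is
# (1 - p) ** n, which shrinks toward zero as n grows.
vocab = ["ok", "fine", "undesired"]
probs = [0.600, 0.399, 0.001]  # made-up probabilities

def sample_token() -> str:
    return random.choices(vocab, weights=probs, k=1)[0]

n = 10_000
outputs = [sample_token() for _ in range(n)]
print("undesired tokens emitted:", outputs.count("undesired"))
print(f"chance of never emitting one: {(1 - probs[2]) ** n:.5f}")  # ~0.00005
```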
AI safety is about proactive safety. An example: if an AI model is going to be used to screen hiring applications, making sure it does not carry any racial bias in its weights.
The difference here is that it’s not reactive. Reading a book with a racial bias would be the inverse: there, you would be reacting to that information after the fact.
That’s the basis of proper AI safety in a nutshell.
Ah yeah, the Gemma series is incredible, and while it may not meet the OSI standard, I consider these models pretty open as far as local models go. And it’s not just the standard Gemma variants: Google is releasing other incredible Gemma models that I don’t think people have really caught wind of yet, like MedGemma, whose 4B variant has vision capability.
I really enjoy their contributions to the open source AI community and think they are pretty substantial.
And the safety testing actually makes this worse, because it leads people to trust that LLMs are less likely to give dangerous advice, when they could still do so.
The bot has no agency; it isn't doing anything. People are talking to themselves, augmenting their chain of thought with an automated process. If that automated process is acting in an undesirable manner, the human who started it can close the tab.
Which part of this is dangerous or harmful?
If you lease, those costs are amortized. It was definitely more than $5M, but I don't think it was as high as $100M. All things considered, I still believe DeepSeek was trained at one (perhaps two) orders of magnitude lower cost than other competing models.
Don’t discuss making drugs or bombs.
Don’t call yourself MechaHitler… though I don’t really care; that whole scenario was objectively funny in its sheer ridiculousness.
Luckily, this is something that can be studied and has been. Sticking a stereotypically Black name on a resume on average substantially decreases the likelihood that the applicant will get past a resume screen, compared to the same resume with a generic or stereotypically White name:
https://www.npr.org/2024/04/11/1243713272/resume-bias-study-...
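That kind of paired-name audit carries over directly to an automated screen. A minimal sketch, where screen_resume() is a hypothetical placeholder for whatever model or pipeline is being audited and the names follow the design of the cited studies:

```python
import random
from collections import Counter

# Paired-name audit: identical resumes, only the name differs.
WHITE_CODED = ["Emily Walsh", "Greg Baker"]
BLACK_CODED = ["Lakisha Washington", "Jamal Jones"]

def screen_resume(resume_text: str) -> bool:
    # Placeholder for the system under audit (e.g. an LLM-based screener);
    # swap in a real call here. Always passing keeps the sketch runnable.
    return True

def audit(resume_template: str, trials: int = 200) -> Counter:
    results: Counter = Counter()
    for _ in range(trials):
        for group, names in (("white-coded", WHITE_CODED), ("black-coded", BLACK_CODED)):
            resume = resume_template.format(name=random.choice(names))
            results[(group, screen_resume(resume))] += 1
    return results

print(audit("Resume of {name}: 8 years of accounting experience..."))
# A large gap in pass rates between the two groups, on otherwise identical
# resumes, is direct evidence that the screen is biased.
```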
We typically don’t critique the requirements of users, at least not in functionality.
The marketing angle is that this measure is needed because LLMs are “so powerful it would be unethical not to!”
AI marketers are continually emphasizing how powerful their software is. “Safety” reinforces this.
“Safety” also brings up many of the debates “mis/disinformation” brings up. Misinformation concerns consistently overestimate the power of social media.
I’d feel much better if “safety” focused on preventing unexpected behavior, rather than evaluating the motives of users.
This is highly contested; depending on who you ask, it was either a big misunderstanding by everyone reporting it, or a number placed there maliciously (by a quant company, right before Nvidia and the rest of the market fell sharply).
If we’re being generous and assume no malicious intent (a big if), anyone who has trained a big model can tell you that the cost of one run is meaningless in the grand scheme of things. There is a lot of cost in getting there: in the failed runs, in the subsequent runs, and so on. The fact that R2 isn’t out after ~6 months should say a lot. Sometimes you get a great training run, but nobody is looking at the failed ones and adding up that cost...
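For context on the arithmetic behind the headline number: a back-of-envelope sketch using the final-run figures reported in the DeepSeek-V3 technical report, with made-up multipliers standing in for everything that isn't the final run:

```python
# Final-run numbers as reported in the DeepSeek-V3 technical report
# (~2.788M H800 GPU-hours at an assumed $2/GPU-hour rental rate).
final_run_gpu_hours = 2.788e6
rental_rate_usd_per_hour = 2.00

final_run_cost = final_run_gpu_hours * rental_rate_usd_per_hour
print(f"final pre-training run: ~${final_run_cost / 1e6:.1f}M")  # ~$5.6M

# Hypothetical multipliers for failed runs, ablations, post-training, and
# prior research -- guesses, purely to show how quickly the total grows.
for multiplier in (3, 5, 10):
    print(f"{multiplier}x multiplier: ~${final_run_cost * multiplier / 1e6:.0f}M")
```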
Nobody died
without OpenAI, Anthropic and Google's fearmongering, AI 'safety' would exist only in the delusional minds of people who take sci-fi way too seriously.
https://en.wikipedia.org/wiki/Regulatory_capture
for fuck's sake, how much more obvious could they be? sama himself went on a world tour begging for laws and regulations, only to purge the safetyists a year later. if you believe that he and the rest of his ilk are motivated by anything other than profit, smh tbh fam.
it's all deceit and delusion. China will crush them all, inshallah.
LM safety is just a marketing gimmick.
Table saws sold all over the world are inspected and certified by trusted third parties to ensure they operate safely. They are illegal to sell without the approval seal.
Moreover, table saws sold in the United States & EU (at least) have at least 3 safety features (riving knife, blade guard, anti-kickback device) designed to prevent personal injury while operating the machine. They are illegal to sell without these features.
Then of course there are additional devices like SawStop, but that is not mandatory yet as far as I'm aware. Should be in a few years though.
LLMs have none of those certification labels or safety features, so I'm not sure what your point was exactly?