Anthropic’s paper smells like bullshit

(djnn.sh)

Earlier thread: Disrupting the first reported AI-orchestrated cyber espionage campaign - https://news.ycombinator.com/item?id=45918638 - Nov 2025 (281 comments)

1. KaiserPro ◴[16 Nov 25 12:33 UTC] No.45944641[source]▶

>>45944296 (OP) #

When I worked at a FAANG with a "world leading" AI lab (now run by a teenage data labeller) as an SRE/sysadmin I was asked to use a modified version of a foundation model which was steered towards infosec stuff.

We were asked to try and persuade it to help us hack into a mock printer/dodgy linux box.

It helped a little, but it wasn't all that helpful.

but in terms of coordination, I can't see how it would be useful.

the same for claude, you're API is tied to a bankaccount, and vibe coding a command and control system on a very public system seems like a bad choice.

replies(12): >>45944770 #>>45944798 #>>45945052 #>>45945088 #>>45945276 #>>45948858 #>>45949298 #>>45949721 #>>45950366 #>>45951433 #>>45958070 #>>45961167 #

2. maddmann ◴[16 Nov 25 12:55 UTC] No.45944770[source]▶

>>45944641 (TP) #

[flagged]

replies(2): >>45944935 #>>45955235 #

3. ACCount37 ◴[16 Nov 25 13:01 UTC] No.45944798[source]▶

>>45944641 (TP) #

As if that makes any difference to cybercriminals.

If they're not using stolen API creds, then they're using stolen bank accounts to buy them.

Modern AIs are way better at infosec than those from the "world leading AI company" days. If you can get them to comply. Which isn't actually hard. I had to bypass the "safety" filters for a few things, and it took about a hour.

4. heresie-dabord ◴[16 Nov 25 13:20 UTC] No.45944935[source]▶

>>45944770 #

I propose a project that we name Blarrble, it will generate text.

We will need a large number of humans to filter and label the data inputs for Blarrble, and another group of humans to test the outputs of Blarrble to fix it when it generate errors and outright nonsense that we can't techsplain and technobabble away to a credulous audience.

Can we make (m|b|tr)illions and solve teenage unemployment before the Blarrble bubble bursts?

replies(1): >>45958375 #

5. Milderbole ◴[16 Nov 25 13:38 UTC] No.45945052[source]▶

>>45944641 (TP) #

If the article is not just marketing fluff, I assume a bad actor would select Claude not because it’s good at writing attacks, instead a bad actor code would choose it because Western orgs chose Claude. Sonnet is usually the go-to on most coding copilot because the model was trained on good range of data distribution reflecting western coding patterns. If you want to find a gap or write a vulnerability, use the same tool that has ingested patterns that wrote code of the systems you’re trying to break. Or use Claude to write a phishing attack because then output is more likely similar to what our eyes would expect.

replies(2): >>45945323 #>>45945926 #

6. jgalt212 ◴[16 Nov 25 13:42 UTC] No.45945088[source]▶

>>45944641 (TP) #

> now run by a teenage data labeller

sick burn

replies(2): >>45945472 #>>45946517 #

7. iterateoften ◴[16 Nov 25 14:11 UTC] No.45945276[source]▶

>>45944641 (TP) #

> you're API is tied to a bankaccount,

There are a lot of middlemen like open router who gladly accept crypto.

replies(1): >>45948830 #

8. Aeolun ◴[16 Nov 25 14:18 UTC] No.45945323[source]▶

>>45945052 #

Why would someone in China not select Claude? If the people at Claude not notice then it’s a pure win. If they do notice, what are they going to do, arrest you? The worst thing they can do is block your account, then you have to make a new one with a newly issued false credit card. Whoopie doo.

replies(1): >>45945355 #

9. criemen ◴[16 Nov 25 14:23 UTC] No.45945355{3}[source]▶

>>45945323 #

> Why would someone in China not select Claude?

Because Anthropic doesn't provide services in China? See https://www.anthropic.com/supported-countries

replies(3): >>45945510 #>>45945569 #>>45948357 #

10. y-curious ◴[16 Nov 25 14:42 UTC] No.45945472[source]▶

>>45945088 #

I don’t know anything about him, but if he is running a department at Meta, he as at the very least a political genius and a teenage data labeller

replies(4): >>45945566 #>>45946125 #>>45946592 #>>45949334 #

11. ◴[16 Nov 25 14:47 UTC] No.45945510{4}[source]▶

>>45945355 #

12. tomrod ◴[16 Nov 25 14:57 UTC] No.45945566{3}[source]▶

>>45945472 #

It's a simple heuristic that will save a lot of time: something that seems too good to be true usually is.

13. dboreham ◴[16 Nov 25 14:57 UTC] No.45945569{4}[source]▶

>>45945355 #

Can confirm Claude doesn't even work in Hong Kong. That said I fired up my VPN and...then it did work.

replies(1): >>45950380 #

14. KaiserPro ◴[16 Nov 25 15:47 UTC] No.45945926[source]▶

>>45945052 #

What your describing would be plausible if this was about exploiting claude to get access to organisations that use it.

The gist of the anthropic thing is that "claude made, deployed and coordinated" a standard malware attack. Which is a _very_ different task.

Side note, most code assistants are trained on broadly similar coding datasets (ie github scrapes.)

15. lijok ◴[16 Nov 25 16:10 UTC] No.45946125{3}[source]▶

>>45945472 #

[flagged]

replies(1): >>45948213 #

16. williadc ◴[16 Nov 25 16:58 UTC] No.45946517[source]▶

>>45945088 #

Alexandr Wang is 28 years old, the same age as Mark Zuckerberg was when Facebook IPO'ed,

replies(1): >>45947247 #

17. antonvs ◴[16 Nov 25 17:07 UTC] No.45946592{3}[source]▶

>>45945472 #

Presumably this is all referring to Alexander Wang, who's 28 now. The data-labeling company he co-founded, Scale AI, was acquired by Meta at a valuation of nearly $30 billion.

But I suppose the criticism is that he doesn't have deep AI model research credentials. Which raises the age-old question of how much technical expertise is really needed in executive management.

replies(4): >>45947704 #>>45948260 #>>45948520 #>>45950151 #

18. smrtinsert ◴[16 Nov 25 18:31 UTC] No.45947247{3}[source]▶

>>45946517 #

A business where the distinguishing factor was exclusivity not technical excellence so it tracks.

19. NewsaHackO ◴[16 Nov 25 19:29 UTC] No.45947704{4}[source]▶

>>45946592 #

Hopefully he isn’t referring to Alex Wang, as it would invalidate anything else he said in his comment

20. antonvs ◴[16 Nov 25 20:41 UTC] No.45948213{4}[source]▶

>>45946125 #

> They hired a teenager to run one of their departments

Except they didn’t. The person in question was 28 when they hired him.

He was a teenager when he cofounded the company that was acquired for thirty billion dollars. But the taste of those really sour grapes must be hard to deal with.

replies(3): >>45948352 #>>45948854 #>>45960825 #

21. KaiserPro ◴[16 Nov 25 20:47 UTC] No.45948260{4}[source]▶

>>45946592 #

> how much technical expertise is really needed in executive management.

For running an AI lab? a lot. Put it this way, part of the reason that Meta has squandered its lead is because it decided to fill it's genAI dept (pre wang) with non-ML people.

Now thats fine, if they had decent product design and clear road map as to the products they want to release.

but no, they are just learning ML as they go, coming up with bullshit ideas as they go and seeing what sticks.

But, where it gets worse, is they take the FAIR team and pass them around like a soiled blanket: "You're a team that is pushing the boundaries in research, but also you need stop doing that and work on this chatbot that pretends to be a black gay single mother"

All the while you have a sister department, RL-L run by Abrash, who lets you actually do real research.

Which means most of FAIR have fucked off to somewhere less stressful, and more concentrated on actually doing research, rather than posting about how you're doing research.

Wangs misteps are numerous, the biggest one is re-platforming the training system. Thats a two year project right there, for no gain. It also force forks you from the rest of the ML teams. Given how long it took to move to MAST from fblearner, its going be a long slog. And thats before you tackle increasing GPU efficiency.

replies(1): >>45950341 #

22. KaiserPro ◴[16 Nov 25 20:59 UTC] No.45948352{5}[source]▶

>>45948213 #

[flagged]

replies(2): >>45950398 #>>45960810 #

23. xadhominemx ◴[16 Nov 25 20:59 UTC] No.45948357{4}[source]▶

>>45945355 #

Not really a relevant issue or concern for a nation state backed hack…

replies(1): >>45951281 #

24. tomrod ◴[16 Nov 25 21:22 UTC] No.45948520{4}[source]▶

>>45946592 #

> Which raises the age-old question of how much technical expertise is really needed in executive management.

For whomever you choose to set as the core decision maker, you get out whatever their expertise is with minor impact by their guides.

Scaling a business is a skill set. It's not a skill set that captures or expands the frontier of AI, so it's clearly in the realm to label the gentleman's expensive buyout is a product development play instead of a technology play.

25. mrtesthah ◴[16 Nov 25 22:02 UTC] No.45948830[source]▶

>>45945276 #

Can you show me exactly how to pay for open router with monero? Because it doesn’t seem possible.

replies(1): >>45949680 #

26. NewsaHackO ◴[16 Nov 25 22:06 UTC] No.45948854{5}[source]▶

>>45948213 #

I could not imagine being as salty as the original poster seems to be about Alex Wang. To hold that amount of hate for a superior that is more successful than you can’t be good for the soul

replies(2): >>45949451 #>>45951306 #

27. semiinfinitely ◴[16 Nov 25 22:06 UTC] No.45948858[source]▶

>>45944641 (TP) #

meta was never "world leading"

replies(1): >>45949728 #

28. cadamsdotcom ◴[16 Nov 25 23:08 UTC] No.45949298[source]▶

>>45944641 (TP) #

I think the high order bit here is you were working with models from previous generations.

In other words, since the latest generation of models have greater capabilities the story might be very different today.

replies(1): >>45949694 #

29. tim333 ◴[16 Nov 25 23:16 UTC] No.45949334{3}[source]▶

>>45945472 #

I was just watching the Y Combinator interview with Alexandr Wang who I guess may be being referred to https://youtu.be/5noIKN8t69U

The teenage data labeler thing was a bit of an exaggeration. He did found scale.ai at nineteen which does data labeling amongst other things.

replies(2): >>45950072 #>>45950343 #

30. lijok ◴[16 Nov 25 23:31 UTC] No.45949451{6}[source]▶

>>45948854 #

You’re taking this a tad too seriously

31. Tiberium ◴[17 Nov 25 00:01 UTC] No.45949680{3}[source]▶

>>45948830 #

There are tons of websites that will happily swap Monero for Ethereum, and then you can use it to pay. Most of those websites never actually do KYC or proper fund verification, unless you're operating on huge amounts or is suspicious in some other way.

32. Tiberium ◴[17 Nov 25 00:04 UTC] No.45949694[source]▶

>>45949298 #

Not sure why you're being downvoted, your observation is very correct here, newer models are indeed a lot better, and even at the time that foundational model (even if fine tuned) might've been worse than a commercial model from OpenAI/Anthropic.

33. throwaway2037 ◴[17 Nov 25 00:09 UTC] No.45949721[source]▶

>>45944641 (TP) #

    > now run by a teenage data labeller

Do you mean Alexandr Wang? Wiki says he is 28 years old. I don't understand.

34. robrenaud ◴[17 Nov 25 00:10 UTC] No.45949728[source]▶

>>45948858 #

pytorch

replies(1): >>45957194 #

35. ulfw ◴[17 Nov 25 01:22 UTC] No.45950072{4}[source]▶

>>45949334 #

What other things?

replies(2): >>45951085 #>>45952536 #

36. gpi ◴[17 Nov 25 01:43 UTC] No.45950151{4}[source]▶

>>45946592 #

Alexandr

replies(1): >>45950405 #

37. lp251 ◴[17 Nov 25 02:23 UTC] No.45950341{5}[source]▶

>>45948260 #

why did they move to fblearner

what is the new training platform

I must know

replies(1): >>45951699 #

38. rhines ◴[17 Nov 25 02:24 UTC] No.45950343{4}[source]▶

>>45949334 #

I watched this interview when I first heard about Alexandr Wang. I'd seen he was the youngest self made billionaire, which is a pretty impressive credential to have under your belt, and I wanted to see if I could get a read on what sets him apart.

Unfortunately he doesn't reveal any particular intelligence, insight, or drive in the interview, nor does he in other videos I found. Possibly he hides it, or possibly his genius is beyond me. Or possibly he had good timing on starting a data labelling company and then leveraged his connections in SV (including being roommates with Sam Altman) to massively inflate Scale AI's valuation and snag a Meta acquisition.

replies(4): >>45951300 #>>45951671 #>>45952818 #>>45954930 #

39. 0xWTF ◴[17 Nov 25 02:29 UTC] No.45950366[source]▶

>>45944641 (TP) #

> "world leading" AI lab (now run by a teenage data labeller)

Aarush Sah?

40. 0xWTF ◴[17 Nov 25 02:31 UTC] No.45950380{5}[source]▶

>>45945569 #

Yeah, I love folks who worry about China having access to models and GPUs. I mean, friend, they have 1.3B people. They could put a crack AI team in every country in the world, tomorrow. But yes, instead, it's far cheaper to let each of those AI teams VPN to any country, all the time.

replies(1): >>45952983 #

41. antonvs ◴[17 Nov 25 02:36 UTC] No.45950398{6}[source]▶

>>45948352 #

> Comic hyperbole darling.

Even if you say so yourself.

> I know that's hard to understand, especially when you're one of the start up elect, who still believes.

There's a lot of projection going on in that sentence.

replies(1): >>45951973 #

42. antonvs ◴[17 Nov 25 02:39 UTC] No.45950405{5}[source]▶

>>45950151 #

Thanks

43. objektif ◴[17 Nov 25 05:37 UTC] No.45951085{5}[source]▶

>>45950072 #

Semantic tagging.

replies(1): >>45951745 #

44. BobbyJo ◴[17 Nov 25 06:30 UTC] No.45951281{5}[source]▶

>>45948357 #

Or even a regular guy for that matter... VPNs exist.

45. ◴[17 Nov 25 06:35 UTC] No.45951300{5}[source]▶

>>45950343 #

46. BobbyJo ◴[17 Nov 25 06:36 UTC] No.45951306{6}[source]▶

>>45948854 #

a superior is kind of a loaded way to say "executive" or "company's leadership".

47. ngcazz ◴[17 Nov 25 07:08 UTC] No.45951433[source]▶

>>45944641 (TP) #

Wouldn't it be relatively cheap to use Claude as a self-organizing control backplane for invoking the MCP tools that would actually do the work?

48. tim333 ◴[17 Nov 25 07:59 UTC] No.45951671{5}[source]▶

>>45950343 #

I got the impression he's intelligent and hard working but to a large extent got lucky. I mean his idea was to kind of do a better version of Mechanical Turk which is ok as an idea but not amazing or anything. But then all these LLM companies were getting billions thrown at them by investors thinking they'd be AGI soon but they didn't work well without lots of humans doing fine tuning and Wang's company provided an outlet to throw the money at to get humans to try to do that.

I don't know how that will go at Meta. At the moment having lots of humans tweek LLMs still seems to be the main thing at the AI companies but that could change.

49. KaiserPro ◴[17 Nov 25 08:05 UTC] No.45951699{6}[source]▶

>>45950341 #

Meta has been itching to kill FBlearner for a while. Its basically an airflow style interface (much better to use as a dev, not sure about admin, I think it might even pre-date airflow)

They are mostly moved to MAST for GPU stuff now I dpn;t think any GPUs are assigned to fblearner anymore. This is a shame because it feels a bit less integrated into python and feels a bit more like "run your exe on n machines" however, it has a more reliable mechanism for doing multi-GPU things, which is key for doing any kind of research at speed.

My old team are not in the super intelligence org, so I don't have much details on the new training system, but there was lots of noise about "just using vercel" which is great apart from all of the steps and hoops you need to go through before you can train on any kind of non-opensource data. (FAIR had/has thier own cluster on AWS, but that meant that they couldn't use it to train on data we collected internally for research (ie paid studies and data from employees that were bribed with swag)

I've not caught up with the drama for the other choices. Either way, its kinda funny to watch "not invented here syndrome" smashing in to "also not invented here syndrome"

50. ulfw ◴[17 Nov 25 08:15 UTC] No.45951745{6}[source]▶

>>45951085 #

So... Tagging and labeling ok.

51. KaiserPro ◴[17 Nov 25 09:01 UTC] No.45951973{7}[source]▶

>>45950398 #

[flagged]

52. tim333 ◴[17 Nov 25 11:03 UTC] No.45952536{5}[source]▶

>>45950072 #

They do testing like 'humanities last exam' and they build custom LLMs for some of the largest companies, the defense dept and other US govt stuff - bit here https://youtu.be/5noIKN8t69U?t=2037

53. id ◴[17 Nov 25 11:52 UTC] No.45952818{5}[source]▶

>>45950343 #

Or maybe, just maybe, becoming a billionaire has way more to do with luck than anything else.

I don't know about any billionaire in the history of billionaires who appears to have gotten there solely based on special abilities. Being born into the right circumstances is all it really takes.

replies(3): >>45953187 #>>45953765 #>>45956248 #

54. glenneroo ◴[17 Nov 25 12:23 UTC] No.45952983{6}[source]▶

>>45950380 #

If they actually cared, they would just block VPNs. Valve does this when you try to create an account.

replies(2): >>45953217 #>>45983641 #

55. tim333 ◴[17 Nov 25 12:59 UTC] No.45953187{6}[source]▶

>>45952818 #

Oprah Winfrey? Still some luck but she didn't start in great circustances.

56. fluoridation ◴[17 Nov 25 13:05 UTC] No.45953217{7}[source]▶

>>45952983 #

If we're talking about state funding, that's not a problem. You just send a national to live in a residential area and then a team can proxy through that connection.

57. never_giveup ◴[17 Nov 25 14:19 UTC] No.45953765{6}[source]▶

>>45952818 #

Surely

58. ◴[17 Nov 25 16:13 UTC] No.45954930{5}[source]▶

>>45950343 #

59. mv4 ◴[17 Nov 25 16:37 UTC] No.45955235[source]▶

>>45944770 #

I used to work at RL so I instantly knew what he was referring to.

60. jandrese ◴[17 Nov 25 18:14 UTC] No.45956248{6}[source]▶

>>45952818 #

> Being born into the right circumstances is all it really takes.

You do still need to do the work. People have squandered golden opportunities because they didn't put in the effort.

61. semiinfinitely ◴[17 Nov 25 19:28 UTC] No.45957194{3}[source]▶

>>45949728 #

jax

62. creatonez ◴[17 Nov 25 20:45 UTC] No.45958070[source]▶

>>45944641 (TP) #

> the same for claude, you're API is tied to a bankaccount, and vibe coding a command and control system on a very public system seems like a bad choice.

Aside from middlemen as others have suggested - You can also just procure hundreds of hacked accounts for any major service through spyware data dump marketplaces. Some percentage of them will have payment already set up. Steal their browser cookies, use it until they notice and cancel / change their password, then move on to the next stolen account. Happens all the time these days.

63. johnwheeler ◴[17 Nov 25 21:14 UTC] No.45958375{3}[source]▶

>>45944935 #

Where do I write the check?

64. tomhow ◴[18 Nov 25 02:40 UTC] No.45960810{6}[source]▶

>>45948352 #

> Comic hyperbole darling. I know that's hard to understand, especially when you're one of the start up elect, who still believes.

Please omit patronizing swipes like this from comments on HN. You have no idea what the parent commenter "believes", but we know very well that sneering like this only makes HN worse. Please take a moment to remind yourself of the guidelines and make an effort to observe them in future. https://news.ycombinator.com/newsguidelines.html

65. tomhow ◴[18 Nov 25 02:42 UTC] No.45960825{5}[source]▶

>>45948213 #

> But the taste of those really sour grapes must be hard to deal with

Please don't sneer at fellow community members on HN, and don't reply to a bad comment with a worse one; it just makes HN seem like a more mean and miserable place. The comment would have been fine without that last sentence.

replies(1): >>45966796 #

66. KETpXDDzR ◴[18 Nov 25 03:45 UTC] No.45961167[source]▶

>>45944641 (TP) #

Yeah, I gave my AWS root API key to Cursor in agent mode. I learned that AWS charges ridiculous amounts for transferring and storing data.

67. antonvs ◴[18 Nov 25 14:43 UTC] No.45966796{6}[source]▶

>>45960825 #

Content is more important to me than tone.

Much of this subthread is nothing more than gossip about someone people are apparently jealous of. Talk about a "mean and miserable place." Techbros upset that they didn't cash out as big.

68. puremachinery ◴[19 Nov 25 19:04 UTC] No.45983641{7}[source]▶

>>45952983 #

Commercial VPNs are relatively easy to block, because they use known IP ranges that companies can blacklist. But it's trivial to set up a private VPN with unique IPs such that VPN blocking becomes much less straightforward and much more resource intensive, for example by using traffic pattern analysis or behavioral fingerprinting.

↑