Most active commenters
  • KaiserPro(6)
  • antonvs(5)
  • tim333(4)
  • (3)

←back to thread

1160 points vxvxvx | 68 comments | | HN request time: 1.9s | source | bottom

Earlier thread: Disrupting the first reported AI-orchestrated cyber espionage campaign - https://news.ycombinator.com/item?id=45918638 - Nov 2025 (281 comments)
1. KaiserPro ◴[] No.45944641[source]
When I worked at a FAANG with a "world leading" AI lab (now run by a teenage data labeller) as an SRE/sysadmin I was asked to use a modified version of a foundation model which was steered towards infosec stuff.

We were asked to try and persuade it to help us hack into a mock printer/dodgy linux box.

It helped a little, but it wasn't all that helpful.

but in terms of coordination, I can't see how it would be useful.

the same for claude, you're API is tied to a bankaccount, and vibe coding a command and control system on a very public system seems like a bad choice.

replies(12): >>45944770 #>>45944798 #>>45945052 #>>45945088 #>>45945276 #>>45948858 #>>45949298 #>>45949721 #>>45950366 #>>45951433 #>>45958070 #>>45961167 #
2. maddmann ◴[] No.45944770[source]
[flagged]
replies(2): >>45944935 #>>45955235 #
3. ACCount37 ◴[] No.45944798[source]
As if that makes any difference to cybercriminals.

If they're not using stolen API creds, then they're using stolen bank accounts to buy them.

Modern AIs are way better at infosec than those from the "world leading AI company" days. If you can get them to comply. Which isn't actually hard. I had to bypass the "safety" filters for a few things, and it took about a hour.

4. heresie-dabord ◴[] No.45944935[source]
I propose a project that we name Blarrble, it will generate text.

We will need a large number of humans to filter and label the data inputs for Blarrble, and another group of humans to test the outputs of Blarrble to fix it when it generate errors and outright nonsense that we can't techsplain and technobabble away to a credulous audience.

Can we make (m|b|tr)illions and solve teenage unemployment before the Blarrble bubble bursts?

replies(1): >>45958375 #
5. Milderbole ◴[] No.45945052[source]
If the article is not just marketing fluff, I assume a bad actor would select Claude not because it’s good at writing attacks, instead a bad actor code would choose it because Western orgs chose Claude. Sonnet is usually the go-to on most coding copilot because the model was trained on good range of data distribution reflecting western coding patterns. If you want to find a gap or write a vulnerability, use the same tool that has ingested patterns that wrote code of the systems you’re trying to break. Or use Claude to write a phishing attack because then output is more likely similar to what our eyes would expect.
replies(2): >>45945323 #>>45945926 #
6. jgalt212 ◴[] No.45945088[source]
> now run by a teenage data labeller

sick burn

replies(2): >>45945472 #>>45946517 #
7. iterateoften ◴[] No.45945276[source]
> you're API is tied to a bankaccount,

There are a lot of middlemen like open router who gladly accept crypto.

replies(1): >>45948830 #
8. Aeolun ◴[] No.45945323[source]
Why would someone in China not select Claude? If the people at Claude not notice then it’s a pure win. If they do notice, what are they going to do, arrest you? The worst thing they can do is block your account, then you have to make a new one with a newly issued false credit card. Whoopie doo.
replies(1): >>45945355 #
9. criemen ◴[] No.45945355{3}[source]
> Why would someone in China not select Claude?

Because Anthropic doesn't provide services in China? See https://www.anthropic.com/supported-countries

replies(3): >>45945510 #>>45945569 #>>45948357 #
10. y-curious ◴[] No.45945472[source]
I don’t know anything about him, but if he is running a department at Meta, he as at the very least a political genius and a teenage data labeller
replies(4): >>45945566 #>>45946125 #>>45946592 #>>45949334 #
11. ◴[] No.45945510{4}[source]
12. tomrod ◴[] No.45945566{3}[source]
It's a simple heuristic that will save a lot of time: something that seems too good to be true usually is.
13. dboreham ◴[] No.45945569{4}[source]
Can confirm Claude doesn't even work in Hong Kong. That said I fired up my VPN and...then it did work.
replies(1): >>45950380 #
14. KaiserPro ◴[] No.45945926[source]
What your describing would be plausible if this was about exploiting claude to get access to organisations that use it.

The gist of the anthropic thing is that "claude made, deployed and coordinated" a standard malware attack. Which is a _very_ different task.

Side note, most code assistants are trained on broadly similar coding datasets (ie github scrapes.)

15. lijok ◴[] No.45946125{3}[source]
[flagged]
replies(1): >>45948213 #
16. williadc ◴[] No.45946517[source]
Alexandr Wang is 28 years old, the same age as Mark Zuckerberg was when Facebook IPO'ed,
replies(1): >>45947247 #
17. antonvs ◴[] No.45946592{3}[source]
Presumably this is all referring to Alexander Wang, who's 28 now. The data-labeling company he co-founded, Scale AI, was acquired by Meta at a valuation of nearly $30 billion.

But I suppose the criticism is that he doesn't have deep AI model research credentials. Which raises the age-old question of how much technical expertise is really needed in executive management.

replies(4): >>45947704 #>>45948260 #>>45948520 #>>45950151 #
18. smrtinsert ◴[] No.45947247{3}[source]
A business where the distinguishing factor was exclusivity not technical excellence so it tracks.
19. NewsaHackO ◴[] No.45947704{4}[source]
Hopefully he isn’t referring to Alex Wang, as it would invalidate anything else he said in his comment
20. antonvs ◴[] No.45948213{4}[source]
> They hired a teenager to run one of their departments

Except they didn’t. The person in question was 28 when they hired him.

He was a teenager when he cofounded the company that was acquired for thirty billion dollars. But the taste of those really sour grapes must be hard to deal with.

replies(3): >>45948352 #>>45948854 #>>45960825 #
21. KaiserPro ◴[] No.45948260{4}[source]
> how much technical expertise is really needed in executive management.

For running an AI lab? a lot. Put it this way, part of the reason that Meta has squandered its lead is because it decided to fill it's genAI dept (pre wang) with non-ML people.

Now thats fine, if they had decent product design and clear road map as to the products they want to release.

but no, they are just learning ML as they go, coming up with bullshit ideas as they go and seeing what sticks.

But, where it gets worse, is they take the FAIR team and pass them around like a soiled blanket: "You're a team that is pushing the boundaries in research, but also you need stop doing that and work on this chatbot that pretends to be a black gay single mother"

All the while you have a sister department, RL-L run by Abrash, who lets you actually do real research.

Which means most of FAIR have fucked off to somewhere less stressful, and more concentrated on actually doing research, rather than posting about how you're doing research.

Wangs misteps are numerous, the biggest one is re-platforming the training system. Thats a two year project right there, for no gain. It also force forks you from the rest of the ML teams. Given how long it took to move to MAST from fblearner, its going be a long slog. And thats before you tackle increasing GPU efficiency.

replies(1): >>45950341 #
22. KaiserPro ◴[] No.45948352{5}[source]
[flagged]
replies(2): >>45950398 #>>45960810 #
23. xadhominemx ◴[] No.45948357{4}[source]
Not really a relevant issue or concern for a nation state backed hack…
replies(1): >>45951281 #
24. tomrod ◴[] No.45948520{4}[source]
> Which raises the age-old question of how much technical expertise is really needed in executive management.

For whomever you choose to set as the core decision maker, you get out whatever their expertise is with minor impact by their guides.

Scaling a business is a skill set. It's not a skill set that captures or expands the frontier of AI, so it's clearly in the realm to label the gentleman's expensive buyout is a product development play instead of a technology play.

25. mrtesthah ◴[] No.45948830[source]
Can you show me exactly how to pay for open router with monero? Because it doesn’t seem possible.
replies(1): >>45949680 #
26. NewsaHackO ◴[] No.45948854{5}[source]
I could not imagine being as salty as the original poster seems to be about Alex Wang. To hold that amount of hate for a superior that is more successful than you can’t be good for the soul
replies(2): >>45949451 #>>45951306 #
27. semiinfinitely ◴[] No.45948858[source]
meta was never "world leading"
replies(1): >>45949728 #
28. cadamsdotcom ◴[] No.45949298[source]
I think the high order bit here is you were working with models from previous generations.

In other words, since the latest generation of models have greater capabilities the story might be very different today.

replies(1): >>45949694 #
29. tim333 ◴[] No.45949334{3}[source]
I was just watching the Y Combinator interview with Alexandr Wang who I guess may be being referred to https://youtu.be/5noIKN8t69U

The teenage data labeler thing was a bit of an exaggeration. He did found scale.ai at nineteen which does data labeling amongst other things.

replies(2): >>45950072 #>>45950343 #
30. lijok ◴[] No.45949451{6}[source]
You’re taking this a tad too seriously
31. Tiberium ◴[] No.45949680{3}[source]
There are tons of websites that will happily swap Monero for Ethereum, and then you can use it to pay. Most of those websites never actually do KYC or proper fund verification, unless you're operating on huge amounts or is suspicious in some other way.
32. Tiberium ◴[] No.45949694[source]
Not sure why you're being downvoted, your observation is very correct here, newer models are indeed a lot better, and even at the time that foundational model (even if fine tuned) might've been worse than a commercial model from OpenAI/Anthropic.
33. throwaway2037 ◴[] No.45949721[source]

    > now run by a teenage data labeller
Do you mean Alexandr Wang? Wiki says he is 28 years old. I don't understand.
34. robrenaud ◴[] No.45949728[source]
pytorch
replies(1): >>45957194 #
35. ulfw ◴[] No.45950072{4}[source]
What other things?
replies(2): >>45951085 #>>45952536 #
36. gpi ◴[] No.45950151{4}[source]
Alexandr
replies(1): >>45950405 #
37. lp251 ◴[] No.45950341{5}[source]
why did they move to fblearner

what is the new training platform

I must know

replies(1): >>45951699 #
38. rhines ◴[] No.45950343{4}[source]
I watched this interview when I first heard about Alexandr Wang. I'd seen he was the youngest self made billionaire, which is a pretty impressive credential to have under your belt, and I wanted to see if I could get a read on what sets him apart.

Unfortunately he doesn't reveal any particular intelligence, insight, or drive in the interview, nor does he in other videos I found. Possibly he hides it, or possibly his genius is beyond me. Or possibly he had good timing on starting a data labelling company and then leveraged his connections in SV (including being roommates with Sam Altman) to massively inflate Scale AI's valuation and snag a Meta acquisition.

replies(4): >>45951300 #>>45951671 #>>45952818 #>>45954930 #
39. 0xWTF ◴[] No.45950366[source]
> "world leading" AI lab (now run by a teenage data labeller)

Aarush Sah?

40. 0xWTF ◴[] No.45950380{5}[source]
Yeah, I love folks who worry about China having access to models and GPUs. I mean, friend, they have 1.3B people. They could put a crack AI team in every country in the world, tomorrow. But yes, instead, it's far cheaper to let each of those AI teams VPN to any country, all the time.
replies(1): >>45952983 #
41. antonvs ◴[] No.45950398{6}[source]
> Comic hyperbole darling.

Even if you say so yourself.

> I know that's hard to understand, especially when you're one of the start up elect, who still believes.

There's a lot of projection going on in that sentence.

replies(1): >>45951973 #
42. antonvs ◴[] No.45950405{5}[source]
Thanks
43. objektif ◴[] No.45951085{5}[source]
Semantic tagging.
replies(1): >>45951745 #
44. BobbyJo ◴[] No.45951281{5}[source]
Or even a regular guy for that matter... VPNs exist.
45. ◴[] No.45951300{5}[source]
46. BobbyJo ◴[] No.45951306{6}[source]
a superior is kind of a loaded way to say "executive" or "company's leadership".
47. ngcazz ◴[] No.45951433[source]
Wouldn't it be relatively cheap to use Claude as a self-organizing control backplane for invoking the MCP tools that would actually do the work?
48. tim333 ◴[] No.45951671{5}[source]
I got the impression he's intelligent and hard working but to a large extent got lucky. I mean his idea was to kind of do a better version of Mechanical Turk which is ok as an idea but not amazing or anything. But then all these LLM companies were getting billions thrown at them by investors thinking they'd be AGI soon but they didn't work well without lots of humans doing fine tuning and Wang's company provided an outlet to throw the money at to get humans to try to do that.

I don't know how that will go at Meta. At the moment having lots of humans tweek LLMs still seems to be the main thing at the AI companies but that could change.

49. KaiserPro ◴[] No.45951699{6}[source]
Meta has been itching to kill FBlearner for a while. Its basically an airflow style interface (much better to use as a dev, not sure about admin, I think it might even pre-date airflow)

They are mostly moved to MAST for GPU stuff now I dpn;t think any GPUs are assigned to fblearner anymore. This is a shame because it feels a bit less integrated into python and feels a bit more like "run your exe on n machines" however, it has a more reliable mechanism for doing multi-GPU things, which is key for doing any kind of research at speed.

My old team are not in the super intelligence org, so I don't have much details on the new training system, but there was lots of noise about "just using vercel" which is great apart from all of the steps and hoops you need to go through before you can train on any kind of non-opensource data. (FAIR had/has thier own cluster on AWS, but that meant that they couldn't use it to train on data we collected internally for research (ie paid studies and data from employees that were bribed with swag)

I've not caught up with the drama for the other choices. Either way, its kinda funny to watch "not invented here syndrome" smashing in to "also not invented here syndrome"

50. ulfw ◴[] No.45951745{6}[source]
So... Tagging and labeling ok.
51. KaiserPro ◴[] No.45951973{7}[source]
[flagged]
52. tim333 ◴[] No.45952536{5}[source]
They do testing like 'humanities last exam' and they build custom LLMs for some of the largest companies, the defense dept and other US govt stuff - bit here https://youtu.be/5noIKN8t69U?t=2037
53. id ◴[] No.45952818{5}[source]
Or maybe, just maybe, becoming a billionaire has way more to do with luck than anything else.

I don't know about any billionaire in the history of billionaires who appears to have gotten there solely based on special abilities. Being born into the right circumstances is all it really takes.

replies(3): >>45953187 #>>45953765 #>>45956248 #
54. glenneroo ◴[] No.45952983{6}[source]
If they actually cared, they would just block VPNs. Valve does this when you try to create an account.
replies(2): >>45953217 #>>45983641 #
55. tim333 ◴[] No.45953187{6}[source]
Oprah Winfrey? Still some luck but she didn't start in great circustances.
56. fluoridation ◴[] No.45953217{7}[source]
If we're talking about state funding, that's not a problem. You just send a national to live in a residential area and then a team can proxy through that connection.
57. never_giveup ◴[] No.45953765{6}[source]
Surely
58. ◴[] No.45954930{5}[source]
59. mv4 ◴[] No.45955235[source]
I used to work at RL so I instantly knew what he was referring to.
60. jandrese ◴[] No.45956248{6}[source]
> Being born into the right circumstances is all it really takes.

You do still need to do the work. People have squandered golden opportunities because they didn't put in the effort.

61. semiinfinitely ◴[] No.45957194{3}[source]
jax
62. creatonez ◴[] No.45958070[source]
> the same for claude, you're API is tied to a bankaccount, and vibe coding a command and control system on a very public system seems like a bad choice.

Aside from middlemen as others have suggested - You can also just procure hundreds of hacked accounts for any major service through spyware data dump marketplaces. Some percentage of them will have payment already set up. Steal their browser cookies, use it until they notice and cancel / change their password, then move on to the next stolen account. Happens all the time these days.

63. johnwheeler ◴[] No.45958375{3}[source]
Where do I write the check?
64. tomhow ◴[] No.45960810{6}[source]
> Comic hyperbole darling. I know that's hard to understand, especially when you're one of the start up elect, who still believes.

Please omit patronizing swipes like this from comments on HN. You have no idea what the parent commenter "believes", but we know very well that sneering like this only makes HN worse. Please take a moment to remind yourself of the guidelines and make an effort to observe them in future. https://news.ycombinator.com/newsguidelines.html

65. tomhow ◴[] No.45960825{5}[source]
> But the taste of those really sour grapes must be hard to deal with

Please don't sneer at fellow community members on HN, and don't reply to a bad comment with a worse one; it just makes HN seem like a more mean and miserable place. The comment would have been fine without that last sentence.

replies(1): >>45966796 #
66. KETpXDDzR ◴[] No.45961167[source]
Yeah, I gave my AWS root API key to Cursor in agent mode. I learned that AWS charges ridiculous amounts for transferring and storing data.
67. antonvs ◴[] No.45966796{6}[source]
Content is more important to me than tone.

Much of this subthread is nothing more than gossip about someone people are apparently jealous of. Talk about a "mean and miserable place." Techbros upset that they didn't cash out as big.

68. puremachinery ◴[] No.45983641{7}[source]
Commercial VPNs are relatively easy to block, because they use known IP ranges that companies can blacklist. But it's trivial to set up a private VPN with unique IPs such that VPN blocking becomes much less straightforward and much more resource intensive, for example by using traffic pattern analysis or behavioral fingerprinting.