Windsurf SWE-1: Our First Frontier Models

1. firejake308 ◴[15 May 25 21:35 UTC] No.43999554[source]▶

I'm confused why they are working on their own frontier models if they are going to be bought by OpenAI anyway. I guess this is something they were working on before the announcement?

replies(6): >>44002084 #>>44002155 #>>44002246 #>>44002598 #>>44002975 #>>44003869 #

2. kristopolous ◴[16 May 25 05:25 UTC] No.44002084[source]▶

>>43999554 (TP) #

Must have been. These things take months.

3. anshumankmr ◴[16 May 25 05:41 UTC] No.44002155[source]▶

>>43999554 (TP) #

Perhaps also to get more money, if they believed their model was good and had amassed some good training data that OpenAI could leverage, apart from the user base.

4. allenleee ◴[16 May 25 06:03 UTC] No.44002246[source]▶

>>43999554 (TP) #

It seems OpenAI acquired Windsurf but is letting it operate independently, keeping its own brand and developing its own coding models. That way, if Windsurf runs into technical problems, the backlash lands on Windsurf, not OpenAI. It’s a smart way to innovate while keeping the main brand safe.

replies(4): >>44002285 #>>44003466 #>>44005701 #>>44006118 #

5. riffraff ◴[16 May 25 06:12 UTC] No.44002285[source]▶

>>44002246 #

But doesn't this mean they have twice the costs in training? I was under the impression that was still the most expensive part of these companies' balance.

replies(2): >>44002309 #>>44003091 #

6. kcorbitt ◴[16 May 25 06:18 UTC] No.44002309{3}[source]▶

>>44002285 #

It's very unlikely that they're doing their own pre-training, which is the longest and most expensive part of creating a frontier model (if they were, they'd likely brag about it).

Most likely they built this as a post-train of an open model that is already strong on coding like Qwen 2.5.

7. dyl000 ◴[16 May 25 07:17 UTC] No.44002598[source]▶

>>43999554 (TP) #

openAI models have an issue where they are pretty good at everything but not incredible at anything. They're too well rounded.

For coding you use Anthropic or Google models; I haven't found anyone who swears by OpenAI models for coding... Their reasoning models are either too expensive or hallucinate massively to the point of being useless... I would assume the GPT-4.1 family will be popular with SWEs.

Having a smaller-scope model (agentic coding only) allows for much cheaper inference and lets Windsurf build its own moat (so far agentic IDEs haven't had a moat).

replies(1): >>44002812 #

8. jjani ◴[16 May 25 07:51 UTC] No.44002812[source]▶

>>44002598 #

> openAI models have an issue where they are pretty good at everything but not incredible at anything. They're too well rounded.

This suggests OpenAI models do have tasks they're better at than the "less rounded" competition, who have tasks they're weaker in. Could you name a single such task (except for image generation, which is an entirely different use case) that OpenAI models are better at than Gemini 2.5 and Claude 3.7 without costing at least 5x as much?

9. seunosewa ◴[16 May 25 08:20 UTC] No.44002975[source]▶

>>43999554 (TP) #

They were working on the model before the acquisition. It makes sense to test it and see how it does instead of throwing the work away. Their data will probably be used to improve GPT-4.1, o4-mini-high, and other OpenAI coding models.

10. rfoo ◴[16 May 25 08:40 UTC] No.44003091{3}[source]▶

>>44002285 #

Mid/post-training does not cost that much, except maybe large-scale RL, but even this is more of an infra problem. If anything, the cost is mostly in running various experiments (i.e. the process of doing research).

It is very puzzling why "wrapper" companies don't (and religiously say they won't ever) do something on this front. The only barrier is talent.

replies(1): >>44003222 #

11. anshumankmr ◴[16 May 25 09:01 UTC] No.44003222{4}[source]▶

>>44003091 #

You might be underestimating the barrier to hiring the really smart people. OpenAI, Google, etc. would be hiring and poaching people like crazy, offering cushy bonuses and TCs that would blow your mind (like, say, Noam Brown at OpenAI). And some of the more ambitious ones would start their own ventures (like, say, Ilya etc.).

That being said, I am sure a lot of the so-called wrapper companies are paying insanely well too, but competing with FAANGMULA might be trickier for them.

replies(2): >>44003495 #>>44003716 #

12. OtherShrezzing ◴[16 May 25 09:46 UTC] No.44003466[source]▶

>>44002246 #

This is effectively how Microsoft is treating OpenAI.

13. NitpickLawyer ◴[16 May 25 09:51 UTC] No.44003495{5}[source]▶

>>44003222 #

FAANGMULA ... Microsoft, Uber?, L??, Anthropic? Who's the L?

replies(2): >>44003605 #>>44006161 #

14. Archonical ◴[16 May 25 10:11 UTC] No.44003605{6}[source]▶

>>44003495 #

Lyft.

15. whywhywhywhy ◴[16 May 25 10:31 UTC] No.44003716{5}[source]▶

>>44003222 #

Any half decent and methodical software engineer can fine tune/repurpose a model if you have the data and the money to burn on compute and experiment runs, which they do.

replies(2): >>44004352 #>>44004373 #

16. jstummbillig ◴[16 May 25 10:59 UTC] No.44003869[source]▶

>>43999554 (TP) #

Why would OpenAI not let smart people work on models? That seems to be what they do. The point is: They are no longer "their own" models. They are now OpenAI models. If they suck, if they are redundant, if there is no idea there that makes sense, that effort will not continue indefinitely.

17. anshumankmr ◴[16 May 25 12:00 UTC] No.44004352{6}[source]▶

>>44003716 #

Fine-tuning/distilling etc. is fine. I was speaking to the original commenter's question about research, which is where things are trickier. Fine-tuning is something even I managed, and Unsloth has removed even more barriers to training some of the more commonly used open-source models.

18. brookst ◴[16 May 25 12:01 UTC] No.44004373{6}[source]▶

>>44003716 #

They can absolutely do it, but they will get poorer results than someone who really understands LLMs. There is still a huge amount of taste and art in the sourcing and curation of data for fine tuning.

19. ActionHank ◴[16 May 25 14:06 UTC] No.44005701[source]▶

>>44002246 #

Windsurf is a hedge against MS + VSCode and GH + copilot.

OAI is trying frantically to build a moat without doing any digging.

20. sunshinekitty ◴[16 May 25 14:43 UTC] No.44006118[source]▶

>>44002246 #

This is an incredibly premature statement to make. The acquisition announcement is days old.

21. riffraff ◴[16 May 25 14:48 UTC] No.44006161{6}[source]▶

>>44003495 #

A is Airbnb, afair.