
524 points andy99 | 6 comments
1. oytis No.44536348
The press release talks a lot about how it was done, but says very little about how its capabilities compare to other open models.
replies(2): >>44536398, >>44536523
2. pantalaimon No.44536398
It's a university; teaching "how it's done" is kind of the point.
replies(1): >>44536825
3. joot82 No.44536523
The model will be released in two sizes — 8 billion and 70 billion parameters [...]. The 70B version will rank among the most powerful fully open models worldwide. [...] In late summer, the LLM will be released under the Apache 2.0 License.

We'll find out in September whether that's true?
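
If the release does land on Hugging Face under Apache 2.0, trying it should look like loading any other causal LM. A minimal sketch using the standard transformers API; the repo id below is a placeholder I made up, since nothing has been published yet:

    # Hypothetical repo id -- the real one isn't known until release.
    # Requires: pip install transformers accelerate
    from transformers import AutoModelForCausalLM, AutoTokenizer

    repo = "swiss-ai/open-llm-8b"  # placeholder, not a real model id
    tok = AutoTokenizer.from_pretrained(repo)
    model = AutoModelForCausalLM.from_pretrained(repo, device_map="auto")

    prompt = "The capital of Switzerland is"
    inputs = tok(prompt, return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=20)
    print(tok.decode(out[0], skip_special_tokens=True))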

replies(2): >>44536885, >>44536967
4. EA-3167 No.44536825
Sure, but usually you teach something that is inherently useful, or that can be applied to some useful endeavor. In this case I think it's fair to ask what the collision of these two bubbles actually achieves, or, if it's just a useful teaching model, what it can be applied to.
5. k__ No.44536885
I'm hoping for a DeepSeek R2, but I fear we'll get a Llama 4.
6. oytis No.44536967
Yeah, I was thinking more of a table with benchmark results.
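
Once the weights are out, anyone can produce such a table with EleutherAI's lm-evaluation-harness. A rough sketch using its Python entry point (lm_eval.simple_evaluate); the model id is the same placeholder as above, and the task choice is just illustrative:

    # pip install lm-eval
    import lm_eval

    # Placeholder model id; swap in the real repo once it exists.
    results = lm_eval.simple_evaluate(
        model="hf",
        model_args="pretrained=swiss-ai/open-llm-8b",
        tasks=["hellaswag", "mmlu"],
        batch_size=8,
    )

    # Print one row per task: task name plus its metric dict.
    for task, metrics in results["results"].items():
        print(task, metrics)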