(arxiv.org)

248 points doener | 1 comments | 15 Apr 25 10:17 UTC | HN request time: 0.234s | source

1. YetAnotherNick ◴[15 Apr 25 12:15 UTC] No.43691621[source]▶

They compared with Llama 3.1 and found that to be better on average for their tasks like European MMLU. And Llama 3.1 is the worst in the batch with Qwen 2.5 and Gemma 3 being significantly better.

↑

Teuken-7B-Base and Teuken-7B-Instruct: Towards European LLMs (2024)