(medium.com)

83 points peakji | 3 comments | 22 Oct 24 16:07 UTC | HN request time: 0.804s | source

Steiner is a series of reasoning models trained on synthetic data using reinforcement learning. These models can explore multiple reasoning paths in an autoregressive manner during inference and autonomously verify or backtrack when necessary, enabling a linear traversal of the implicit search tree.

Blog: https://medium.com/@peakji/a-small-step-towards-reproducing-...

Hugging Face: https://huggingface.co/collections/peakji/steiner-preview-67...

1. schmeichel ◴[22 Oct 24 19:14 UTC] No.41917645[source]▶

>>41915735 (OP) #

This seems promising! Great work! Any chance there will be a Ollama Modelfile for the masses?

replies(1): >>41917682 #

2. peakji ◴[22 Oct 24 19:18 UTC] No.41917682[source]▶

>>41917645 (TP) #

GGUF files are available on HF: https://huggingface.co/peakji/steiner-32b-preview-gguf

I haven't personally used Ollama Modelfile, but I think it should be relatively easy to convert from GGUF?

replies(1): >>41924937 #

3. ca_tech ◴[23 Oct 24 13:35 UTC] No.41924937[source]▶

>>41917682 #

You can now run any huggingface model using the following command

ollama run hf.co/{username}/{repository}

Example: ollama run hf.co/peakji/steiner-32b-preview-gguf:Q4_K_M

Source: https://huggingface.co/docs/hub/en/ollama

↑

Show HN: Steiner – An open-source reasoning model inspired by OpenAI o1