←back to thread

68 points peakji | 4 comments | | HN request time: 0.001s | source

Steiner is a series of reasoning models trained on synthetic data using reinforcement learning. These models can explore multiple reasoning paths in an autoregressive manner during inference and autonomously verify or backtrack when necessary, enabling a linear traversal of the implicit search tree.

Blog: https://medium.com/@peakji/a-small-step-towards-reproducing-...

Hugging Face: https://huggingface.co/collections/peakji/steiner-preview-67...

Show context
swyx ◴[] No.41917274[source]
advice to OP - you hurt your own credibility posting on medium dot com. just blog on huggingface or substack or hashnode.
replies(3): >>41917456 #>>41918832 #>>41921519 #
1. peakji ◴[] No.41917456[source]
I'm new here. Just curious, why avoid Medium? Is it a Hacker News thing, or did I miss something?
replies(3): >>41918051 #>>41918091 #>>41918187 #
2. ◴[] No.41918051[source]
3. whatshisface ◴[] No.41918091[source]
Medium doesn't "hurt your credibility" nearly as much as revealing that one's arsenal of litmus tests is suffering from such a paucity of real knowledge that one bases it on the web design, but Medium has a lot of annoying popups. A lot of people like Substack better and they have a paid subscriber thing that works well.

(realistically speaking, experts tend to know less about the blog hosting ecosystem the more they know about their domain)

4. swyx ◴[] No.41918187[source]
its just a "tell" that you dont mind the poor reader experience and being associated with the rest of low quality slop that is on medium. many of us here have simply given up clicking on anything medium related