←back to thread

321 points denysvitali | 3 comments | | HN request time: 0s | source
1. lastdong ◴[] No.45109630[source]
In my opinion, we need more models trained on fully traceable and clean data instead of closed models that we later find out were trained on Reddit and Facebook discussion threads.
replies(1): >>45145687 #
2. johntash ◴[] No.45145687[source]
I want to see something trained _only_ on stuff like encyclopedias, programming books, etc. I'm interested in how different it would be compared to something with a lot of social media in it.
replies(1): >>45146549 #
3. ekianjo ◴[] No.45146549[source]
Better to do a fine tune or a LoRA than a full retraining from scratch