283 points | Brajeshwar | 1 comment
cs702 ◴[] No.45231366[source]
The title is biased, blaming Google for mistreating people and implying that Google's AI isn't smart, but the OP is worth reading, because it gives readers a sense of the labor and cost involved in providing AI models with human feedback, the HF in RLHF, to ensure they behave in ways acceptable to human beings, more aligned with human expectations, values, and preferences.
replies(6): >>45231394 #>>45231412 #>>45231441 #>>45231748 #>>45231773 #>>45233975 #
1. NewEntryHN ◴[] No.45233975[source]
Isn't that mostly the fine-tuning phase, with RLHF being the cherry on top?