283 points Brajeshwar | 2 comments
iandanforth ◴[] No.45231600[source]
"Google said in a statement: “Quality raters are employed by our suppliers and are temporarily assigned to provide external feedback on our products. Their ratings are one of many aggregated data points that help us measure how well our systems are working, but do not directly impact our algorithms or models.” GlobalLogic declined to comment for this story." (emphasis mine)

How is this not a straight-up lie? For this to be true they would have to throw away labeled training data.

replies(4): >>45231651 #>>45231697 #>>45231758 #>>45232359 #
creddit ◴[] No.45231697[source]
Because they are doing it to compute quality metrics, not to implement RLHF. It’s not training data.
replies(1): >>45233477 #
1. visarga ◴[] No.45233477[source]
Every decision they take based on evals influences the model.
replies(1): >>45234755 #
2. creddit ◴[] No.45234755[source]
/"directly"/