←back to thread

283 points Brajeshwar | 2 comments | | HN request time: 0s | source
Show context
cjbarber ◴[] No.45232433[source]
I previously made a list on twitter of some data labeling startups that work with foundation model companies.[1] Here's the RLHF provider section:

RLHF providers:

1. Surge. $1b+ revenue bootstrapped. DataAnnotation is the worker-side (you might've seen their ads), also TaskUp and Gethybrid.

2. Scale. The most well known. Remotasks and Outlier are the worker-side

3. Invisible. Started as a kind of managed VA service.

4. Mercor. Started mostly as a way to hire remote devs I think.

5. Handshake AI. Handshake is a college hiring network. This is a spinout

6. Pareto

7. Prolific

8. Toloka

9. Turing

10. Sepal AI. The team is ex-Turing

11. Datacurve. Coding data.

12. Snorkel. Started as a software platform for data labeling. Offers some data as a service now.

13. Micro1. Also started as a way to hire remote contractor devs

[1]: https://x.com/chrisbarber/status/1965096585555272072

replies(1): >>45232997 #
1. echelon ◴[] No.45232997[source]
This is great!

Are there companies that focus on labeling of inputs rather than RLHF of outputs?

replies(1): >>45233242 #
2. cjbarber ◴[] No.45233242[source]
Yes, there are quite a few that do that. Appen, iMerit, TELUS, etc. Also Scale AI started focused on input annotation I think for self driving.