I previously made a list on twitter of some data labeling startups that work with foundation model companies.[1] Here's the RLHF provider section:
RLHF providers:
1. Surge. $1b+ revenue bootstrapped. DataAnnotation is the worker-side (you might've seen their ads), also TaskUp and Gethybrid.
2. Scale. The most well known. Remotasks and Outlier are the worker-side
3. Invisible. Started as a kind of managed VA service.
4. Mercor. Started mostly as a way to hire remote devs I think.
5. Handshake AI. Handshake is a college hiring network. This is a spinout
6. Pareto
7. Prolific
8. Toloka
9. Turing
10. Sepal AI. The team is ex-Turing
11. Datacurve. Coding data.
12. Snorkel. Started as a software platform for data labeling. Offers some data as a service now.
13. Micro1. Also started as a way to hire remote contractor devs
replies(1):