←back to thread

314 points whoishiring | 1 comments | | HN request time: 0.204s | source

Please state the location and include REMOTE for remote work, REMOTE (US) or similar if the country is restricted, and ONSITE when remote work is not an option.

Please only post if you personally are part of the hiring company—no recruiting firms or job boards. One post per company. If it isn't a household name, explain what your company does.

Please only post if you are actively filling a position and are committed to responding to applicants.

Commenters: please don't reply to job posts to complain about something. It's off topic here.

Readers: please only email if you are personally interested in the job.

Searchers: try https://dheerajck.github.io/hnwhoishiring/, http://nchelluri.github.io/hnjobs/, https://hnresumetojobs.com, https://hnhired.fly.dev, https://kennytilton.github.io/whoishiring/, https://hnjobs.emilburzo.com, or this (unofficial) Chrome extension: https://chromewebstore.google.com/detail/hn-hiring-pro/mpfal....

Don't miss this other fine thread: Who wants to be hired? https://news.ycombinator.com/item?id=46108940

1. botglen ◴[] No.46109127[source]
Profitmind | Web Scraping Junior Developer | Remote or Pittsburgh | $90-110k | Full-time | https://www.profitmind.com/

At Profitmind, we're building massive-scale ecommerce datasets to use in AI training/inference, and we need a junior engineer to help develop the scraping infrastructure for product data. You'll be reverse-engineering undocumented APIs, handling anti-bot systems, and dealing with edge cases like pagination limits, rate limiting, and sites that change their protection schemes without warning. It's fun work! The technical side involves analyzing a site's network requests, deobfuscating and reading obfuscated javascript, and implementing simple HTTP request scraping to full browser automation. You'll also work on the infrastructure layer: state management for resumable scrapes, deduplicating products, data integrity, and monitoring systems to detect when sites change.

The work you'll be doing is in the hot path of our company, so the systems you will build need to be performant and maintainable. You should have solid Python skills and experience scraping ecommerce websites and APIs. You should also like the slightly-obsessive investigative nature of the work.

If you worked in scraping or botting in the past, please hit me up!

Reach out directly - gray at netail.ai