←back to thread

410 points jjulius | 4 comments | | HN request time: 0.859s | source
1. wg0 ◴[] No.41885204[source]
In all the hype of AI etc, if you think about it then the foundational problem is that even Computer Vision is not a solved problem at the human level of accuracy and that's at the heart of the issue of both Tesla and that Amazon checkout.

Otherwise as thought experiment, imagine just a tiny 1 Inch tall person glued to the grocery trolley and another sitting on each shelf - just these two alone are all you need for "automated checkout".

replies(1): >>41885275 #
2. vineyardmike ◴[] No.41885275[source]
> Otherwise as thought experiment, imagine just a tiny 1 Inch tall person glued to the grocery trolley and another sitting on each shelf - just these two alone are all you need for "automated checkout".

I don’t think this would actually work, as silly a thought experiment as it is.

The problem isn’t the vision, it’s state management and cost. It was very easy (but expensive) to see and classify via CV if a person picked something up, it just requires hundreds of concurrent high resolution streams and a way to stitch the global state from all the videos.

A little 1 inch person on each shelf needs a good way to communicate to every other tiny person what they say, and come to consensus. If 5 people/cameras detect person A picking something up, you need to differentiate between every permutation within 5 discrete actions and 1 seen 5 times.

In case you didn’t know, Amazon actually hired hundreds of people in India to review the footage and correct mistakes (for training the models). They literally had a human on each shelf. And they still had issues with the state management. With people.

replies(1): >>41885317 #
3. wg0 ◴[] No.41885317[source]
Yeah - that's exactly is my point that humans were required to recognize and computer vision is NOT a solved problem regardless of tech bros misleading techno optimism.

Distributed communication and state management on the other hand is a solved problem already mostly with known parameters. How else do you think thousand and thousands of Kubernetes work in the wild.

replies(1): >>41893525 #
4. Schiendelman ◴[] No.41893525{3}[source]
I think you're missing the point GP made: humans couldn't do it. They tried to get humans to do it, and humans had an unacceptable error rate.

This is important. The autonomous driving problem and the grocery store problem are both about trade-offs, one isn't clearly better than the other.