←back to thread

145 points jasondavies | 8 comments | | HN request time: 0.913s | source | bottom
1. S0y ◴[] No.41368437[source]
This is really awesome. A question for someone who knows more about this: How much harder would it be to make this work using any number of photos? I'm assuming this is the end goal for a model like this.

Imagine being able to create an accurate enough 3D rendering of any interior with just a bunch of snapshots anyone can take with their phone.

replies(2): >>41368608 #>>41368627 #
2. dagmx ◴[] No.41368608[source]
That’s already how Gaussian splats work.

They’re novelty of splattr (though I contest that they’re the first to do so) is that they need fewer images than usual.

replies(2): >>41368661 #>>41368790 #
3. Arkanum ◴[] No.41368627[source]
Probably not much harder, but you wouldn't get the same massive jump in quality that you get going from 1 image to 2. NeRF/Gaussian Splatting in general is what you're describing, but from the looks of it, this just does it in a single forward pass rather than optimising the gaussian/network weights.
4. Arkanum ◴[] No.41368661[source]
I think the novelty is that they don't have to optimise the splats at all, they're directly predicted in a single forward pass.
replies(1): >>41372531 #
5. GaggiX ◴[] No.41368790[source]
The novelty here is that it does work on uncalibrated images.
replies(2): >>41369005 #>>41372536 #
6. milleramp ◴[] No.41369005{3}[source]
Not really, it is using Mast3r to determine camera poses.
7. dagmx ◴[] No.41372531{3}[source]
That’s not really novel either imho, though google search is escaping me on the specific papers I saw at siggraph.

Imho it’s an interesting combination of technologies but not novel in an off itself.

8. dagmx ◴[] No.41372536{3}[source]
A lot of splats systems do work on uncalibrated images so that’s not novel either. They all just do a camera solve, which arguable isn’t terrible for a stereo pair with low divergence.