←back to thread

224 points jamesxv7 | 1 comments | | HN request time: 0s | source

First of all, this is purely a personal learning project for me, aiming to combine three of my passions: photography, software engineering, and my family memories. I have a large collection of family photos and want to build an interactive experience to explore them, ala Google or Apple Photo features.

My goal is to create a system with smart search capabilities, and one of the most important requirements is that it must run entirely on my local hardware. Privacy is key, but the main driver is the challenge and joy of building it myself (an obviously learn).

The key features I'm aiming for are:

Automatic identification and tagging of family members (local face recognition).

Generation of descriptive captions for each photo.

Natural language search (e.g., "Show me photos of us at the beach in Luquillo from last summer").

I've already prompted AI tools for a high-level project plan, and they provided a solid blueprint (eg, Ollama with LLaVA, a vector DB like ChromaDB, you know it). Now, I'm highly interested in the real-world human experience. I'm looking for advice, learning stories, and the little details that only come from building something similar.

What tools, models, and best practices would you recommend for a project like this in 2025? Specifically, I'm curious about combining structured metadata (EXIF), face recognition data, and semantic vector search into a single, cohesive application.

Any and all advice would be deeply appreciated. Thanks!

Show context
crobibero ◴[] No.44426343[source]
I think Immich checks a lot of these

https://immich.app/

replies(5): >>44426505 #>>44426857 #>>44427196 #>>44429603 #>>44434882 #
sircastor ◴[] No.44426857[source]
Immich is what I'm using right now. I'm running it in a Docker container on my Synology. It was very advantageous to spin up another docker container on my laptop to do the face recognition work because the Synology was going to take forever on it.

We no longer are auto uploading to Google or Apple.

So far, I really like it. I haven't quite gone 100%, as we're still uploading with Synology's photo app, but Immich provides a much more refined, featured interface.

replies(3): >>44427168 #>>44427672 #>>44428506 #
old-gregg ◴[] No.44428506[source]
May I ask: why not use Synology's own photo stack? The web UI is pretty good, the iPhone app is great, it runs locally without depending on Synology servers, and does have face recognition and all other features.
replies(3): >>44429245 #>>44429755 #>>44432124 #
1. itsdrewmiller ◴[] No.44429245[source]
Have you tried Immich? It is extremely polished and has every feature you mentioned, along with being open source with tons of community energy and no lock in.