Most active commenters

akho(9)
prophesi(5)

Ask HN: What's the 2025 stack for a self-hosted photo library with local AI?

First of all, this is purely a personal learning project for me, aiming to combine three of my passions: photography, software engineering, and my family memories. I have a large collection of family photos and want to build an interactive experience to explore them, ala Google or Apple Photo features.

My goal is to create a system with smart search capabilities, and one of the most important requirements is that it must run entirely on my local hardware. Privacy is key, but the main driver is the challenge and joy of building it myself (an obviously learn).

The key features I'm aiming for are:

Automatic identification and tagging of family members (local face recognition).

Generation of descriptive captions for each photo.

Natural language search (e.g., "Show me photos of us at the beach in Luquillo from last summer").

I've already prompted AI tools for a high-level project plan, and they provided a solid blueprint (eg, Ollama with LLaVA, a vector DB like ChromaDB, you know it). Now, I'm highly interested in the real-world human experience. I'm looking for advice, learning stories, and the little details that only come from building something similar.

What tools, models, and best practices would you recommend for a project like this in 2025? Specifically, I'm curious about combining structured metadata (EXIF), face recognition data, and semantic vector search into a single, cohesive application.

Any and all advice would be deeply appreciated. Thanks!

Show context

mossTechnician ◴[30 Jun 25 18:23 UTC] No.44426333[source]▶

>>44426233 (OP) #

This may not interest you, but Ente checks most of these boxes for me. It has face recognition and AI-based object search out of the box, and you can self-host their open-source server without any restrictions. The models they used might be useful for your project.

replies(3): >>44426503 #>>44426975 #>>44428905 #

1. akho ◴[30 Jun 25 19:30 UTC] No.44426975[source]▶

>>44426333 #

The Ente self-hosting proposition seems strange. Why would I want to e2e encrypt my photos that I self-host? Sounds like it will only make life more difficult.

replies(5): >>44427017 #>>44429002 #>>44429476 #>>44430189 #>>44430219 #

2. mossTechnician ◴[30 Jun 25 19:36 UTC] No.44427017[source]▶

>>44426975 (TP) #

1. "Self-hosted" doesn't always mean "on your own hardware." Some people rent VPSes. This helps keep their data safe.

2. The software is provided without modification; I think it would be stranger to remove the encryption.

replies(2): >>44428598 #>>44432893 #

3. idatum ◴[30 Jun 25 22:26 UTC] No.44428598[source]▶

>>44427017 #

> Some people rent VPSes. This helps keep their data safe.

This is exactly how I self-host Ente and it has been great.

Machine leaning for image detection has worked really well for me, especially facial recognition for family members (easy to find that photo to share).

I have the client on my Android mobile, Fire tablet (via F-Droid), and my Windows laptop.

My initial motivation was to replace "cloud" storage for getting photos copied off the phone as soon as possible.

4. ibizaman ◴[30 Jun 25 23:26 UTC] No.44429002[source]▶

>>44426975 (TP) #

You may want to self-host for your family or close friends while guaranteeing them privacy.

replies(1): >>44432862 #

5. freehorse ◴[01 Jul 25 00:44 UTC] No.44429476[source]▶

>>44426975 (TP) #

Because you want to access your photos remotely, or give access to more people to certain albums. If the point is to just store them locally and no remote access is needed, a hard drive would probably be enough.

replies(1): >>44432870 #

6. zzyzxd ◴[01 Jul 25 03:04 UTC] No.44430189[source]▶

>>44426975 (TP) #

e2ee makes it easier to sell their hosted version, and there's probably not enough incentive to justify the additional overhead of having an unencrypted option.

Also, my house is less secure than commercial data centers, so e2ee gives me greater peace of mind about data safety.

replies(1): >>44432837 #

7. prophesi ◴[01 Jul 25 03:12 UTC] No.44430219[source]▶

>>44426975 (TP) #

If there's a server involved, there's no reason not to have sensitive files and information end-to-end encrypted, whether self-hosting or not.

replies(1): >>44432849 #

8. akho ◴[01 Jul 25 11:34 UTC] No.44432837[source]▶

>>44430189 #

> Also, my house is less secure than commercial data centers, so e2ee gives me greater peace of mind about data safety.

I think you overestimate security of data centers.

At rest, you use full-disk encryption anyway, so the extra layer just makes things harder.

9. akho ◴[01 Jul 25 11:36 UTC] No.44432849[source]▶

>>44430219 #

You do want to have things encrypted in transit and at rest. e2ee means server admins (I) cannot access the user's (mine) photos.

replies(1): >>44435005 #

10. akho ◴[01 Jul 25 11:37 UTC] No.44432862[source]▶

>>44429002 #

I'd prefer to guarantee they don't lose access, despite their key management practices.

replies(1): >>44473745 #

11. akho ◴[01 Jul 25 11:38 UTC] No.44432870[source]▶

>>44429476 #

That's why you need a server. e2ee does not help with any of that.

12. akho ◴[01 Jul 25 11:41 UTC] No.44432893[source]▶

>>44427017 #

TB-scale VPSes are not economical vs a home NAS. I see how that can be useful for smaller collections, though.

13. prophesi ◴[01 Jul 25 15:38 UTC] No.44435005{3}[source]▶

>>44432849 #

The server admin can still access their own photos via the client. They wouldn't be able to access the photos of other users.

edit: To explain further why it's almost always desirable:

You guarantee that you and your users' information is safe if the server is compromised, if an admin goes rogue, or if local bodies of power request their information from you.

The information can't be sent to third-parties by design.

Any operations / transformations that need to be applied to the information will have to either be done via homomorphic encryption or on the client-side (which is much more likely to be open source / easy-to-deobfuscate compared to blackbox server code).

replies(1): >>44436480 #

14. akho ◴[01 Jul 25 17:58 UTC] No.44436480{4}[source]▶

>>44435005 #

I understand what e2ee is, thank you. I just don't think it’s justified for self-hosted photo servers.

E. g., “Any operations / transformations” includes facial recognition, CLIP embeddings, &c; you want to run this on the server, overnight, and to be able to re-run at a later date when new models become available. Under e2ee, that’s a round-trip through a client device at every model update. So that’s a significant downside, for no important upsides in the case when you and your family are the only users.

replies(1): >>44436643 #

15. prophesi ◴[01 Jul 25 18:14 UTC] No.44436643{5}[source]▶

>>44436480 #

I was explaining why e2ee has important upsides, not how e2ee works. With Ente (and I think Immich as well), facial recognition and generating new CLIP embeddings are done on-device[0], usually right when the photo is taken / before they're uploaded to the server.

[0] https://ente.io/blog/image-search-with-clip-ggml/

replies(1): >>44438033 #

16. akho ◴[01 Jul 25 21:09 UTC] No.44438033{6}[source]▶

>>44436643 #

Immich does it on the server.

What happens if there’s a new, better model? You’d need to re-download, decrypt, and run inference on all your past media, which is in terabytes for many.

I understand the benefit of e2ee in a situation where there is no trust between user and admin. In personal self-hosting, that’s the same person (or family), and the upsides are not as relevant. The downsides (possibility of data loss for, e. g., kids who are not very good with passwords/keys; difficulties with updating models / thumbs; …) remain important, and outweigh the benefits, even assuming the e2ee is implemented well.

replies(1): >>44440065 #

17. prophesi ◴[02 Jul 25 03:40 UTC] No.44440065{7}[source]▶

>>44438033 #

You do you, but the trust is beyond just admin and users. And family photos are treated as treasures. Data loss is a fair point, but if you're self-hosting a photos app I imagine server/db backups are part of your routine. Account recovery is all that's needed to recover lost photos from there. Well, unless your VPS is compromised in a manner of data loss for longer than you wished before your backups ran, in which case it's still better that such sensitive info was e2ee'd.

edit: also feel like I'm echoing the classic dropbox comment, but self-hosting in a sane and secure manner is harder than it's made out to be. It needs to be taken seriously.

replies(1): >>44440609 #

18. akho ◴[02 Jul 25 05:52 UTC] No.44440609{8}[source]▶

>>44440065 #

e2ee prevents account recovery.

replies(1): >>44446927 #

19. prophesi ◴[02 Jul 25 18:05 UTC] No.44446927{9}[source]▶

>>44440609 #

People have found decent solutions for that. Proton's is essentially a backup password/phrase or a file you keep safe. Not as simple as a magic link, and could still lose your backup phrase/file, but alas. Security is always a compromise on convenience.

[0] https://proton.me/blog/data-recovery-end-to-end-encryption

20. ibizaman ◴[05 Jul 25 16:21 UTC] No.44473745{3}[source]▶

>>44432862 #

That’s a very good point. For a long time I was advocating for self-hosting for increasing one’s privacy, but I always was hitting the “I’ve got nothing to hide” wall. Now, the concern is losing access to your data. What do you do if you’re kicked out of your email account?

↑