176 points GavCo | 4 comments

The new Gemini 3 Pro Image model (aka Nano Banana) is incredible at generating slides, so I thought it would be fun to build a CLI tool that lets you edit PDF presentations using plain English. The tool converts the page you want to edit into an image, sends it to the model API along with your prompt to generate an edited image, then converts the edited image back into a PDF page and stitches it into the original document.
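As a rough illustration of that round trip (a sketch under stated assumptions, not the tool's actual code), the flow can be written with the rendering, model call, and conversion steps passed in as placeholder functions. Every name here (`edit_page`, `render`, `call_model`, `to_pdf_page`) is hypothetical, pages are modelled as plain byte strings rather than real PDF objects, and page numbers are assumed to be 1-based as in the examples.

```python
from typing import Callable, List


def edit_page(
    pages: List[bytes],
    page_number: int,
    prompt: str,
    render: Callable[[bytes], bytes],
    call_model: Callable[[bytes, str], bytes],
    to_pdf_page: Callable[[bytes], bytes],
) -> List[bytes]:
    """Return a copy of `pages` with one page replaced by its edited version.

    All three callables are hypothetical stand-ins:
      render       turns a PDF page into an image
      call_model   sends image + prompt to the image model, returns a new image
      to_pdf_page  turns the edited image back into a PDF page
    """
    # Assumption: page numbers are 1-based, as in `nano-pdf edit deck.pdf 5 ...`.
    if not 1 <= page_number <= len(pages):
        raise ValueError(f"page {page_number} is out of range 1..{len(pages)}")

    index = page_number - 1
    image = render(pages[index])              # page -> image
    edited_image = call_model(image, prompt)  # image + prompt -> edited image
    new_page = to_pdf_page(edited_image)      # edited image -> page

    # Stitch the new page into a copy so the original list is left untouched.
    result = list(pages)
    result[index] = new_page
    return result
```

Passing the three steps in as functions keeps the sketch independent of any particular PDF library or model SDK; a real implementation would plug in concrete ones.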

Examples:

- `nano-pdf edit deck.pdf 5 "Update the revenue chart to show Q3 at $2.5M"`

- `nano-pdf add deck.pdf 15 "Create an executive summary slide with 5 bullet points"`

Features:

- Edit multiple pages in parallel

- Add entirely new slides that match your deck's style

- Google Search enabled by default so the model can look up current data

- Preserves text layer for copy/paste and search
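
One plausible way to get the parallel-edit behaviour, sketched here as an assumption rather than a description of how the tool does it, is to submit each page's edit to a thread pool and write the results back by page number. `edit_pages_in_parallel` and `edit_one` are hypothetical names; `edit_one` stands in for the whole single-page round trip, and threads are chosen on the assumption that the work is dominated by waiting on the model API.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List


def edit_pages_in_parallel(
    pages: List[bytes],
    edits: Dict[int, str],
    edit_one: Callable[[bytes, str], bytes],
    max_workers: int = 4,
) -> List[bytes]:
    """Apply one prompt per page concurrently and return the updated page list.

    `edits` maps a 1-based page number to its prompt. `edit_one` is a
    hypothetical function that takes a page and a prompt and returns the
    edited page.
    """
    for number in edits:
        if not 1 <= number <= len(pages):
            raise ValueError(f"page {number} is out of range 1..{len(pages)}")

    result = list(pages)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit every edit up front so the requests overlap.
        futures = {
            number: pool.submit(edit_one, pages[number - 1], prompt)
            for number, prompt in edits.items()
        }
        # Collect by page number, so completion order does not matter.
        for number, future in futures.items():
            result[number - 1] = future.result()
    return result
```

Pages without an entry in `edits` are carried over unchanged, and an exception raised by any single edit propagates from `future.result()`.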

It can work with any kind of PDF but I expect it would be most useful for a quick edit to a deck or something similar.

GitHub: https://github.com/gavrielc/Nano-PDF

1. iamflimflam1 No.46093381
The lack of examples makes me very reluctant to commit any time to trying this out - despite it being something that I’m interested in.

Has anyone given it a go? Does it work?

replies(1): >>46093444 #
2. stingraycharles No.46093444
What? There are examples in the repo and even in OP’s post.

I haven’t tried it, but there are plenty of examples.

replies(1): >>46093638 #
3. albert_e No.46093638
Do you mean example commands? We see those examples in the GitHub README, yes.

But people here are probably also looking for example input and output PDFs (or images/screenshots) showing the actual work done to get a sense of what to expect.

replies(1): >>46093757 #
4. iamflimflam1 No.46093757
Exactly - if these examples work really well, then include some screenshots.