←back to thread

176 points GavCo | 1 comments | | HN request time: 0.374s | source

The new Gemini 3 Pro Image model (aka Nano Banana) is incredible at generating slides, so I thought it would be fun to build a CLI tool that lets you edit PDF presentations using plain English. The tool converts the page you want to edit into an image, sends it to the model API together with your prompt to generate an edited image, then converts the updated image back and stitches into the original document.

Examples:

- `nano-pdf edit deck.pdf 5 "Update the revenue chart to show Q3 at $2.5M"`

- `nano-pdf add deck.pdf 15 "Create an executive summary slide with 5 bullet points"`

Features:

- Edit multiple pages in parallel

- Add entirely new slides that match your deck's style

- Google Search enabled by default so the model can look up current data

- Preserves text layer for copy/paste and search

It can work with any kind of PDF but I expect it would be most useful for a quick edit to a deck or something similar.

GitHub: https://github.com/gavrielc/Nano-PDF

Show context
toddmorey ◴[] No.46092476[source]
I thought it was kinda funny that Google Slide’s own built in “beautify this slide” button converts the whole slide into an uneditable image.
replies(1): >>46093662 #
1. albert_e ◴[] No.46093662[source]
AFAIK -- even the "Designer" feature of Microsoft Powerpoint (now folded under Copilot license I believe) gives slide deigns with shapes etc that are not editable. Thankfully the text remains editable. But if we wnat to ever so slightly modify the suggested design my removing or reshaping some if the shapes ... nopes. Feels like they are worried about humans with taste ripping-off the AI output :D