←back to thread

128 points Brajeshwar | 2 comments | | HN request time: 0.415s | source
Show context
zdc1 ◴[] No.42481387[source]
These days whenever I read an interesting article, I will take 2 minutes to copy and paste it into my Obsidian vault under my Articles folder. I'll take care to paste the images as images (and not links) and make sure I've got the author and source URL at the top, and have my separate notes section link to it. It's a bit silly and obsessive, but given how transient content on the Internet is, I think it's necessary to make a copy of anything you care about.
replies(9): >>42481420 #>>42481474 #>>42481634 #>>42481690 #>>42481757 #>>42482021 #>>42482314 #>>42482732 #>>42483499 #
1. Modified3019 ◴[] No.42481690[source]
I use https://github.com/gildas-lormeau/SingleFile

I set it to tolerate longer processing times, and to open the file after saving so I can sanity check that it got everything. Works great at faithfully saving a page with images as it appears in browser, and saves so much time.

You might also have a look at https://github.com/ArchiveBox/ArchiveBox

replies(1): >>42481855 #
2. Modified3019 ◴[] No.42481855[source]
Also, I believe by default the files are saved as plain html (with resources being base64 encoded), so search tools which can index the contents of html files will work.

There is also the option to have the contents compressed, and (a separate option) to keep the plaintext of the file uncompressed, which will likewise still allow indexing to work while saving space.