(rubenerd.com)

158 points Brajeshwar | 2 comments | 17 Dec 24 14:31 UTC | HN request time: 0.423s | source

Show context

zdc1 ◴[21 Dec 24 18:45 UTC] No.42481387[source]▶

These days whenever I read an interesting article, I will take 2 minutes to copy and paste it into my Obsidian vault under my Articles folder. I'll take care to paste the images as images (and not links) and make sure I've got the author and source URL at the top, and have my separate notes section link to it. It's a bit silly and obsessive, but given how transient content on the Internet is, I think it's necessary to make a copy of anything you care about.

replies(9): >>42481420 #>>42481474 #>>42481634 #>>42481690 #>>42481757 #>>42482021 #>>42482314 #>>42482732 #>>42483499 #

1. Modified3019 ◴[21 Dec 24 19:36 UTC] No.42481690[source]▶

>>42481387 #

I use https://github.com/gildas-lormeau/SingleFile

I set it to tolerate longer processing times, and to open the file after saving so I can sanity check that it got everything. Works great at faithfully saving a page with images as it appears in browser, and saves so much time.

You might also have a look at https://github.com/ArchiveBox/ArchiveBox

replies(1): >>42481855 #

2. Modified3019 ◴[21 Dec 24 19:59 UTC] No.42481855[source]▶

>>42481690 (TP) #

Also, I believe by default the files are saved as plain html (with resources being base64 encoded), so search tools which can index the contents of html files will work.

There is also the option to have the contents compressed, and (a separate option) to keep the plaintext of the file uncompressed, which will likewise still allow indexing to work while saving space.

↑

Is stuff online worth saving?