←back to thread

663 points nikisweeting | 1 comments | | HN request time: 0s | source

We've been pushing really hard over the last 6mo to develop this release. I'd love to hear feedback from people who've worked on big plugin systems in the past, or anyone who's tried our betas!
Show context
A4ET8a8uTh0 ◴[] No.41864395[source]
Those additions are welcome, but if I could request one -- I and one that it is very consistently requested -- feature:

- backing up an entire page

Yes, it is hard. Yes, for non-pure html pages is extra kind of painful, but that would honestly making archivebox go from nice to have to.. yes, I have an actual archive I can use when stuff goes down.

replies(1): >>41864814 #
nikisweeting ◴[] No.41864814[source]
Do you mean backing up an entire domain? Like example.com/*

If so that's starting to roll out in v0.8.5rc50, check out the archivebox/crawls/ folder.

If you mean archiving a single page more thoroughly, what do you find is missing in Archivebox? Are you able to get singlefile/chrome/wget html when archiving?

replies(1): >>41864880 #
A4ET8a8uTh0 ◴[] No.41864880[source]
Edit: The first option. ( previous stuff removed )

Lemme check my current version ( edit: 0.7.2 -- ty, I will update and test soon :D)

replies(1): >>41865010 #
nikisweeting ◴[] No.41865010{3}[source]
Ah ok. One caveat: it's only available via the 'archivebox shell' / Python API currently, the CLI & web UIs for full depth crawling will come later.

You can play around with the models and tasks, but I would wait a few weeks for it to stabilize and check again, it's still under heavy active development

Check archivebox/archivebox:dev periodically

replies(1): >>41865214 #
A4ET8a8uTh0 ◴[] No.41865214{4}[source]
No worries. I can do that.

You guys probably hear it all the time, but you are doing lords work. If I thought I could be of use in that project, I would be trying to contribute myself ( in fact, let me see if there a way I can participate in a useful manner ).

replies(1): >>41867160 #
1. nikisweeting ◴[] No.41867160{5}[source]
Thanks! I love working on archiving so far, and it's been very motivating to see more and more people getting into archiving lately.