Most active commenters

Popular/hot comments

(www.bleepingcomputer.com)

Show context

trompetenaccoun ◴[20 Oct 24 15:33 UTC] No.41895988[source]▶

We need archives built on decentralized storage. Don't get me wrong, I really like and support the work Internet Archive is doing, but preserving history is too important to entrust it solely to singular entities, which means singular points of failure.

replies(19): >>41896170 #>>41896389 #>>41896411 #>>41896420 #>>41897459 #>>41897680 #>>41897913 #>>41898320 #>>41898841 #>>41899160 #>>41899729 #>>41899779 #>>41899999 #>>41900368 #>>41901199 #>>41902340 #>>41904676 #>>41905019 #>>41907926 #

1. __MatrixMan__ ◴[20 Oct 24 18:31 UTC] No.41897459[source]▶

>>41895988 #

To make the web distributed-archive-friendly I think we need to start referencing things by hash and not by a path which some server has implied it will serve consistently but which actually shows you different data at different times for a million different reasons.

If different data always gets a different reference, it's easy to know if you have enough backups of it. If the same name gets you a pile of snapshots taken under different conditions, it's hard to be sure which of those are the thing that we'd want to back up for that particular name.

replies(2): >>41897960 #>>41899538 #

2. Cheer2171 ◴[20 Oct 24 19:42 UTC] No.41897960[source]▶

>>41897459 (TP) #

Done. It is called IPFS. The IA already supports it.

https://github.com/internetarchive/dweb-archive/blob/master/...

replies(3): >>41898278 #>>41898354 #>>41898515 #

3. __MatrixMan__ ◴[20 Oct 24 20:38 UTC] No.41898278[source]▶

>>41897960 #

Right, what I'm saying is that now we need to get the rest of the web (or at least the parts we want to keep) on board.

4. majorchord ◴[20 Oct 24 20:51 UTC] No.41898354[source]▶

>>41897960 #

IPFS has shown that the protocol is fundamentally broken at the level of growth they want to achieve and it is already extremely slow as it is. It often takes several minutes to locate a single file.

replies(2): >>41898517 #>>41898578 #

5. Groxx ◴[20 Oct 24 21:12 UTC] No.41898515[source]▶

>>41897960 #

Which has a rather lengthy section explaining why it's currently a failed experiment: https://github.com/internetarchive/dweb-archive/blob/master/...

(this doc is 5-6 years old though, and I'm not sure what may have changed since then)

In my own (toy-scale) IPFS experiments a couple years ago it has been rather usable, but also the software has been utterly insane for operators and users, and if I were IA I would only consider it if I budgeted for a from-scratch rewrite (of the stuff in use). Nearly uncontrollable and unintrospectable and high resource use for no apparent reason.

6. diggan ◴[20 Oct 24 21:12 UTC] No.41898517{3}[source]▶

>>41898354 #

The beauty is that IA could offer their own distribution of IPFS that uses their own DHT for example, and they could allow only public read access to it. This would solve the slow part of finding a file, for IA specifically. Then the actual transfers tend to be pretty quick with IPFS.

What's the point of using IPFS then? Others can still spread the file elsewhere and verify it's the correct one, by using the exact same ID of the file, although on two different networks. The beauty of content-addressing I guess.

replies(1): >>41898568 #

7. acdha ◴[20 Oct 24 21:20 UTC] No.41898568{4}[source]▶

>>41898517 #

That isn’t solving the problem, it’s just giving them more of it to work on. IA has enough material that I’d be surprised if they didn’t hit IPFS’s design limits on their own, and they’d likely need to change the design in ways which would be hard to get upstream.

8. BlueTemplar ◴[20 Oct 24 21:22 UTC] No.41898578{3}[source]▶

>>41898354 #

Several minutes sounds more than fine for this purpose ?

Especially if it's about having an Internet Archive backup.

replies(1): >>41899135 #

9. Aachen ◴[20 Oct 24 22:52 UTC] No.41899135{4}[source]▶

>>41898578 #

I think the point is that it's already slow at the current amount of data, let alone when you stuff dozens more PB into it

10. jonhohle ◴[21 Oct 24 00:02 UTC] No.41899538[source]▶

>>41897459 (TP) #

There was a startup called Space Monkey that sold NAS drives where you got a portion of the space and the rest was used for copies of other people’s content (encrypted). The idea was you could lose your device, plug in a new one and restore from the cloud. They ended up folding before any of their resilience claims could be tested (at least by me).

Would be people be willing to buy an IA box that hosted a shard of random content along with the things they wanted themselves?

replies(2): >>41899985 #>>41900004 #

11. gruez ◴[21 Oct 24 01:41 UTC] No.41899985[source]▶

>>41899538 #

What happens when the user base explodes (eg. due to this event), and a few months layer they all get bored and drop out?

12. mbirth ◴[21 Oct 24 01:43 UTC] No.41900004[source]▶

>>41899538 #

Does anyone remember wua.la? It worked similar in that you offered local disk space in exchange for cloud storage. It was later bought by LaCie and killed off shortly after.

↑

Internet Archive breached again through stolen access tokens