Most active commenters

macintux(3)
binary132(3)

Popular/hot comments

>>42190678 #

←back to thread

Open Riak – open, modern Riak fork

(github.com)

Show context

tibbar ◴[19 Nov 24 21:09 UTC] No.42188120[source]▶

>>42187766 (OP) #

I've never met an engineering team that used Riak, but it is used heavily as an example technology in Kleppmann's 'Designing Data Intensive Applications'. (I would say, informally, it's usually the example of the "other way" as opposed to other more well-known databases.) This does make me wonder what became of it, why it didn't take off.

replies(8): >>42188162 #>>42188238 #>>42188584 #>>42188647 #>>42188675 #>>42188741 #>>42191594 #>>42191757 #

1. macintux ◴[19 Nov 24 21:24 UTC] No.42188238[source]▶

>>42188120 #

Speaking as a former tech evangelist/engineer at Basho, there were a few significant challenges.

Riak is horribly unfriendly as a database: no SQL, it exposes eventual consistency directly to the developer, it’s relatively slow, and Erlang is a fairly unusual language.

While you can run Riak on a single server, you’d have to really want to.

Its strength is the ability to scale massively, but not many projects need that scale, and by the time you do, you’re probably already using some friendlier database and you’d rather make that one work.

replies(5): >>42188533 #>>42188580 #>>42188614 #>>42188779 #>>42191772 #

2. binary132 ◴[19 Nov 24 21:56 UTC] No.42188533[source]▶

>>42188238 (TP) #

I wonder if some of these issues could be addressed sanely in an extension to the functionality

replies(1): >>42189569 #

3. cmrdporcupine ◴[19 Nov 24 22:02 UTC] No.42188580[source]▶

>>42188238 (TP) #

Thing is, Cassandra became and remained popular, with similar aspects (though in JVM instead of Erlang, so).

Though it had a couple years head start when there really no other options for people wanting that kind of kit.

replies(2): >>42190476 #>>42193171 #

4. btilly ◴[19 Nov 24 22:06 UTC] No.42188614[source]▶

>>42188238 (TP) #

Back in 2011 I was working on a project that involved Riak. The difficulty and slowness for doing stuff corresponding to basic SQL operations was certainly a giant strike against it, and helped sink that project before it was released.

replies(1): >>42189099 #

5. anotherjesse ◴[19 Nov 24 22:28 UTC] No.42188779[source]▶

>>42188238 (TP) #

https://howfuckedismydatabase.com/nosql/ this infamous comic is about riak

replies(1): >>42189635 #

6. p_l ◴[19 Nov 24 23:16 UTC] No.42189099[source]▶

>>42188614 #

> corresponding to basic SQL operations

Ohhh, this brings memories of developers hitting the wall... Between different SQL databases!

Back in 2016 I was delegated at work to do ops on a project that had big data ambitions in Threat Intelligence space.

Part of how they intended to support that was Apache Phoenix, an SQL database backed by HBase, running on top of Hadoop that also provided object storage (annoyingly through WebHDFS gateway).

Constant problems with hung Phoenix queries and instability of Hadoop in entirety led me to propose moving over to PostgreSQL, which generally went quite well... Except several cases of "basic SQL operations" that turned to have wildly different performance compared to Phoenix and most importantly, to MySQL in MyISAM mode, like doing SELECT (*) on huge tables.

Fun times, got to meet a postgres core team member thanks to it.

7. macintux ◴[20 Nov 24 00:24 UTC] No.42189569[source]▶

>>42188533 #

We were working on ways of making it easier (such as CRDTs to reduce the amount of work developers had to do to leverage eventual consistency), but these were pretty challenging problems to solve.

One of our biggest disappointments: we had plans to add a way to enforce strong consistency leveraging (IIRC) something akin to multi-paxos, but couldn't get it to work.

replies(2): >>42190465 #>>42190678 #

8. rubiquity ◴[20 Nov 24 00:35 UTC] No.42189635[source]▶

>>42188779 #

That could also very well be about CouchDB which implemented indexes/views as MapReduce functions.

replies(1): >>42190321 #

9. senderista ◴[20 Nov 24 02:30 UTC] No.42190321{3}[source]▶

>>42189635 #

Back in the day we had a CouchDB MapReduce view (on Cloudant) which took a full month to rebuild (while an angry customer was waiting). The I/O inefficiency was absolutely off the charts.

10. binary132 ◴[20 Nov 24 02:59 UTC] No.42190465{3}[source]▶

>>42189569 #

that sounds like a painful session

11. wbl ◴[20 Nov 24 03:01 UTC] No.42190476[source]▶

>>42188580 #

I feel building a threat intelligence product on Cassandra is a bit on the nose. What's next, calling the TCB Palladium?

12. jtuple ◴[20 Nov 24 03:50 UTC] No.42190678{3}[source]▶

>>42189569 #

TBH, we shipped fully working strong consistency in 2014. It just had a limited feature set, was disabled by default, and was never promoted/marketed since it didn't fit the direction the new CEO/CTO was pushing.

The engineering exodus around that time sorta killed the project though, and we never were able to do the big follow-up work to make it really shine.

(Disclaimer: Former Basho Principal Engineer, primary author of strong consistency work, lead riak_core dev from 2011-2015)

I think another 18 months would have been enough too. But it just wasn't the right environment after the hostile take-over / leadership transition.

replies(4): >>42190710 #>>42190924 #>>42191749 #>>42195459 #

13. macintux ◴[20 Nov 24 03:59 UTC] No.42190710{4}[source]▶

>>42190678 #

My apologies for misremembering. I’m glad you chimed in to correct the record.

14. NickM ◴[20 Nov 24 04:54 UTC] No.42190924{4}[source]▶

>>42190678 #

There were customers happily using strong consistency in production, but somehow the idea that it wasn’t “finished” kept getting repeated over and over by management. I was well on my way to solving the biggest rough edge (tombstone reaping in SC buckets) but then I got pulled off to work on the infamous “data platform” and never got to finish that work :-(

15. masleeds ◴[20 Nov 24 08:13 UTC] No.42191749{4}[source]▶

>>42190678 #

I'm not sure though for how much longer it will continue to make sense for the project as-is to continue to roll riak_ensemble forward as part of future releases. As there are no contributors who have direct direct experience or knowledge of using it in production, so it is hard to claim it as being a supported part of the product in any real sense.

I apologise if we do eventually cut it. Having worked through the code when chasing unstable tests, I developed an appreciation for the quality of the work.

16. mrweasel ◴[20 Nov 24 08:17 UTC] No.42191772[source]▶

>>42188238 (TP) #

I joined a company that had some investment in Basho and managed to sell Riak as the data store for a large client. It never really worked out, not enough of the SREs had be properly trained on Riak, the developers hated it because getting help and support was somewhat difficult, especially in an emergency.

In the end more and more data was offloaded to MariaDB, until one day the last remaining data couldn't justify the cost of the Riak cluster. I think we swapped out an eight node Riak cluster for two largish MariaDB database (one being a hot-standby).

For one of the other clients it was the exact same scenario, only we had been contracted in to help run the Riak cluster, which we didn't do well. Once they had migrate of it, to Oracle I think, the client left.

To me it always felt like it was just the wrong tool for that particular job. Someone really wanted to be able to jump on the NoSQL hype and sell something. They picked Riak, because it honestly looked really good, and probably was, compared to MongoDB, CouchDB or whatever else happened to float around at the time. It just wasn't the right tool for the problems it was applied to.

17. mst ◴[20 Nov 24 12:10 UTC] No.42193171[source]▶

>>42188580 #

I remember somebody whose competence I rate highly talking about ending up picking Cassandra over Riak (at a point where Riak was technically decently mature) simply because of availability of ops expertise - basically his take seemed to be that Basho support was solid but they hadn't succeeded at cultivating people he could *hire* to handle as much as possible in house before calling in the gurus.

(I can't, of course, speak to the truth of this, only that over a couple decades of knowing the dude in question and working with him on and off he had sufficient Clue that I expect he did put in the effort before coming to that conclusion)

18. binary132 ◴[20 Nov 24 16:22 UTC] No.42195459{4}[source]▶

>>42190678 #

If you don’t mind my asking, how did you find working with Erlang in such a context? I’ve always been curious about it, but these days it seems like most people are mostly interested in Elixir for REST APIs and I am content with my existing tools for that purpose. However, there are clearly places where OTP and other Erlang functionality would shine and I’ve always thought it might be fun to keep in my back pocket.

↑