So, I mean, good luck.
>Today, the Python backends to the web services communicate directly with PostgreSQL via SQLAlchemy, but it is my intention to build out experimental replacement backends which are routed through GraphQL instead. This way, the much more performant and robust GraphQL backends become the single source of truth for all information in SourceHut.
I wonder how adding a layer of indirection can significantly improve performance. If I were writing this service, I would go all in on GraphQL and have the frontend talk to the GraphQL services directly, rather than routing requests from Python through a GraphQL service and then, presumably, on to PostgreSQL.
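To make the indirection concrete, here is a rough sketch of the two request paths in Python; the DSN, endpoint URL, and table/field names are placeholders, not SourceHut's actual code:

    import requests
    import sqlalchemy

    engine = sqlalchemy.create_engine("postgresql:///sourcehut")  # placeholder DSN

    def get_repo_direct(name):
        # Today's path: Python talks straight to PostgreSQL via SQLAlchemy.
        with engine.connect() as conn:
            row = conn.execute(
                sqlalchemy.text("SELECT id, name FROM repository WHERE name = :n"),
                {"n": name},
            ).one()
            return dict(row._mapping)

    def get_repo_via_graphql(name):
        # Proposed path: the same lookup is forwarded through a GraphQL
        # service, which presumably queries PostgreSQL itself -- one extra hop.
        resp = requests.post(
            "http://localhost:5001/query",  # placeholder endpoint
            json={
                "query": "query($n: String!) { repository(name: $n) { id name } }",
                "variables": {"n": name},
            },
        )
        return resp.json()["data"]["repository"]

It's hard to see where a performance win comes from in the second path unless the GraphQL service adds caching or batching that the Python layer lacks.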
Perhaps I am missing something. Indeed good luck to Drew here.
But claiming some nebulous backend that's more performant and robust than Postgres is like, WTF? Are you using an actual graph database like Neo4j? Are you putting a GraphQL frontend on Postgres, like PostGraphQL? Much of the post doesn't really make sense, because GraphQL is a query language, not a data store. What are the CAP theorem tradeoffs in the new backend? What does "more robust" mean? What does "more performant" mean? This is a source control app; those tradeoffs are meaningful.
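For what it's worth, the query-language-vs-data-store distinction is easy to see in code: a GraphQL schema is just typed fields plus resolvers, and the resolvers still have to fetch from some actual store. A toy sketch using the graphene library (hypothetical types, not SourceHut's schema):

    import graphene

    class Repository(graphene.ObjectType):
        id = graphene.Int()
        name = graphene.String()

    class Query(graphene.ObjectType):
        repository = graphene.Field(Repository, name=graphene.String(required=True))

        def resolve_repository(root, info, name):
            # The storage layer still has to exist somewhere -- a SQL
            # query, Neo4j, whatever. GraphQL only shapes the API.
            return Repository(id=1, name=name)  # stub instead of a real lookup

    schema = graphene.Schema(query=Query)
    result = schema.execute('{ repository(name: "example") { id name } }')
    print(result.data)  # {'repository': {'id': 1, 'name': 'example'}}

Whatever sits behind the resolvers is where the CAP tradeoffs live, and the post never says what that is.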
There seems to be a lot of conflation of API design, data stores, and core programming tools, all mixed into a big post that mostly sounds to me like, "I don't get how to make this (extremely popular and well-known platform that drives many websites 10000x my size) work well, so I'm trying something different that sounds cool."
Which, again, the author has always said this is an experiment, and that's cool. But the conceptual confusion in the post makes me think that moving away from boring tools and trying new tools is not going to end up going well.
But this is a source control app, and it's hopefully backed up somewhere besides sourcehut, so it should be fine if he needs to backtrack.
Python's GIL-bound execution model makes it difficult to respond to small queries quickly while simultaneously serving large, time-consuming ones (e.g. git operations). You can get around this using worker queues that hand off to separate interpreter processes and an async design, or otherwise splitting your workload up... or you can use a language where "have a threadpool" is actually a properly supported concept, and an architecture where sharding the git/email/etc. backends is feasible.
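A minimal sketch of that worker-queue workaround, assuming a threaded WSGI-style server; the job and handler names are hypothetical:

    import concurrent.futures
    import subprocess

    # Separate interpreter processes sidestep the GIL: the serving process
    # stays free for small, fast queries while heavy jobs run elsewhere.
    pool = concurrent.futures.ProcessPoolExecutor(max_workers=4)

    def render_large_diff(repo_path, commit):
        # Hypothetical heavy job: shelling out to git for a big diff.
        out = subprocess.run(
            ["git", "-C", repo_path, "show", "--stat", commit],
            capture_output=True, text=True, check=True,
        )
        return out.stdout

    def handle_diff_request(repo_path, commit):
        # Only this request blocks on the future; other server threads
        # keep answering quick queries in the meantime.
        return pool.submit(render_large_diff, repo_path, commit).result()

This works, but it's exactly the kind of plumbing you get for free in a language with real threads.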