←back to thread

Use One Big Server (2022)

(specbranch.com)
343 points antov825 | 5 comments | | HN request time: 0.001s | source
Show context
talles ◴[] No.45085392[source]
Don't forget the cost of managing your one big server and the risk of having such single point of failure.
replies(8): >>45085441 #>>45085488 #>>45085534 #>>45085637 #>>45086579 #>>45088964 #>>45090596 #>>45091993 #
Puts ◴[] No.45085534[source]
My experience after 20 years in the hosting industry is that customers in general have more downtime due to self-inflicted over-engineered replication, or split brain errors than actual hardware failures. One server is the simplest and most reliable setup, and if you have backup and automated provisioning you can just re-deploy your entire environment in less than the time it takes to debug a complex multi-server setup.

I'm not saying everybody should do this. There are of-course a lot of services that can't afford even a minute of downtime. But there is also a lot of companies that would benefit from a simpler setup.

replies(7): >>45085607 #>>45085628 #>>45085635 #>>45086355 #>>45088375 #>>45088512 #>>45091645 #
motorest ◴[] No.45085628[source]
> My experience after 20 years in the hosting industry is that customers in general have more downtime due to self-inflicted over-engineered replication, or split brain errors than actual hardware failures.

I think you misread OP. "Single point of failure" doesn't mean the only failure modes are hardware failures. It means that if something happens to your nodes whether it's hardware failure or power outage or someone stumbling on your power/network cable, or even having a single service crashing, this means you have a major outage on your hands.

These types of outages are trivially avoided with a basic understanding of well-architected frameworks, which explicitly address the risk represented by single points of failure.

replies(1): >>45086005 #
fogx ◴[] No.45086005{3}[source]
don't you think it's highly unlikely that someone will stumble over the power cable in a hosted datacenter like hetzner? and even if, you could just run a provisioned secondary server that jumps in if the first becomes unavailable and still be much cheaper.
replies(3): >>45086298 #>>45086456 #>>45089501 #
1. motorest ◴[] No.45089501{4}[source]
> don't you think it's highly unlikely that someone will stumble over the power cable in a hosted datacenter like hetzner?

You're not getting the point. The point is that if you use a single node to host your whole web app, you are creating a system where many failure modes, which otherwise could not even be an issue, can easily trigger high-severity outages.

> and even if, you could just run a provisioned secondary server (...)

Congratulations, you are no longer using "one big server", thus defeating the whole purpose behind this approach and learning the lesson that everyone doing cloud engineering work is already well aware.

replies(1): >>45090616 #
2. juped ◴[] No.45090616[source]
Do you actually think dead simple failover is comparable to elastic kubernetes whatever?
replies(1): >>45091327 #
3. motorest ◴[] No.45091327[source]
> Do you actually think dead simple failover is comparable to elastic kubernetes whatever?

References to "elastic Kubernetes whatever" is a red herring. You can have a dead simple load balancer spreading traffic across multiple bare metal nodes.

replies(1): >>45092123 #
4. juped ◴[] No.45092123{3}[source]
Thanks for switching sides to oppose yourself, I guess?
replies(1): >>45099782 #
5. motorest ◴[] No.45099782{4}[source]
> Thanks for switching sides to oppose yourself, I guess?

I'm baffled by your comment. Are you sure you read what I wrote?