←back to thread

Datadog's $65M/year customer mystery solved

(blog.pragmaticengineer.com)
151 points thunderbong | 7 comments | | HN request time: 0.824s | source | bottom
1. decimalenough ◴[] No.44427835[source]
> Assume that Datadog cuts the number of outages by half, by preventing them with early monitoring. That would mean that without Datadog, we’d look at 24 hours’ worth of downtime, not 12. Let’s also assume that using Datadog results in mitigating outages 50% faster than without - thanks to being able to connect health metrics with logs, debug faster, pinpoint the root cause and mitigate faster. In that case, without Datadog, we could be looking at 36 hours worth of total downtime, versus the 12 hours with Datadog. To put it in numbers: the company would make around $9M in revenue it would otherwise lose, Now that $10M/year fee practically pays for itself!

Those are some pretty heroic assumptions. In particular, they assume the only options are Datadog or nothing, when there are far cheaper alternatives like the Prometheus/Grafana/Clickhouse stack mentioned in the article itself.

replies(3): >>44428165 #>>44428684 #>>44428884 #
2. passivepinetree ◴[] No.44428165[source]
Another assumption that bothers me here is that the $9M in revenue would be completely lost during an outage. I imagine many customers would simply wait until the outage was resolved before performing their intended transactions, meaning far less than $9M would be lost.
replies(1): >>44428420 #
3. calt ◴[] No.44428420[source]
On the other hand, customers can become frustrated at being unable to trade when they need during an outage to and go to a competitor.
4. secondcoming ◴[] No.44428684[source]
We are moving from Datadog to Prometheus/Grafana and it's really not all a bed of roses. You'll need monitoring on your monitoring.
replies(2): >>44433452 #>>44441703 #
5. vjvjvjvjghv ◴[] No.44428884[source]
I bet they would get much better results if they spent a fraction of the money to better understand their systems and designing them better than spending millions on Datadog
6. disgruntledphd2 ◴[] No.44433452[source]
Ideally you want two independent monitors for critical things.
7. hagen1778 ◴[] No.44441703[source]
Ofc you need to monitor your monitoring, because you run it. Datadog runs their own systems and monitors them, that's why they charge you so much. I barely can imagine a criticial piece of software that I need to run and not monitor it in the same time.