←back to thread

73 points ajhool | 3 comments | | HN request time: 0.637s | source

It's common to see here that Postgres hosted in RDS can handle 99% of workloads up to millions of users. I'm building an IoT app with a plan to ingest the IoT traffic into dynamo partitioned on user id (I'm quite familiar with the tradeoffs) and everything else be in Postgres. A few services but not microservice (basically: core service, identity service, IoT data service, notification service). Ingesting and monitoring about 1,000,000 IoT devices daily (1 packer per device per day) and about 1,000,000 users with only 5,000 active users per day (basically we monitor user IoT devices 24/7 but only some 5,000 users will have anomalous results and log in).

In the database posts & discussions here I sometimes find that the opinions are strong but the numbers are missing. Obviously applications have wide variation in traffic and query complexity so apples to apples comparisons are hard. Still, I would greatly benefit from hearing some real world experiences with numbers.

Rough approximation database questions for current or prior applications:

1. How many customers do you have?

2. What's expected daily traffic? Peak traffic?

3. What database engine or engines do you use?

4. How many rows or how much storage does your db have?

5. What else about your application is relevant for database load?

6. Microservice, Service, or monolith. Happy with it?

Show context
dharmab ◴[] No.43367010[source]
Asking about customers is the wrong question.

1. It's often information that cannot be casually shared for legal reasons (MNPI)

2. A single customer might generate many queries. There have been times where a single one of my employer's customers generates more traffic than most companies will ever reach at peak.

replies(1): >>43367699 #
1. ajhool ◴[] No.43367699[source]
Fair. Please interpret as queries rather than customers.
replies(1): >>43368346 #
2. dharmab ◴[] No.43368346[source]
This is also the wrong question because the load of a query can vary drastically based on the query's structure/complexity, the size and types of the data, and which indexes and tables are used.

The right question is - discuss a problem you'd like to solve using PostgreSQL and find people who have solved similar problems.

replies(1): >>43368389 #
3. ajhool ◴[] No.43368389[source]
Thank you for pointing that out, I agree. I'm just looking for peoples' experiences with their applications, which of course all have different workloads.

People did get the spirit of the question and have provided helpful responses, including context about their query structure/complexity.