Scaling PostgreSQL to power 800 million ChatGPT users

2026.01.24
Service · by 이호민
#PostgreSQL #Scalability #Database #OpenAI #HighAvailability

Key Points

  1. OpenAI has successfully scaled a single Azure PostgreSQL primary instance and nearly 50 read replicas to handle millions of queries per second for 800 million ChatGPT users, proving its viability for massive read-heavy workloads.
  2. This was achieved through rigorous optimizations, including offloading write-heavy workloads to sharded systems, aggressive query tuning, workload isolation, connection pooling with PgBouncer, and a cache locking mechanism to prevent overload.
  3. Future plans involve migrating more challenging write-heavy workloads, implementing cascading replication to scale beyond 50 replicas, and exploring sharded PostgreSQL to ensure continued growth and stability.

OpenAI has successfully scaled PostgreSQL to handle read-heavy workloads for 800 million ChatGPT users, processing millions of queries per second (QPS) with a single primary Azure PostgreSQL flexible server instance and nearly 50 read replicas across multiple global regions. This achievement, detailed in their engineering blog post, challenges previous perceptions of PostgreSQL's scalability for massive read-heavy demands, demonstrating its capability to support high traffic while maintaining low latency and high availability.

The core methodology is a multi-pronged approach to PostgreSQL's inherent limitations and operational challenges. While PostgreSQL excels at read-heavy workloads, its Multi-Version Concurrency Control (MVCC) implementation complicates write-heavy loads, causing write/read amplification, table and index bloat, and autovacuum overhead. To mitigate these, OpenAI combines offloading with rigorous optimization:

  1. Workload Migration and Write Pressure Reduction:
    • Strategic Sharding: Shardable, write-heavy workloads are progressively migrated to horizontally partitioned systems like Azure Cosmos DB. This ensures that new tables and high-volume writes bypass the monolithic PostgreSQL primary.
    • Application-Level Optimization: Application logic is aggressively optimized to minimize unnecessary writes (e.g., fixing redundant writes, implementing lazy writes to smooth traffic spikes) and rate-limiting during large data backfills.
    • Primary Load Minimization: The primary instance is dedicated primarily to writes and transactional reads, with most read traffic offloaded to replicas.
  2. Query Optimization:
    • Complex Query Refactoring: Expensive multi-table joins are actively avoided. If necessary, complex join logic is moved from the database to the application layer.
    • ORM Review: SQL generated by Object-Relational Mapping (ORM) frameworks is carefully reviewed and optimized.
    • Idle Query Management: idle_in_transaction_session_timeout is configured to prevent long-running idle queries from blocking critical database processes like autovacuum.
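The idea of moving join logic out of the database can be sketched as follows: fetch each table with its own simple single-table query, then combine the rows with a hash map in the application. This is a minimal illustration; the field names (`id`, `user_id`, `name`) are assumed for the example, not taken from OpenAI's schema.

```python
def app_side_join(users, orders):
    """Join two independently fetched result sets in the application.

    Each list stands in for the rows of a separate single-table query,
    replacing an expensive multi-table join on the database side.
    """
    by_user = {u["id"]: u for u in users}  # build a hash index once
    joined = []
    for o in orders:
        u = by_user.get(o["user_id"])
        if u is not None:  # inner-join semantics: drop unmatched orders
            joined.append({**o, "user_name": u["name"]})
    return joined
```

The hash-map lookup keeps the application-side join at O(n + m), at the cost of transferring both result sets over the network.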
  3. High Availability and Fault Tolerance:
    • Primary Single Point of Failure (SPOF) Mitigation: The primary runs in High-Availability (HA) mode with a hot standby, enabling rapid failover and minimizing downtime for write operations. Offloading critical reads to replicas ensures read availability even if the primary fails.
    • Read Replica Redundancy: Multiple read replicas with sufficient capacity headroom are deployed in each region to absorb single replica failures without regional outages.
  4. Workload Isolation:
    • Tiered Request Routing: Requests are categorized into low-priority and high-priority tiers and routed to separate PostgreSQL instances. This isolates resource-intensive "noisy neighbor" workloads, preventing them from degrading the performance of critical services. This strategy extends across different products and services.
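Tiered routing can be sketched as a simple lookup from request tier to instance; the endpoints and tier names below are purely illustrative, not OpenAI's actual topology.

```python
# Hypothetical instance endpoints; names are illustrative only.
INSTANCES = {
    "high": "pg-critical.internal:5432",  # latency-sensitive, critical services
    "low": "pg-batch.internal:5432",      # batch jobs and other noisy neighbors
}

def route(request_tier):
    """Send high-priority traffic to its own instance; everything else
    (including unknown tiers) goes to the low-priority instance."""
    return INSTANCES["high" if request_tier == "high" else "low"]
```

Because each tier has its own instance, a surge of expensive low-priority queries cannot starve the critical path.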
  5. Connection Management:
    • Connection Pooling: PgBouncer is deployed as a proxy layer, operating in statement or transaction pooling mode, to efficiently reuse database connections. This significantly reduces active client connections and connection setup latency (from 50ms to 5ms).
    • Co-location: PgBouncer, clients, and replicas are co-located within the same region to minimize network overhead.
    • Timeout Configuration: Careful configuration of idle timeouts prevents connection exhaustion.
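The benefit of pooling can be illustrated with a minimal in-process pool: connections are created once up front and reused, so callers skip per-request setup. This is a toy sketch of the idea behind PgBouncer, not its implementation; `connect_fn` stands in for whatever opens a real database connection.

```python
import queue

class ConnectionPool:
    """Minimal pool: reuse live connections instead of paying setup cost per request."""

    def __init__(self, connect_fn, size):
        self._idle = queue.Queue()
        for _ in range(size):
            self._idle.put(connect_fn())  # pay connection setup once, up front

    def acquire(self, timeout=None):
        # Block until a connection frees up rather than opening a new one,
        # capping the number of active server-side connections.
        return self._idle.get(timeout=timeout)

    def release(self, conn):
        self._idle.put(conn)  # return the connection for reuse
```

A fixed-size pool also acts as a natural concurrency limit on the database, which is part of why poolers reduce connection exhaustion.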
  6. Caching Strategy:
    • Cache Locking/Leasing: To prevent cache-miss storms from overwhelming PostgreSQL, a cache locking/leasing mechanism is implemented. When multiple requests miss the same cache key, only one request fetches the data from PostgreSQL, acquiring a lock, while others wait for the cache to be populated. This dramatically reduces redundant database reads.
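The lease pattern above can be sketched in a few lines: a per-key lock ensures that on a cache miss only one caller runs the backing fetch, while concurrent callers wait and then read the freshly populated entry. This is a minimal single-process illustration, assuming `fetch_fn` is the expensive PostgreSQL read; a production version would use a distributed lock and entry expiry.

```python
import threading

class CacheWithLease:
    """Cache where only one caller fetches a missed key from the backing store."""

    def __init__(self, fetch_fn):
        self._fetch_fn = fetch_fn    # assumed: the expensive database read
        self._cache = {}
        self._locks = {}             # one fetch lock per key
        self._meta = threading.Lock()

    def get(self, key):
        if key in self._cache:
            return self._cache[key]  # cache hit: no locking needed
        with self._meta:
            lock = self._locks.setdefault(key, threading.Lock())
        with lock:                   # only one thread fetches per key
            if key not in self._cache:  # re-check: another thread may have filled it
                self._cache[key] = self._fetch_fn(key)
        return self._cache[key]
```

The double-check inside the lock is what collapses a cache-miss storm into a single database read.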
  7. Replication Scalability:
    • Cascading Replication (Future): To overcome the primary's eventual limit in streaming Write Ahead Log (WAL) data to an ever-increasing number of replicas, OpenAI is collaborating with Azure PostgreSQL on cascading replication, where intermediate replicas relay WAL to downstream replicas. This aims to scale beyond 100 replicas without overwhelming the primary, albeit with increased operational complexity around failover.
  8. Rate Limiting:
    • Multi-layer Protection: Rate limiting is applied across multiple layers—application, connection pooler, proxy, and query—to prevent sudden traffic spikes, expensive query surges, or retry storms from exhausting resources and causing cascading failures.
    • Targeted Load Shedding: The ORM layer supports rate limiting and can fully block specific query digests for rapid recovery from sudden surges of expensive queries.
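A per-query-digest limiter with a blocklist can be sketched with a token bucket; the class and parameter names are illustrative, not OpenAI's ORM API.

```python
import time

class TokenBucket:
    """Token bucket: allow bursts up to `capacity`, refill at `rate` tokens/sec."""

    def __init__(self, rate, capacity):
        self.rate = rate
        self.capacity = capacity
        self.tokens = capacity
        self.last = time.monotonic()

    def allow(self):
        now = time.monotonic()
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False

class QueryGate:
    """Per-digest rate limiting plus a blocklist for shedding expensive query shapes."""

    def __init__(self, rate, capacity):
        self.buckets = {}
        self.blocked = set()
        self.rate, self.capacity = rate, capacity

    def block(self, digest):
        self.blocked.add(digest)  # fully shed this query shape for rapid recovery

    def allow(self, digest):
        if digest in self.blocked:
            return False
        bucket = self.buckets.setdefault(digest, TokenBucket(self.rate, self.capacity))
        return bucket.allow()
```

Running one such gate at each layer (application, pooler, proxy) gives the multi-layer protection described above, while `block()` implements the targeted shedding of a single query digest.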
  9. Schema Management:
    • Lightweight Changes: Only lightweight schema changes are permitted (e.g., adding/removing columns that don't trigger full table rewrites).
    • Timeouts and Concurrency: A strict 5-second timeout is enforced on schema changes, and concurrent index creation/dropping is allowed.
    • New Tables in Sharded Systems: New tables for new features are mandated to be in alternative sharded systems, not the existing PostgreSQL deployment.
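The timeout guard on schema changes might be applied as in the sketch below, which wraps a DDL statement with PostgreSQL's standard `statement_timeout` and `lock_timeout` settings. The 5-second value mirrors the post's strict limit; the 2-second `lock_timeout` and the helper itself are illustrative assumptions, not OpenAI's tooling.

```python
def guarded_ddl(ddl):
    """Return the statements to run a schema change under strict timeouts.

    statement_timeout and lock_timeout are standard PostgreSQL settings;
    the caller is assumed to execute each statement in order (note that
    CREATE INDEX CONCURRENTLY cannot run inside a transaction block).
    """
    return [
        "SET statement_timeout = '5s';",  # abort schema changes that run too long
        "SET lock_timeout = '2s';",       # give up fast if the table lock is contended (illustrative value)
        ddl,
        "RESET statement_timeout;",
        "RESET lock_timeout;",
    ]
```

A short `lock_timeout` matters as much as the statement timeout: even a cheap `ALTER TABLE` can queue behind a long reader and block all traffic to the table while it waits for the lock.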

These comprehensive engineering efforts enable OpenAI to achieve five-nines availability and low double-digit millisecond p99 client-side latency for their critical products, demonstrating that PostgreSQL, with careful design and optimization, can sustain extreme production workloads. Future plans include further migration of challenging write-heavy workloads, leveraging cascading replication, and exploring sharded PostgreSQL or alternative distributed systems as demands continue to grow.