Which is cheaper: g5.16xlarge or g5.4xlarge?

The g5.16xlarge On-Demand price is $4.0960/hr, while g5.4xlarge is $1.6240/hr.

Which has better performance: g5.16xlarge or g5.4xlarge?

g5.16xlarge scores N/A (single-core) and N/A (multi-core). g5.4xlarge scores N/A (single-core) and N/A (multi-core).

AWS g5.16xlargevsAWS g5.4xlarge

g5.16xlarge:64 vCPUs · 256 GB RAM · x86_64 · $4.0960/hr On-Demand · $1.5821/hr Spot

g5.4xlarge:16 vCPUs · 64 GB RAM · x86_64 · $1.6240/hr On-Demand · $0.8202/hr Spot

g5.16xlarge

vCPUs: 64

RAM: 256 GB

Architecture: x86_64

On-Demand: $4.0960/hr

Spot: $1.5821/hr

Single-Core: N/A

Multi-Core: N/A

g5.4xlarge

vCPUs: 16

RAM: 64 GB

Architecture: x86_64

On-Demand: $1.6240/hr

Spot: $0.8202/hr

Single-Core: N/A

Multi-Core: N/A

See full g5.16xlarge specs, history & regional pricing →See full g5.4xlarge specs, history & regional pricing →

g5.16xlarge vs g5.4xlarge: how to choose

g5.16xlarge pairs 64 vCPUs with 256GB of RAM at $4.0960/hr On-Demand (about $2949/mo at 24×7). g5.4xlarge pairs 16 vCPUs with 64GB at $1.6240/hr (~$1169/mo). g5.4xlarge is 60% cheaper per hour than g5.16xlarge ($2.4720/hr gap).

Because both instances are in the **g5 family**, the only thing that changes between them is sizing — same silicon, same architecture (Intel Xeon (x86_64)), same burstable/sustained behavior. The choice is purely about how much capacity you actually need: g5.16xlarge gives you 64 vCPUs and 256GB of RAM, g5.4xlarge gives you 16 vCPUs and 64GB. AWS scales pricing close to linearly within a family, so picking the right size is mostly about right-sizing your workload, not getting a better deal per vCPU.

Benchmark data for at least one of these instances is still being collected, so a direct performance-per-dollar comparison isn't possible yet. Sysbench scores are pending for g5.16xlarge and pending for g5.4xlarge. Check back as the benchmark queue completes — newer-generation instances typically score 10–30% higher on single-thread and 15–50% higher on multi-core vs the previous generation in the same series.

In practice, pick g5.16xlarge when your workload is closer to GPU-accelerated (graphics + ML inference) (graphics workloads, video transcoding, ML inference). Pick g5.4xlarge when it's closer to GPU-accelerated (graphics + ML inference) (graphics workloads, video transcoding, ML inference). When neither side is obviously right, the cheaper hourly rate usually wins for fault-tolerant batch workloads, while the higher single-core score usually wins for latency-sensitive web traffic. The regional pricing tables linked from each instance page below show where each is currently cheapest — sometimes a >20% regional gap flips the comparison entirely.

On-Demand Price Comparison

Monthly trajectory

Spot Price Comparison

30-Day daily trajectory

g5.16xlarge

g5.4xlarge

g5.16xlarge vs g5.4xlarge: how to choose

On-Demand Price Comparison

Spot Price Comparison

Browse All Cloud Instances