pricegpu

Best Cloud GPU for LLM inference 70B

Minimum 80GB VRAM · Recommended 160GB+ · Runtime: tokens-per-second

Cheapest for LLM inference 70B: NVIDIA A100 80GB PCIe on Salad
$0.890/hr/hr · verify on provider site
Try Salad →

Cheapest GPU Options — 9 eligible GPUs

ProviderConfigurationRegionBillingAvailabilityPrice/hr
Saladcheapest1x A100 PCIe 80GBdistributedper-minuteon-demand$0.890/hr Rent →
vast1x A100 PCIe 80GBus-eastper-secondon-demand$1.59/hr Rent →
RunPod1x A100 PCIe 80GBus-eastper-secondon-demand$1.64/hr Rent →
FluidStack1x A100 SXM 80GBus-eastper-minuteon-demand$1.85/hr Rent →
DataCrunch1x A100 SXM4 80GBeu-northper-minuteon-demand$1.89/hr Rent →
RunPod1x A100 SXMus-eastper-secondon-demand$1.89/hr Rent →
genesis1x A100 PCIe 80GBus-eastper-minuteon-demand$1.89/hr Rent →
TensorDock1x A100 PCIe 80GBus-eastper-minuteon-demand$1.99/hr Rent →
Hyperstack1x A100 SXM4 80GBuk-londonper-minuteon-demand$2.06/hr Rent →
CoreWeave1x A100 SXM4 80GBus-eastper-secondon-demand$2.21/hr Rent →
lambda1x A100 SXMus-west-2per-minuteon-demand$2.21/hr Rent →
Paperspace1x A100 SXMus-eastper-minuteon-demand$2.30/hr Rent →
Paperspace1x A100 PCIe 80GBus-eastper-minuteon-demand$2.30/hr Rent →
together1x A100 SXM 80GBus-eastper-secondon-demand$2.49/hr Rent →
fal1x A100 SXM 80GBus-eastper-millisecondon-demand$2.99/hr Rent →

GPU Requirements

Minimum VRAM80 GB
Recommended VRAM160 GB
Ideal GPUs
Typical Runtimetokens-per-second
Billing Patternsustained

FAQ

What GPU do I need for LLM inference 70B?
Requires at least 80GB VRAM. Recommended: 160GB+. Ideal: NVIDIA H100 80GB SXM, NVIDIA H100 80GB PCIe, NVIDIA H200 141GB SXM.
What is the cheapest GPU for LLM inference 70B?
NVIDIA A100 80GB PCIe at $0.890/hr/hr on Salad.
How much does LLM inference 70B cost per hour?
From $0.890/hr/hr. Runtime: tokens-per-second.

GPU-Specific Pages

NVIDIA H100 80GB SXM for LLM inference 70B
80GB VRAM
NVIDIA H100 80GB PCIe for LLM inference 70B
80GB VRAM
NVIDIA H200 141GB SXM for LLM inference 70B
141GB VRAM
NVIDIA A100 80GB SXM for LLM inference 70B
80GB VRAM