L40 vs CPU
Explore a head to head comparison of specifications, performance, and pricing.
L40
The NVIDIA L40 delivers high-performance computing capabilities for AI, machine learning, and data science applications.
CPU
The n/a CPU delivers high-performance computing capabilities for AI, machine learning, and data science applications.
See how the L40 & CPU compare
Compare detailed hardware specifications and average pricing for the L40 and CPU.
Compare Hardware Specifications
| L40 | CPU | |
|---|---|---|
| GPU Type | L40 | CPU |
| VRAM per GPU | 48 GB | 0 GB |
| Manufacturer | NVIDIA | n/a |
| Architecture | Ada Lovelace | N/A |
| Interconnect | PCIe Gen4 | pcie |
| Memory Bandwidth | 864 GB/s | N/A |
| FP16 TFLOPS | 90.52 TFLOPS (1:1) | N/A |
| CUDA Cores | 18176 | N/A |
| Tensor Cores | 568 (4th Gen) | N/A |
| RT Cores | 142 (3rd Gen) | N/A |
| Base Clock | 735 MHz | N/A |
| Boost Clock | 2490 MHz | N/A |
| TDP | 300W | N/A |
| Process Node | TSMC 4N | N/A |
| Data Formats | FP8, INT8, BF16, FP16, TF32, FP32 | N/A |
Compare Average On-Demand Pricing
| L40 | CPU | |
|---|---|---|
| 1 GPU | $0.99 /hr | N/A |
| 2 GPUs | $1.99 /hr | N/A |
| 4 GPUs | $4.98 /hr | N/A |
| 8 GPUs | $8.00 /hr | N/A |
Frequently Asked Questions: L40 vs CPU
The main differences are VRAM (48 GB vs 0 GB).
The L40 is generally better for large language model training due to its higher throughput and 48 GB of VRAM, which allows fitting larger models or larger batch sizes in a single pass. For smaller models or fine-tuning tasks where cost matters more, both GPUs can be effective.
Pricing for the L40 and CPU varies by cloud provider, region, and contract type. Shadeform aggregates pricing from 30+ GPU cloud providers so you can compare and find the best rate. Use the instance table above to see current on-demand prices.
The L40 has more VRAM at 48 GB, compared to 0 GB on the CPU. Higher VRAM allows you to run larger models without quantization, use longer context windows, and process larger batch sizes — all of which improve throughput and reduce latency for memory-bound workloads.
The L40 is currently available across 2 cloud providers on Shadeform's network, compared to 2 for the CPU. Shadeform lets you deploy either GPU across all available providers from a single platform, so you can always find available capacity without manually checking each cloud.
Mixing different GPU types in a single training cluster is generally not recommended, as it creates performance bottlenecks where faster GPUs wait for slower ones. For best results, use a homogeneous cluster of either L40 or CPU. Shadeform supports on-demand clusters of up to 64 GPUs of the same type with no commitment required.
Explore L40 & CPU Instances
Browse available instances with L40 and CPU GPUs. Filter by provider, availability, and more to find the perfect instance for your needs.