L40 GPU
The L40 targets visual computing and AI inference, supporting media processing, real-time rendering, and computer vision tasks in one GPU. It helps consolidate workloads that previously required separate hardware.
Starting Price
$0.43/hr
Available on 6 cloud providers

Key Specifications
๐พMemory
40GB VRAM
๐๏ธArchitecture
Ada Lovelace
โ๏ธCompute Units
N/A
๐งฎTensor Cores
568
Technical Specifications
Hardware Details
ManufacturerNVIDIA
ArchitectureAda Lovelace
CUDA Cores18176
Tensor Cores568
RT Cores142
Compute UnitsN/A
GenerationN/A
Memory & Performance
VRAM40GB
Memory Interface384-bit
Memory Bandwidth864 GB/s
FP32 Performance90.5 TFLOPS
FP16 Performance181.05 TFLOPS
INT8 Performance362 TOPS
Performance
Computing Power
CUDA Cores18,176
Tensor Cores568
RT Cores142
Computational Performance
FP32 (TFLOPS)90.5
FP16 (TFLOPS)181.05
INT8 (TOPS)362
Common Use Cases
neural graphics, virtualization, compute, AI
Machine Learning & AI
- Training large language models and transformers
- Computer vision and image processing
- Deep learning model development
- High-performance inference workloads
Graphics & Compute
- 3D rendering and visualization
- Scientific simulations
- Data center graphics virtualization
- High-performance computing (HPC)
Market Context
The L40 sits within NVIDIA's Ada Lovelace architecture lineup, positioned in the high performance tier.
Cloud Availability
Available across 6 cloud providers with prices ranging from $0.43/hr. Pricing and availability may vary by region and provider.
Market Position
Released in 2022, this GPU is positioned for professional workloads.