NVIDIA
[PDF] NVIDIA A100 Tensor Core GPU Architecture [web site] NVIDIA Ampere Architecture [developer blog] NVIDIA Ampere Architecture In-Depth [blog] TensorFloat-32 in the A100 GPU Accelerates AI Training, HPC up to 20xWikipedia
Ampere (microarchitecture) CUDA 11 High Bandwidth Memory 2 (HBM2) NVLink 3.0 PCI Express 4.0