NVIDIA A40 PCIe data center GPU accelerator for Visual Computing
NVIDIA A40 accelerates the most demanding visual computing workloads from the data center, combining cutting-edge NVIDIA Ampere architecture RT Cores, Tensor Cores, and CUDA Cores with 48 GB of graphics memory. This powerful GPU supports everything from virtual workstations accessible from anywhere to dedicated render nodes, bringing next-generation NVIDIA RTX technology to the data center for advanced professional visualization workloads.
TECHNICAL SPECIFICATION
Brand | NVIDIA |
Type | Tensor Core GPU |
Model | A40 |
GPU Architecture | NVIDIA Ampere |
GPU Memory | 48 GB GDDR6 with ECC |
GPU Memory Bandwidth | 696 GB/s |
Interconnect Interface | NVIDIA NVLink: 112.5 GB/s (bidirectional); PCIe Gen4: 64 GB/s |
NVIDIA Ampere architecture-based CUDA Cores | 10,752 |
NVIDIA second-generation RT Cores | 84 |
NVIDIA third-generation Tensor Cores | 336 |
Peak FP32 TFLOPS (non-Tensor) | 37.4 |
Peak FP16 Tensor TFLOPS with FP16 Accumulate | 149.7 / 299.4* |
Peak TF32 Tensor TFLOPS | 74.8 / 149.6* |
RT Core performance TFLOPS | 73.1 |
Peak BF16 Tensor TFLOPS with FP32 Accumulate | 149.7 / 299.4* |
Peak INT8 Tensor TOPS | 299.3 / 598.6* |
Peak INT4 Tensor TOPS | 598.7 / 1,197.4* |
Display ports | 3x DisplayPort 1.4**; supports NVIDIA Mosaic and Quadro Sync |
Max power consumption | 300 W |
Power connector | 8-pin CPU |
Thermal solution | Passive |
Virtual GPU (vGPU) software support | NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation, NVIDIA Virtual Compute Server |
vGPU profiles supported | Yes |
NVENC | 1x |
NVDEC | 2x (includes AV1 decode) |
Secure and measured boot with hardware root of trust | Yes (optional) |
NEBS ready | Level 3 |
Compute APIs | CUDA, DirectCompute, OpenCL, OpenACC |
Graphics APIs | Shader Model 5.1, DirectX 12.0, OpenGL 4.6, Vulkan 1.1 |
MIG support | No |
Form factor | 4.4" (H) x 10.5" (L), dual slot |
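The figures in the specification table above can be cross-checked against each other. The sketch below is a rough sanity check, not an official NVIDIA formula: it assumes each CUDA core performs one fused multiply-add (2 FLOPs) per clock, and it treats the starred Tensor values as the sparsity-enabled figures, which is an assumption based on the doubling described in the Tensor Core section. All constant names are illustrative.

```python
# Figures copied from the specification table.
CUDA_CORES = 10_752
RT_CORES = 84
TENSOR_CORES = 336
PEAK_FP32_TFLOPS = 37.4
TF32_UNSTARRED, TF32_STARRED = 74.8, 149.6

# Assumption: one fused multiply-add per CUDA core per clock = 2 FLOPs.
FLOPS_PER_CORE_PER_CLOCK = 2


def implied_clock_mhz(tflops: float, cores: int) -> float:
    """Clock in MHz needed to reach `tflops` under the FMA assumption."""
    flops_per_second = tflops * 1e12
    return flops_per_second / (cores * FLOPS_PER_CORE_PER_CLOCK) / 1e6


clock = implied_clock_mhz(PEAK_FP32_TFLOPS, CUDA_CORES)
cuda_per_rt = CUDA_CORES // RT_CORES
tensor_per_rt = TENSOR_CORES // RT_CORES
tf32_ratio = TF32_STARRED / TF32_UNSTARRED

print(f"Implied clock for peak FP32: {clock:.0f} MHz")
print(f"CUDA cores per RT core: {cuda_per_rt}")
print(f"Tensor Cores per RT core: {tensor_per_rt}")
print(f"Starred / unstarred TF32 ratio: {tf32_ratio:.2f}")
```

The core counts divide evenly (128 CUDA cores and 4 Tensor Cores per RT core), and the starred TF32 figure is exactly twice the unstarred one.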
NVIDIA Ampere Architecture CUDA Cores
The NVIDIA A40 features CUDA Cores that double the processing speed for single-precision floating point (FP32) operations while improving power efficiency. This results in significant performance enhancements for graphics and compute workflows, such as complex 3D computer-aided design (CAD) and computer-aided engineering (CAE).
Second-Generation RT Cores
Second-generation RT Cores deliver up to twice the throughput compared to the previous generation, allowing for concurrent ray tracing with shading or denoising capabilities. This results in significant speedups for tasks such as photorealistic movie rendering, architectural design assessments, and virtual prototyping. Moreover, it enhances the rendering of ray-traced motion blur, delivering quicker results with improved visual precision.
Third-Generation Tensor Cores
A40's Tensor Cores offer Tensor Float 32 (TF32) precision, providing up to 5X the training throughput of the previous generation. This accelerates AI and data science model training without any code changes. Hardware support for structural sparsity doubles the throughput for inferencing tasks. Tensor Cores also enhance graphics through deep learning super sampling (DLSS), AI denoising, and advanced editing capabilities in select applications.
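On the Ampere architecture, structural sparsity refers to a 2:4 pattern: in every group of four consecutive weights, at most two are non-zero. The sketch below illustrates that pattern in plain Python. The function name `prune_2_of_4` and the magnitude-based selection rule are illustrative assumptions, not NVIDIA's tooling; in practice a model is usually pruned and then fine-tuned with a framework's own utilities.

```python
def prune_2_of_4(weights):
    """Zero the two smallest-magnitude values in each group of four."""
    if len(weights) % 4 != 0:
        raise ValueError("length must be a multiple of 4")
    pruned = []
    for start in range(0, len(weights), 4):
        group = weights[start:start + 4]
        # Indices of the two largest magnitudes in this group.
        keep = sorted(range(4), key=lambda i: abs(group[i]), reverse=True)[:2]
        pruned.extend(w if i in keep else 0.0 for i, w in enumerate(group))
    return pruned


row = [0.9, -0.1, 0.05, -0.7, 0.2, 0.3, -0.25, 0.01]
sparse_row = prune_2_of_4(row)
print(sparse_row)  # [0.9, 0.0, 0.0, -0.7, 0.0, 0.3, -0.25, 0.0]
```

Because exactly half of the values in every group are zero, the hardware can skip them, which is where the doubled inference throughput comes from.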
Memory Clock | 7251 MHz |
Memory Type | GDDR6 |
Memory Size | 48 GB |
Memory Bus Width | 384 bits |
Peak Memory Bandwidth | Up to 696 GB/s |
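The peak bandwidth in the table above follows from the bus width and the memory clock. A minimal sketch of that arithmetic, assuming the 7251 figure is a memory clock in MHz and that data is transferred twice per clock (both are assumptions made here so the numbers can be checked, not statements from the listing):

```python
BUS_WIDTH_BITS = 384
MEMORY_CLOCK_MHZ = 7251   # assumption: the table's 7251 value is in MHz
TRANSFERS_PER_CLOCK = 2   # assumption: double data rate relative to that clock


def peak_bandwidth_gb_s(bus_bits: int, clock_mhz: float, transfers_per_clock: int) -> float:
    """Peak memory bandwidth in GB/s (1 GB = 1e9 bytes)."""
    bytes_per_transfer = bus_bits / 8
    transfers_per_second = clock_mhz * 1e6 * transfers_per_clock
    return bytes_per_transfer * transfers_per_second / 1e9


bandwidth = peak_bandwidth_gb_s(BUS_WIDTH_BITS, MEMORY_CLOCK_MHZ, TRANSFERS_PER_CLOCK)
print(f"Peak memory bandwidth: {bandwidth:.1f} GB/s")  # 696.1 GB/s
```

Under these assumptions the result matches the quoted 696 GB/s.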
48 GB GDDR6 Memory with NVLink
The A40 is equipped with ultra-fast GDDR6 memory, scalable up to 96 GB when two cards are connected with NVLink. This large memory capacity is essential for data scientists, engineers, and creative professionals working with massive datasets and workloads such as data science and simulation.
PCI Express Gen 4
PCI Express Gen 4 doubles the bandwidth of PCIe Gen 3, significantly enhancing data transfer speeds from CPU memory. This improvement benefits data-intensive tasks such as AI, data science, and 3D design. Enhanced PCIe performance also accelerates GPU direct memory access (DMA) transfers, facilitating faster input/output communication of video data between the GPU and GPUDirect for Video-enabled devices, making it a powerful solution for live broadcasts. A40 is also backward compatible with PCI Express Gen 3, providing deployment flexibility.
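The doubling from Gen 3 to Gen 4 can be checked with a short calculation. The sketch below assumes a x16 link (the listing does not state the lane count) and uses the per-lane signalling rates of 8 GT/s for Gen 3 and 16 GT/s for Gen 4 with 128b/130b encoding.

```python
LANES = 16                # assumption: x16 link
ENCODING = 128 / 130      # 128b/130b line encoding used by Gen 3 and Gen 4
RATES_GT_S = {"Gen3": 8.0, "Gen4": 16.0}


def per_direction_gb_s(rate_gt_s: float, lanes: int = LANES) -> float:
    """Usable bandwidth in one direction, in GB/s, after line encoding."""
    return rate_gt_s * ENCODING * lanes / 8


for gen, rate in RATES_GT_S.items():
    one_way = per_direction_gb_s(rate)
    print(f"{gen} x{LANES}: {one_way:.1f} GB/s per direction, "
          f"{2 * one_way:.1f} GB/s bidirectional")
```

The Gen 4 result of about 63 GB/s bidirectional is the quoted 64 GB/s figure once encoding overhead is subtracted.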
Data Center Efficiency and Security
With a dual-slot, power-efficient design, the NVIDIA A40 is up to twice as power-efficient as the previous generation and is compatible with a wide range of servers from global OEMs. It includes secure and measured boot with hardware root-of-trust technology to ensure firmware integrity and security.
The NVIDIA A40 GPU supports advanced visual computing functions such as real-time ray tracing, AI acceleration, and multi-workload operation, speeding up deep learning, data science, and compute-driven tasks. Virtual workstations built on the NVIDIA A40 with NVIDIA RTX Virtual Workstation (vWS) and NVIDIA Virtual Compute Server software are tested across a wide range of industry applications and professional software for performance and reliability.
Order Information
Manufacturers are responsible for warranties, which are based on purchase date and validity. At Grabnpay, we make sure the products we sell are delivered on time and in their original condition. Whether the issue is physical damage or an operational fault, our team will help you find a solution. We help you install, configure, and manage your devices, and we guide you through every step of filing a warranty claim.
Product | Description |
NVIDIA A40 Tensor Core GPU | NVIDIA A40 PCIe data center GPU accelerator for Visual Computing |