Turbine fan design, with the air duct running from front to back, is suitable for forced convection cooling of server enclosures. It offers better cooling efficiency when deploying multiple cards.
Dual-slot thickness, height specifications are available (115/120/125/135mm), compatible with mainstream server enclosures.
The fifth-generation Tensor Core offers up to 3352 AI TOPS, suitable for large model inference, deep learning training, and scientific computing.
Made of all-metal construction, using industrial-grade materials, it supports 7×24-hour high-load operation.
| Category | Specification |
| GPU Model | GeForce RTX 5090 (GB202-300) |
| Architecture | Blackwell (TSMC 4N) |
| CUDA Cores | 21760 |
| Base Clock | 2017 MHz |
| Boost Clock | 2437–2467 MHz |
| FP32 Performance | 104.8 TFLOPS |
| Ray Tracing Cores | 4th Generation, 318 RT-TFLOPS |
| Tensor Cores | 5th Generation, 3352 AI TOPS |
| Memory Size | 32 GB |
| Memory Type | GDDR7 |
| Memory Interface | 512-bit |
| Memory Speed | 28 Gbps |
| Memory Bandwidth | 1792 GB/s |
| Bus Interface | PCIe 5.0 x16 |
| TDP | 575 W |
| Power Connector | 16-pin (12VHPWR) / 12V-2×6 |
| Display Connectors | 3×DisplayPort 2.1b, 1×HDMI 2.1b |
| Max Resolution | 8K@165Hz, 4K@480Hz |
| Form Factor | Dual-slot, Blower/Turbo Fan |
| Dimensions | 266.7mm (L) × 37.5mm (W), Height: 115/120/125/135mm (optional) |
| Recommended PSU | 1000W+ |
Server, Storage, Workstations, Memory, Hard Disk, laptop, Desktop.
Yes, we can customize products as your requirements.
We have the best professional engineer and strict QA and QC system.
We are looking for distributor and agent all over the world.
Normally are cartons, but also we can pack it according to your requirements.
It depends on the quantity you need, 7 days usually. Language Spoken: English, Chinese, Spanish, Japanese, Portuguese, German, Arabic, French, Russian, Korean, Hindi, Italian.