Skip to product information
1 of 1

Kentino s.r.o.

Inference 70B L40 Ai server

Inference 70B L40 Ai server

Regular price €41.646,27 EUR
Regular price €41.646,27 EUR Sale price €41.646,27 EUR
Sale Sold out
Taxes included.

70B L40 Computer

Specifications

  • GPU: 6x NVIDIA L40 (288 GB VRAM total)
  • Motherboard: ASRock Rack ROMED8-2T
  • CPU: AMD EPYC 7542
  • RAM: 512GB SK Hynix 2666MHz REG ECC DDR4 LRDIMM (8 x 64GB)
  • GPU-Motherboard Connection: RYSER PCIe 4.0 x16 Cable
  • Power Supply: 2x AX1600i 1000W
  • Case: 4U Rack Mount
  • Storage:
    • 2TB NVMe SSD
    • 500GB SATA Drive

Key Features

  1. High-Performance GPU Compute: Equipped with 6 NVIDIA L40 GPUs, providing a total of 288 GB VRAM for demanding AI, machine learning, and visualization workloads.
  2. Server-Grade Components: Features the reliable ASRock Rack ROMED8-2T motherboard and a powerful AMD EPYC 7542 CPU for exceptional processing capabilities.
  3. Ample Memory: 512GB of high-speed SK Hynix DDR4 RAM ensures smooth multitasking and efficient data processing for complex computations.
  4. High-Speed GPU Integration: Utilizes the RYSER PCIe 4.0 x16 cable for fast, full-bandwidth connection between the GPUs and the motherboard, ensuring optimal performance and data transfer speeds.
  5. Robust Power Supply: Dual AX1600i 1000W units provide stable and ample power delivery to support the high-performance components under heavy loads.
  6. Expandable Storage: Comes with a fast 2TB NVMe SSD for primary storage and an additional 500GB SATA drive for extra capacity.
  7. Professional-Grade Cooling: Housed in a spacious 24U rack mount case, providing optimal airflow and thermal management for sustained high-performance operation.
  8. Versatile Configuration: Designed for a wide range of high-performance computing tasks, from AI and machine learning to professional visualization and rendering.

Ideal Use Cases

  • Large Language Model Inference (e.g., 70B parameter models)
  • AI and Machine Learning Research
  • Data Analytics and Visualization
  • Professional 3D Rendering and Animation
  • Scientific Simulations
  • High-Performance Computing (HPC) Applications
  • Computer Vision and Image Processing
  • Financial Modeling and Risk Analysis

Special Notes

  • Optimized for 70B Models: With 288 GB of total GPU VRAM, this system is specifically designed to handle large language models with up to 70 billion parameters, making it ideal for cutting-edge AI research and applications.
  • NVIDIA L40 Advantage: The L40 GPUs offer a balance of compute performance and memory, suitable for a wide range of AI, HPC, and professional visualization workloads.
  • PCIe 4.0 Performance: The RYSER PCIe 4.0 x16 cable ensures that each GPU can operate at full bandwidth, maximizing data throughput and minimizing latency.
  • Scalable Design: While optimized for 70B parameter models, this system can be easily scaled or clustered for even larger workloads or multi-user environments.

The 70B L40 Computer represents a powerful and versatile solution for organizations and researchers working with large AI models, particularly in the realm of natural language processing and generation. Its balanced configuration of NVIDIA L40 GPUs, AMD EPYC CPU, and high-speed memory makes it suitable for a wide range of high-performance computing tasks beyond AI, including scientific simulations, data analytics, and professional visualization.

View full details