Casey

Expert in access networks, PON, GPON, etc.

SemiAnalysis of Huawei CloudMatrix and the 910C

Huawei has recently made a significant impact on the industry with its innovative AI accelerator and rack-level architecture. CloudMatrix M8, China’s latest domestically developed cloud supercomputing solution, has been officially unveiled. Built on the Ascend 910C processor, it is positioned as a direct rival to Nvidia’s GB200 NVL72 system, with advantages in several key metrics …

A Comprehensive Comparison of 400G QSFP112 SR4, OSFP SR4, QSFP-DD SR4, and QSFP-DD SR8 Modules

Introduction: The rapid rise of AI computing clusters and hyperscale data centers has led to exponential growth in network bandwidth requirements. As critical hardware for intra-data-center interconnects, 400G optical modules directly affect network performance, cost, and scalability, so their selection matters. In short-distance multimode scenarios, four mainstream modules—QSFP112 SR4, OSFP SR4, QSFP-DD SR4, and QSFP-DD …

What are the Advantages of a Layer 3 Switch Compared to a Router and a Layer 2 Switch?

In the fields of network engineering and information technology, selecting the right equipment is a core issue in network architecture design. Layer 3 switches are widely used in enterprise and data center networks, and their advantages over routers and Layer 2 switches often determine their crucial position in network design. This article will delve into …

NVIDIA GB200 Delivered, and Here Comes the GB300!

According to Taiwan’s Economic Daily News, NVIDIA plans to launch the next-generation GB300 AI server product line at the GTC conference in March next year. Recently, Foxconn and Quanta have proactively started the research and development of GB300 to seize the opportunity early. It is understood that NVIDIA has preliminarily determined the GB300 order configuration, …

Is Liquid Cooling Efficient Enough for Blackwell?

As AI technology continues to advance, more data centers are turning to liquid cooling. Compared to traditional air cooling methods, liquid cooling—especially Direct Liquid Cooling (DLC)—offers significantly higher heat dissipation efficiency. The thermal conductivity of liquid is 50 to 3,000 times greater than that of air, enabling better thermal management in high-density server environments that generate substantial heat. Additionally, …

What is a Multi-Rate Switch?

In enterprise networks and data centers, switches are a vital part of the network infrastructure. As bandwidth demand continues to increase, the transmission rates of traditional switches often cannot keep up, especially when data traffic surges. Multi-rate switches have emerged to meet these challenges. The emergence …

Understanding the Power of NVIDIA’s BlueField-3 DPU

Introduction: When working with NVIDIA’s H100 SXM servers, you may often see a configuration that includes two BFD-3 units. This raises questions, especially since the system already comes with eight CX-7 400G network cards. What are the fundamental differences and roles of BFD-3 compared to CX-7? Moreover, why does BFD have a BMC port when …

Detailed Analysis of NVIDIA GH200 Chip, Servers, and Cluster Networking

Traditional OEM GPU Servers: Intel/AMD x86 CPU + NVIDIA GPU. Before 2024, both NVIDIA’s own servers and third-party servers equipped with NVIDIA GPUs were based on x86 CPU machines. The GPUs were connected to the motherboard via PCIe cards or 8-card modules. At this stage, the CPU and GPU were independent. Server manufacturers could assemble …

In-Depth Analysis and Performance Profiling of NV Switch

NVIDIA’s GPU technology undoubtedly shines brightly in today’s high-performance computing landscape. With the rapid development of artificial intelligence and machine learning, the demand for computational power continues to grow, making interconnectivity between GPUs increasingly crucial. Against this backdrop, NVIDIA introduced the NVLink protocol and the multi-GPU interconnect solution based on this technology: NV Switch. This …

Analysis of NVIDIA’s Latest Hardware: B100/B200/GH200/NVL72/SuperPod

Overview: We previously gave a brief introduction to NVIDIA’s latest Blackwell GPU, but some of that content is easily misunderstood, partly because of ambiguous or vague concepts in NVIDIA’s official introduction. Additionally, we have seen some misunderstandings about the capabilities of the new generation of GPUs, such as the belief that they have dozens of times …

Exploring Internet Data Centers: the Evolution of DCN

Data Center Network (DCN) Demand Evolution: The network is a crucial component of IT infrastructure, serving as the foundation that connects all IaaS layer resources to provide services. In the era of data, the core of cloud computing, big data, and artificial intelligence is data itself, with the network acting as the high-speed highway that …

How to Build a Cluster with 128 DGX H100?

The NVIDIA DGX H100, released in 2022, is equipped with 8 single-port ConnectX-7 network cards, supporting NDR 400Gb/s bandwidth, and 2 dual-port BlueField-3 DPUs (200Gb/s) that can support IB/Ethernet networks. The appearance is shown in the following figure. The DGX H100 has 4 QSFP56 ports for the storage network and the in-band management network. In addition, there …

