Any Difference in the Number of Nodes Managed by the Subnet Manager for Switch, OFED, and UFM?

The switch subnet manager, the OFED subnet manager, and UFM are software that are used to initialize and manage switches and hosts in the InfiniBand network.
Subnet manager of switch, subnet manager of OFED, UFM, is there any difference in the number of management nodes of them? Which one is more suitable for clients to choose when deploying?
Harry Collins

Harry Collins

Answered on 3:31 am

The switch subnet manager, the OFED subnet manager, and UFM are software that are used to initialize and manage switches and hosts in the InfiniBand network. Their main difference lies in where they run and the scope of their functions.

The switch subnet manager is software that runs on the switch itself. It can automatically discover and configure devices in the network and provide basic network monitoring and diagnostic functions. It does not require additional management nodes, but it also does not support advanced network optimization and security functions.

The OFED subnet manager is software that runs on the host. It can use the subnet management service provided by OpenSM or Mellanox OFED. It can run on multiple hosts, achieving subnet manager redundancy and load balancing. It supports some advanced network functions, such as adaptive routing and congestion control, but it also does not support network security and preventive maintenance functions.

UFM is software that runs on dedicated management nodes. It is NVIDIA’s unified network management platform. It can monitor and manage the InfiniBand network comprehensively, supporting various advanced network functions, such as adaptive routing, congestion control, network security, preventive maintenance, performance analysis, etc. It requires at least one management node and a UFM license.

Which one is more suitable for customers to choose depends on their needs and budgets. If customers only need basic network functions and do not want to add extra management nodes, they can choose the switch subnet manager. If customers need some advanced network functions and already have host nodes, they can choose the OFED subnet manager. If customers need the most comprehensive and advanced network functions and are willing to invest in dedicated management nodes and UFM licenses, they can choose UFM.

People Also Ask

SemiAnalysis of Huawei CloudMatrix and the 910C

Huawei has recently made a significant impact on the industry with its innovative AI accelerator and rack-level architecture. China’s latest domestically developed cloud supercomputing solution, CloudMatrix M8, was officially unveiled.

How to Extend the Life of GPU Servers?

Routine maintenance of GPU servers is critical to ensuring their stability and extending their service life. Here are some key maintenance details. Cleaning Exterior Cleaning: Clean the server housing regularly with

NVIDIA HGX B300 Overview

The NVIDIA HGX B300 platform represents a significant advancement in our computing infrastructure. Notably, the latest variant—designated as the NVIDIA HGX B300 NVL16—indicates the number of compute chips interconnected via

Optical Transceivers Overcome Heat

The rapid development of AI and large language models has led to a surge in demand for high-speed optical transceivers in data centers and AI cluster computers. As optical transceiver speeds

Related Articles

Daily maintenance of GPU servers

How to Extend the Life of GPU Servers?

Routine maintenance of GPU servers is critical to ensuring their stability and extending their service life. Here are some key maintenance details. Cleaning Exterior Cleaning: Clean the server housing regularly with

Read More »
NVIDIA-HGX-B300-Overview

NVIDIA HGX B300 Overview

The NVIDIA HGX B300 platform represents a significant advancement in our computing infrastructure. Notably, the latest variant—designated as the NVIDIA HGX B300 NVL16—indicates the number of compute chips interconnected via

Read More »
800G OSFP SR8 FLT

Optical Transceivers Overcome Heat

The rapid development of AI and large language models has led to a surge in demand for high-speed optical transceivers in data centers and AI cluster computers. As optical transceiver speeds

Read More »

Leave a Comment

Scroll to Top