Cluster HA Verification


Experiment name: Cluster HA Verification of IBM Spectrum NAS

Experiment content:
Before starting this experiment, please make sure that you have completed the experiment "Expanding Capacity of Cluster System Nodes" so that the cluster has been expanded to cover 6 nodes.
IBM Spectrum NAS is designed for high reliability. It uses a symmetrical architecture in which every node runs the same small, efficient system core. Combined with data redundancy, this architecture avoids any single point of failure. Erasure coding stripes data across multiple nodes and locations, providing redundancy and protection. If a node suffers a hardware failure, the other nodes in the same cluster are notified and work in parallel to re-create any lost data, so that the business is not affected. This experiment walks you through verifying that the NAS cluster keeps providing NAS services normally even when 2 nodes are out of service. (Duration: 15-20 min)
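The fault-tolerance reasoning behind erasure coding can be sketched in a few lines. This is only an illustration of the general k+m rule (a stripe stays readable while at least k of its k+m fragments survive); the 4+2 layout and the `ErasureLayout` class are assumptions made for the example, not the documented configuration or API of IBM Spectrum NAS.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ErasureLayout:
    """Hypothetical k+m erasure-code layout, one fragment per node."""

    data_fragments: int    # k: fragments needed to rebuild a stripe
    parity_fragments: int  # m: additional redundancy fragments

    @property
    def total_fragments(self) -> int:
        return self.data_fragments + self.parity_fragments

    def readable_after(self, failed_nodes: int) -> bool:
        """Return True if a stripe can still be rebuilt after losing
        `failed_nodes` fragment holders."""
        if failed_nodes < 0:
            raise ValueError("failed_nodes must not be negative")
        surviving = self.total_fragments - failed_nodes
        return surviving >= self.data_fragments


# Assumed layout for a 6-node cluster: 4 data + 2 parity fragments.
layout = ErasureLayout(data_fragments=4, parity_fragments=2)

for failed in range(4):
    state = "readable" if layout.readable_after(failed) else "unavailable"
    print(f"{failed} node(s) offline: {state}")
```

Under this assumed layout, losing two nodes still leaves four fragments, which is enough to rebuild each stripe; a third simultaneous failure would not be.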

Experiment resources:
IBM Spectrum NAS 1.7.0 software
VMware vSphere 6.5
- CentOS 7 (VM)
- Windows Server 2008 R2 (VM)

Cluster HA Verification of IBM Spectrum NAS

The following content stays on the same screen during the experiment so that you can refer to it as needed. Start your experiment now!

  1. Log on to the maintenance interface (Duration: 5 min)
    Before starting this experiment, please make sure that you have completed the experiment "Expanding Capacity of Cluster System Nodes" so that the cluster has been expanded to cover 6 nodes.
    In the left-hand navigation bar, select IBM-NASDemo1[+2]. If the system prompts you for the cluster password, type Passw0rd! and click OK to enter the cluster management interface; if you are already logged in, skip this prompt.

    Then, in the NAS management interface, select the cluster name and click Maintenance.
  2. Copy files into the shared directory to keep the NAS service in use (Duration: 10 min)
    Open the file browser in the Windows host, and type \\192.168.180.23 in the address bar.

    Type the username and password (ibmnas/passw0rd) in the dialog that pops up.

    Copy the large file "test" (4.64GB) on the Windows desktop into the shared directory.

    The copying process starts.
  3. Simulate the failure of two nodes (Duration: 5 min)
    Return to the NAS management interface, select any two nodes, and click Take offline to shut them down.

    In the menu that pops up, select "Selected" so that only the currently selected nodes are taken offline.

    In the next dialog, select "Yes, I confirm" and then "Skip draining of nodes before taking them offline", and set "Delay self-healing by" to 0. Finally, click "Take Offline" to take the two NAS nodes offline immediately.

    In the NAS management interface, you can see that the status of the two nodes has changed from Online to Offline.

    On the Windows host, you can see that the copy is still in progress.

    After the copy completes, compare the source file and the target file; they are identical. This shows that the NAS cluster tolerates the concurrent failure of two nodes, i.e. the NAS file system can still serve external clients even when two nodes have failed.

    Finally, in the NAS management interface, select the two nodes that were just taken offline, and then select "Bring online" to add them back into the NAS cluster.
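
The source/target comparison in step 3 can also be done programmatically instead of by eye. The following is a minimal sketch that compares two files by size and SHA-256 digest; the helper names `sha256_of` and `files_match` are made up for this example, and it assumes Python is available on the Windows host, which the experiment itself does not require.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1024 * 1024):
    """Hash a file in 1 MiB chunks so a multi-gigabyte file is never
    loaded into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def files_match(source, target):
    """Return True if both files have the same size and SHA-256 digest."""
    # Cheap size check first: differing sizes make hashing unnecessary.
    if Path(source).stat().st_size != Path(target).stat().st_size:
        return False
    return sha256_of(source) == sha256_of(target)
```

To use it, call `files_match` with the path of the "test" file on the Windows desktop and the path of its copy under \\192.168.180.23; the exact desktop path and share folder depend on your environment and are not specified here.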