This section explains how to replace hardware components in your platform's nodes.

For detailed instructions, see the documentation from your hardware vendor.

Locating a Failed Drive

Gold-Tier hardware doesn’t use predefined drive mapping or panel LEDs to indicate drive health.

To Locate a Failed Drive by using the Web UI

  1. Log in to Qumulo Core.

  2. Click Cluster > Overview and then click the name of the node with a failed drive.

  3. On the page for the node, under Drive Details, the serial number for the failed drive is listed.

  4. Use the failed drive’s serial number and a server management tool to determine the physical location of the failed drive.

Initializing a Replacement Boot Drive

After you replace the boot drive, you must initialize the replacement boot drive by using the Qumulo Core Installer and then rebuild the replacement boot drive by using a script on the node in your cluster.

Step 1: Initialize the Replacement Boot Drive

  1. Create a Qumulo Core USB Drive Installer.

  2. Power on your node, enter the boot menu, and select your USB drive.

    The Qumulo Core Installer begins to run automatically.

  3. When prompted, take the following steps:

    1. Select [x] Perform maintenance.

    2. Select [1] Boot drive reset and then follow the prompts.

    The Qumulo Core Installer initializes the boot drive.

  4. When the process is complete, the node is powered down automatically.

Step 2: Rebuild the Replacement Boot Drive

  1. Power on your node and log in to the node by using the qq CLI.

  2. To get root privileges, run the sudo qsh command.

  3. To stop the Qumulo Networking Services, run the service qumulo-networking stop command.

  4. To configure the IP address for the node, run the ip addr add command and specify the node’s IP address. For example:

    ip addr add 203.0.113.0/CDR dev bond0
    
  5. Ensure that the node can ping other nodes in the cluster.

  6. Run the rebuild_boot_drive.py script and specify the IP address of another node in the cluster, the ID of the node whose boot drive has been replaced, and the password of the administrative account of the cluster. For example:

    /opt/qumulo/rebuild_boot_drive.py \
      --address 203.0.113.1 \
      --node-id 2 \
      --username admin \
      --password my\(Special\*Password
    

    Follow the prompts.

  7. When the process is complete, reboot the node.