Procedure for safely identifying and replacing a failed drive on Cisco platforms running Qumulo Core.
Drives in Qumulo clusters are hot-swappable. You can replace a failed drive without taking the node offline.
-
Identify the failed drive using the Qumulo Web UI or the
qq cluster_slotscommand. Note the node, drive bay, and drive serial number reported. -
Locate the failed drive in the chassis at the drive bay you noted on the indicated node.
Caution
Use the drive bay — not the Qumulo drive ID — to physically locate the drive. The Qumulo drive ID (shown in failure alerts such as "drive 7 failed") is a logical software identifier and does not correspond to a physical bay on the chassis. -
Confirm you have the right drive:
- Check for the amber fault LED on the drive carrier.
- Verify the drive serial number on the carrier matches the one you noted from Qumulo.
- If unsure, contact the Qumulo Care Team before removing any drive.
-
Remove the failed drive.
-
Insert the replacement drive.
-
Qumulo Core automatically detects and incorporates the new drive, then begins the reprotect process. Progress can be monitored from the Qumulo Web UI or with the
qq restriper_statuscommand.