Replacing/Repairing a failed drive in 2-node Hyper-V S2D Cluster.

Justin Buckley 0 Reputation points
2025-02-14T17:54:53.5566667+00:00

Hello! I'm working with a 2-node Windows Server 2022 Hyper-V S2D Cluster (we'll call the nodes HV1 and HV2). There are 8 x 4TB NVMe drives per server, and 2 CSVs (two-way mirror/CSVFS_ReFS) used for VM data storage.

HV1 had a physical disk lose connection and was subsequently auto-retired. The storage pool (fixed) and both CSVs are shown to be healthy and online.

I have a replacement drive on hand, but I'd like to also attemp to re-seat the existing failed/retired drive in case it was actually just a connection issue. How do I safely go about this, while also keeping the cluster up and running?

Hyper-V
Hyper-V
A Windows technology providing a hypervisor-based virtualization solution enabling customers to consolidate workloads onto a single server.
2,855 questions
Clustering
Clustering
The grouping of multiple servers in a way that allows them to appear to be a single unit to client computers on a network. Clustering is a means of increasing network capacity, providing live backup in case one of the servers fails, and improving data security.
3 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Ian Xue-MSFT 41,716 Reputation points Microsoft External Staff
    2025-02-19T03:37:48.4+00:00

    Hi Justin Buckley,

    Thanks for your post. Please refer to the following steps to replace drivers:

    1. Check status of storage subsystem and storage jobs by using PowerShell:

    Get-StorageSubSystem Cluster | Get-StorageJob

    1. See physical disk footprint:

    Get-PhysicalDisk | ft DeviceId,FriendlyName,SerialNumber,Uniqueid,*Status,foot,Usage,PhysicalLocation

    1. Retire bad disk, one specific disk

    Get-PhysicalDisk -SerialNumber XXXXXXXXX | Set-PhysicalDisk -Usage Retired

    1. See physical disk footprint (to get any disk by SerialNumber). Monitor until footprint is zero.

    Get-PhysicalDisk -SerialNumber XXXXXXXXX | ft DeviceId,FriendlyName,SerialNumber,Uniqueid,*Status,foot,Usage,PhysicalLocation

    1. Remove retired disk from storage pool.

    $FailedDisk = Get-PhysicalDisk -SerialNumber XXXXXXXX

    $Pool = $FailedDisk | Get-StoragePool -IsPrimordial:$false

    Remove-PhysicalDisk -StoragePool $pool -PhysicalDisks $FailedDisk

    1. Enable LED light of physical disk on a server rack, to more easily identify the disk.

    Get-PhysicalDisk |? -SerialNumber XXXXXXXXX | Enable-PhysicalDiskIdentification

    1. Physically remove the failed disk from the server chassis.
    2. Physically install the new disk into the server chassis.
    3. Disable LED light of physical disk on a server chassis.

    Disable-PhysicalDiskIdentification

    1. Add new physical disk into pool.

    $disk = Get-PhysicalDisk |? CanPool -like True

    Get-StoragePool S2D | Add-PhysicalDisk -PhysicalDisks $disk

    1. Get virtual disk and repair virtual disk. Repair job may start on its own.

    Get-VirtualDisk

    Repair-VirtualDisk VirtualDiskName

    1. Wait until repair completes.

    Get-StorageJob

    1. Once the repair storage jobs are complete, repeat steps above for any other capacity disks which need to be replaced.

    Best Regards,

    Ian Xue


    If the Answer is helpful, please click "Accept Answer" and upvote it.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.