NVidia drivers not installed by extension on Windows 2019 Scale Set NV6ads A10 v5

Henrik Aldermo 56 Reputation points
2025-03-17T15:42:27.4266667+00:00

Related to https://learn.microsoft.com/en-us/answers/questions/2180437/how-to-fix-no-gpu-issue-with-nc-series-on-azure-vm

Ping @kobulloc-MSFT

On an Azure Virtual Machine Scale Set running Windows Server 2019 on Standard_NV6ads_A10_v5, the NVIDIA GPU Driver Extension fails, and no graphics drivers are installed.

Manual installation of the NVIDIA drivers succeeds. The issue does not affect Windows Server 2022.

VM has reported a failure when processing extension 'NvidiaGpuDriverWindows' (publisher 'Microsoft.HpcCompute' and type 'NvidiaGpuDriverWindows'). Error message: 'Enable Extension'

Log on VM (excerpt):

Execution Error:

Number of Tries: 1

Command C:\Packages\Plugins\Microsoft.HpcCompute.NvidiaGpuDriverWindows\1.10.0.4\run.cmd of Microsoft.HpcCompute.NvidiaGpuDriverWindows has exited with Exit code: 0

Plugin (name: Microsoft.HpcCompute.NvidiaGpuDriverWindows, version: 1.10.0.4) completed successfully.
Azure Virtual Machine Scale Sets

Accepted answer
  1. Mounika Reddy Anumandla 5,055 Reputation points Microsoft External Staff Moderator
    2025-03-19T08:55:09.7366667+00:00

    Hi Henrik Aldermo,

    Glad the issue is resolved for you. I am reposting your solution as an answer so that others looking for a fix to a similar issue can find it.

    The extension (Microsoft.HpcCompute.NvidiaGpuDriverWindows) always installs the latest GRID driver. The extension might be trying to install a driver that is incompatible with Windows Server 2019.

    Windows Server 2019 does support the GPU but may require a specific driver version that the extension doesn’t install correctly. For links to all previous Nvidia GRID driver versions, visit GitHub.

    Since Windows Server 2022 is officially supported, the extension installs the correct GRID 17.5 driver without issues.

    However, in your case the affected system is a virtual machine scale set, so installing drivers manually on every new instance is not viable.

    A working solution is to specify the driver version in the NVIDIA GPU Driver Extension settings, as documented in the "Known Issues" section here: https://learn.microsoft.com/en-us/azure/virtual-machines/extensions/hpccompute-gpu-windows#known-issues

    For a Virtual Machine Scale set, use the following:

    az vmss extension set --resource-group MyResourceGroup --vmss-name MyVmss --name NvidiaGpuDriverWindows --publisher Microsoft.HpcCompute --settings "{'driverVersion':'538.46'}"

    To view the settings:

    az vmss extension list --resource-group MyResourceGroup --vmss-name MyVmss
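
As a rough sketch, the two commands above could be wrapped in a small script that also pushes the updated model to instances that already exist. The resource names and the driver version are the placeholders from this answer. The `run` and `driver_settings` helpers and the `DRY_RUN` variable are hypothetical names introduced only for illustration. The `az vmss update-instances` step is only needed when the scale set uses a manual upgrade policy, and the `--query` filter assumes each listed extension exposes `name` and `settings` fields.

```shell
#!/bin/sh
# Sketch only: pin the NVIDIA driver version on a scale set and roll it out.
# Assumption: placeholder names from the answer; override via environment.
set -eu

RESOURCE_GROUP="${RESOURCE_GROUP:-MyResourceGroup}"
VMSS_NAME="${VMSS_NAME:-MyVmss}"
DRIVER_VERSION="${DRIVER_VERSION:-538.46}"
DRY_RUN="${DRY_RUN:-1}"   # hypothetical switch: 1 prints commands, 0 runs them

# Build the extension settings as strict JSON for the given driver version.
driver_settings() {
  printf '{"driverVersion":"%s"}' "$1"
}

# Print the command; execute it only when DRY_RUN=0.
run() {
  echo "+ $*"
  if [ "$DRY_RUN" = "0" ]; then
    "$@"
  fi
}

# 1. Pin the driver version in the scale set model.
run az vmss extension set \
  --resource-group "$RESOURCE_GROUP" --vmss-name "$VMSS_NAME" \
  --name NvidiaGpuDriverWindows --publisher Microsoft.HpcCompute \
  --settings "$(driver_settings "$DRIVER_VERSION")"

# 2. Apply the updated model to existing instances (manual upgrade policy only).
run az vmss update-instances \
  --resource-group "$RESOURCE_GROUP" --name "$VMSS_NAME" --instance-ids '*'

# 3. Show only the settings of the NVIDIA extension (assumed output shape).
run az vmss extension list \
  --resource-group "$RESOURCE_GROUP" --vmss-name "$VMSS_NAME" \
  --query "[?name=='NvidiaGpuDriverWindows'].settings"
```

With the default `DRY_RUN=1` the script only prints the three commands, so they can be reviewed first. Setting `DRY_RUN=0` runs them against the subscription selected in the Azure CLI.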

    Please remember to "Accept Answer" if any answer/reply helped, so that others in the community facing similar issues can easily find the solution.

    1 person found this answer helpful.