4 changes: 2 additions & 2 deletions README.md
@@ -127,11 +127,11 @@ Always update the openshift docset when there is a new gpu-operator docset versi
{
"preferred": "true",
+ "url": "../1.17.4",
+ "version": "1.17.4"
+ "version": "1.17.4/"
+ },
+ {
"url": "../1.17.3",
"version": "1.17.3"
"version": "1.17.3/"
},
```

39 changes: 26 additions & 13 deletions gpu-operator/life-cycle-policy.rst
@@ -88,11 +88,12 @@ Refer to :ref:`Upgrading the NVIDIA GPU Operator` for more information.
:header-rows: 2

* - :rspan:`1` Component
- :cspan:`2` GPU Operator Version
- :cspan:`3` GPU Operator Version

* - v26.3.0
- v26.3.1
- v26.3.2
- v26.3.3

* - NVIDIA GPU Driver |ki|_
- | `595.71.05 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-595-71-05/index.html>`_
@@ -122,64 +123,76 @@ Refer to :ref:`Upgrading the NVIDIA GPU Operator` for more information.
| `570.211.01 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-570-211-01/index.html>`_
| `535.309.01 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-535-309-01/index.html>`_
| `535.288.01 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-535-288-01/index.html>`_
- | `595.71.05 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-595-71-05/index.html>`_
| `595.58.03 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-595-58-03/index.html>`_
| `590.48.01 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-590-48-01/index.html>`_
| `580.159.04 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-580-159-04/index.html>`_ (**R**)
| `580.159.03 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-580-159-03/index.html>`_
| `580.126.20 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-580-126-20/index.html>`_ (**D**)
| `570.211.01 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-570-211-01/index.html>`_
| `535.309.01 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-535-309-01/index.html>`_
| `535.288.01 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-535-288-01/index.html>`_

* - NVIDIA Driver Manager for Kubernetes
- :cspan:`1` `v0.10.0 <https://ngc.nvidia.com/catalog/containers/nvidia:cloud-native:k8s-driver-manager>`__
- `v0.11.0 <https://ngc.nvidia.com/catalog/containers/nvidia:cloud-native:k8s-driver-manager>`__
- :cspan:`1` `v0.11.0 <https://ngc.nvidia.com/catalog/containers/nvidia:cloud-native:k8s-driver-manager>`__

* - NVIDIA Container Toolkit
- :cspan:`1` `1.19.0 <https://github.com/NVIDIA/nvidia-container-toolkit/releases>`__
- `1.19.1 <https://github.com/NVIDIA/nvidia-container-toolkit/releases>`__
- :cspan:`1` `1.19.1 <https://github.com/NVIDIA/nvidia-container-toolkit/releases>`__

* - NVIDIA Kubernetes Device Plugin
- :cspan:`1` `0.19.0 <https://github.com/NVIDIA/k8s-device-plugin/releases>`__
- `0.19.2 <https://github.com/NVIDIA/k8s-device-plugin/releases>`__
- `0.19.3 <https://github.com/NVIDIA/k8s-device-plugin/releases>`__

* - DCGM Exporter
- :cspan:`1` `v4.5.1-4.8.0 <https://github.com/NVIDIA/dcgm-exporter/releases>`__
- `v4.5.3-4.8.2 <https://github.com/NVIDIA/dcgm-exporter/releases>`__
- :cspan:`1` `v4.5.3-4.8.2 <https://github.com/NVIDIA/dcgm-exporter/releases>`__

* - Node Feature Discovery
- :cspan:`2` `v0.18.3 <https://github.com/kubernetes-sigs/node-feature-discovery/releases/>`__
- :cspan:`3` `v0.18.3 <https://github.com/kubernetes-sigs/node-feature-discovery/releases/>`__

* - | NVIDIA GPU Feature Discovery
| for Kubernetes
- :cspan:`1` `0.19.0 <https://github.com/NVIDIA/k8s-device-plugin/releases>`__
- `0.19.2 <https://github.com/NVIDIA/k8s-device-plugin/releases>`__
- `0.19.3 <https://github.com/NVIDIA/k8s-device-plugin/releases>`__

* - NVIDIA MIG Manager for Kubernetes
- :cspan:`1` `0.14.0 <https://github.com/NVIDIA/mig-parted/blob/main/CHANGELOG.md>`__
- `0.14.2 <https://github.com/NVIDIA/mig-parted/blob/main/CHANGELOG.md>`__
- :cspan:`1` `0.14.2 <https://github.com/NVIDIA/mig-parted/blob/main/CHANGELOG.md>`__

* - DCGM
- :cspan:`2` `4.5.2-1 <https://docs.nvidia.com/datacenter/dcgm/latest/release-notes/changelog.html>`__
- :cspan:`3` `4.5.2-1 <https://docs.nvidia.com/datacenter/dcgm/latest/release-notes/changelog.html>`__

* - Validator for NVIDIA GPU Operator
- v26.3.0
- v26.3.1
- v26.3.2
- v26.3.3

* - NVIDIA KubeVirt GPU Device Plugin
- :cspan:`2` `v1.5.0 <https://github.com/NVIDIA/kubevirt-gpu-device-plugin>`__
- :cspan:`3` `v1.5.0 <https://github.com/NVIDIA/kubevirt-gpu-device-plugin>`__

* - NVIDIA vGPU Device Manager
- :cspan:`2` `v0.4.2 <https://github.com/NVIDIA/vgpu-device-manager>`__
- :cspan:`3` `v0.4.2 <https://github.com/NVIDIA/vgpu-device-manager>`__

* - NVIDIA GDS Driver |gds|_
- :cspan:`2` `2.27.3 <https://github.com/NVIDIA/gds-nvidia-fs/releases>`__
- :cspan:`3` `2.27.3 <https://github.com/NVIDIA/gds-nvidia-fs/releases>`__

* - | NVIDIA Confidential Computing
| Manager for Kubernetes
- `v0.3.0 <https://github.com/NVIDIA/k8s-cc-manager/releases>`__
- :cspan:`1` `v0.4.0 <https://github.com/NVIDIA/k8s-cc-manager/releases>`__
- :cspan:`2` `v0.4.0 <https://github.com/NVIDIA/k8s-cc-manager/releases>`__

* - NVIDIA GDRCopy Driver
- `v2.5.1 <https://github.com/NVIDIA/gdrcopy/releases>`__
- :cspan:`1` `v2.5.2 <https://github.com/NVIDIA/gdrcopy/releases>`__
- :cspan:`2` `v2.5.2 <https://github.com/NVIDIA/gdrcopy/releases>`__

* - NVIDIA Kata Sandbox Device Plugin
- `v0.0.2 <https://github.com/NVIDIA/sandbox-device-plugin/releases>`__
- :cspan:`1` `v0.0.3 <https://github.com/NVIDIA/sandbox-device-plugin/releases>`__
- :cspan:`2` `v0.0.3 <https://github.com/NVIDIA/sandbox-device-plugin/releases>`__

.. _known-issue:

24 changes: 24 additions & 0 deletions gpu-operator/release-notes.rst
@@ -33,6 +33,30 @@ Refer to the :ref:`GPU Operator Component Matrix` for a list of software compone

----

.. _v26.3.3:

26.3.3
=======

New Features
------------

* Updated software component versions:

- NVIDIA Kubernetes Device Plugin v0.19.3

Collaborator Author


please confirm these component versions are correct.

Contributor


LGTM

- NVIDIA GPU Feature Discovery for Kubernetes v0.19.3

Fixed Issues
------------

* Fixed a regression where feature flags such as ``MOFED_ENABLED`` and ``GDS_ENABLED`` were enabled by default for the device plugin operand.
This caused all ``ibverbs`` device nodes on a node to be injected into GPU workload containers, disrupting RDMA and NCCL workloads by exposing network interfaces that were not intended for the workload.
The GPU Operator now infers these feature flags dynamically from the kernel modules that are loaded on each node, rather than enabling them unconditionally.
(`PR #2525 <https://github.com/NVIDIA/gpu-operator/pull/2525>`__, `k8s-device-plugin PR #1837 <https://github.com/NVIDIA/k8s-device-plugin/pull/1837>`__)
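
The idea of inferring these flags from loaded kernel modules, rather than enabling them unconditionally, can be illustrated with a minimal sketch. This is not the GPU Operator's implementation: the mapping from each flag to a module name (``mlx5_core`` for ``MOFED_ENABLED``, ``nvidia_fs`` for ``GDS_ENABLED``) and the function names are assumptions made for illustration. The only format relied on is that of ``/proc/modules``, where the module name is the first whitespace-separated field of each line.

```python
from typing import Dict, Set

# Hypothetical mapping from feature flag to the kernel module whose presence
# would imply it. The real operator may check different or additional modules.
FLAG_TO_MODULE: Dict[str, str] = {
    "MOFED_ENABLED": "mlx5_core",
    "GDS_ENABLED": "nvidia_fs",
}


def loaded_modules(proc_modules_text: str) -> Set[str]:
    """Return module names from text in /proc/modules format.

    The module name is the first whitespace-separated field on each line.
    """
    return {
        line.split()[0]
        for line in proc_modules_text.splitlines()
        if line.strip()
    }


def infer_flags(proc_modules_text: str) -> Dict[str, bool]:
    """Enable a flag only when its (assumed) kernel module is loaded."""
    modules = loaded_modules(proc_modules_text)
    return {flag: module in modules for flag, module in FLAG_TO_MODULE.items()}


if __name__ == "__main__":
    # On a real node the text would come from open("/proc/modules").read().
    sample = "nvidia 56823808 0 - Live 0x0\nnvidia_fs 262144 0 - Live 0x0\n"
    print(infer_flags(sample))
```

With this approach a node without the relevant module loaded leaves the corresponding flag disabled, which matches the behavior described in the fix above.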


----

.. _v26.3.2:

26.3.2
2 changes: 1 addition & 1 deletion repo.toml
@@ -172,7 +172,7 @@ docs_root = "${root}/gpu-operator"
project = "gpu-operator"
name = "NVIDIA GPU Operator"
version = "26.3" # Update repo_docs.projects.openshift.version to match latest patch version maj.min.patch
source_substitutions = { minor_version = "26.3", version = "v26.3.2", recommended = "580.126.20", dra_version = "25.12.0" }
source_substitutions = { minor_version = "26.3", version = "v26.3.3", recommended = "580.126.20", dra_version = "25.12.0" }
copyright_start = 2020
sphinx_exclude_patterns = [
"life-cycle-policy.rst",