1
0
mirror of https://github.com/openshift/openshift-docs.git synced 2026-02-05 12:46:18 +01:00
Files
openshift-docs/modules/serverless-gpu-resources-kn.adoc
2023-11-15 11:13:03 +00:00

44 lines
1.8 KiB
Plaintext

// Module included in the following assemblies:
//
// * serverless/integrations/gpu-resources.adoc
:_mod-docs-content-type: PROCEDURE
[id="serverless-gpu-resources-kn_{context}"]
= Specifying GPU requirements for a service
After GPU resources are enabled for your {product-title} cluster, you can specify GPU requirements for a Knative service using the Knative (`kn`) CLI.
.Prerequisites
* The {ServerlessOperatorName}, Knative Serving and Knative Eventing are installed on the cluster.
* You have installed the Knative (`kn`) CLI.
* GPU resources are enabled for your {product-title} cluster.
* You have created a project or have access to a project with the appropriate roles and permissions to create applications and other workloads in {product-title}.
ifndef::openshift-rosa[]
[NOTE]
====
Using NVIDIA GPU resources is not supported for {ibm-z-name} and {ibm-power-name}.
====
endif::openshift-rosa[]
.Procedure
. Create a Knative service and set the GPU resource requirement limit to `1` by using the `--limit nvidia.com/gpu=1` flag:
+
[source,terminal]
----
$ kn service create hello --image <service-image> --limit nvidia.com/gpu=1
----
+
A GPU resource requirement limit of `1` means that the service has 1 GPU resource dedicated. Services do not share GPU resources. Any other services that require GPU resources must wait until the GPU resource is no longer in use.
+
A limit of 1 GPU also means that applications exceeding usage of 1 GPU resource are restricted. If a service requests more than 1 GPU resource, it is deployed on a node where the GPU resource requirements can be met.
. Optional. For an existing service, you can change the GPU resource requirement limit to `3` by using the `--limit nvidia.com/gpu=3` flag:
+
[source,terminal]
----
$ kn service update hello --limit nvidia.com/gpu=3
----