mirror of
https://github.com/openshift/openshift-docs.git
synced 2026-02-06 06:46:26 +01:00
[WIP] Add details about GPU resources for Knative services
This commit is contained in:
@@ -1704,11 +1704,6 @@ Topics:
|
||||
# Apps
|
||||
- Name: Creating and managing serverless applications
|
||||
File: serving-creating-managing-apps
|
||||
# HA
|
||||
- Name: High availability on OpenShift Serverless
|
||||
File: serverless-HA
|
||||
- Name: Tracing requests
|
||||
File: serverless-tracing
|
||||
# Knative CLI
|
||||
- Name: Knative CLI
|
||||
Dir: knative_cli
|
||||
@@ -1752,6 +1747,18 @@ Topics:
|
||||
# Channels
|
||||
- Name: Using channels
|
||||
File: serverless-channels
|
||||
# HA
|
||||
- Name: High availability on OpenShift Serverless
|
||||
File: serverless-HA
|
||||
# Tracing
|
||||
- Name: Tracing requests
|
||||
File: serverless-tracing
|
||||
# Metering
|
||||
- Name: Using metering with OpenShift Serverless
|
||||
File: serverless-metering
|
||||
# Integrations
|
||||
# - Name: Integrations
|
||||
# Dir: integrations
|
||||
# Topics:
|
||||
# - Name: Using NVIDIA GPU resources with serverless applications
|
||||
# File: gpu-resources
|
||||
|
||||
39
serverless/integrations/gpu-resources.adoc
Normal file
39
serverless/integrations/gpu-resources.adoc
Normal file
@@ -0,0 +1,39 @@
|
||||
include::modules/serverless-document-attributes.adoc[]
|
||||
[id="gpu-resources"]
|
||||
= Using NVIDIA GPU resources with serverless applications
|
||||
:context: gpu-resources
|
||||
include::modules/common-attributes.adoc[]
|
||||
|
||||
toc::[]
|
||||
|
||||
NVIDIA supports experimental use of GPU resources on {product-title}.
|
||||
See link:https://docs.nvidia.com/datacenter/kubernetes/openshift-on-gpu-install-guide/index.html[{product-title} on NVIDIA GPU accelerated clusters] for more information about setting up GPU resources on {product-title}.
|
||||
|
||||
After GPU resources are enabled for your {product-title} cluster, you can specify GPU requirements for a Knative service using the `kn` CLI.
|
||||
|
||||
.Procedure
|
||||
|
||||
You can specify a GPU resource requirement when you create a Knative service using `kn`.
|
||||
|
||||
. Create a service.
|
||||
. Set the GPU resource requirement limit to `1` by using `nvidia.com/gpu=1`:
|
||||
+
|
||||
----
|
||||
$ kn service create hello --image docker.io/knativesamples/hellocuda-go --limit nvidia.com/gpu=1
|
||||
----
|
||||
+
|
||||
A GPU resource requirement limit of `1` means that the service has 1 GPU resource dedicated.
|
||||
Services do not share GPU resources. Any other services that require GPU resources must wait until the GPU resource is no longer in use.
|
||||
+
|
||||
A limit of 1 GPU also means that applications exceeding usage of 1 GPU resource are restricted.
|
||||
If a service requests more than 1 GPU resource, it is deployed on a node where the GPU resource requirements can be met.
|
||||
|
||||
.Updating GPU requirements for a Knative service using `kn`
|
||||
|
||||
* Update the service. Change the GPU resource requirement limit to `3` by using `nvidia.com/gpu=3`:
|
||||
----
|
||||
$ kn service update hello --limit nvidia.com/gpu=3
|
||||
----
|
||||
|
||||
== Additional resources
|
||||
* For more information about limits, see xref:../../applications/quotas/quotas-setting-per-project.adoc[Setting resource quotas for extended resources].
|
||||
1
serverless/integrations/images
Symbolic link
1
serverless/integrations/images
Symbolic link
@@ -0,0 +1 @@
|
||||
../images
|
||||
1
serverless/integrations/modules
Symbolic link
1
serverless/integrations/modules
Symbolic link
@@ -0,0 +1 @@
|
||||
../modules
|
||||
Reference in New Issue
Block a user