Skip to content

Commit

Permalink
ET-6009 Add probes
Browse files Browse the repository at this point in the history
  • Loading branch information
aazon committed May 6, 2024
1 parent d983cfe commit 80604a4
Show file tree
Hide file tree
Showing 3 changed files with 25 additions and 2 deletions.
2 changes: 1 addition & 1 deletion charts/vllm-otc/Chart.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@ type: application
# This is the chart version. This version number should be incremented each time you make changes
# to the chart and its templates, including the app version.
# Versions are expected to follow Semantic Versioning (https://semver.org/)
version: 0.2.0
version: 0.3.0

# This is the version number of the application being deployed. This version number should be
# incremented each time you make changes to the application. Versions are not expected to
Expand Down
4 changes: 4 additions & 0 deletions charts/vllm-otc/templates/deployment.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -51,6 +51,10 @@ spec:
resources:
limits:
"nvidia.com/gpu": "{{ .Values.gpus }}"
livenessProbe:
{{ toYaml .Values.livenessProbe | indent 10 }}
startupProbe:
{{ toYaml .Values.startupProbe | indent 10 }}
volumeMounts:
- name: huggingface-cache
mountPath: /models
Expand Down
21 changes: 20 additions & 1 deletion charts/vllm-otc/values.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -51,7 +51,6 @@ service:
type: ClusterIP
port: 8000

# TODO most likely a PV must not be inside the Helm chart
pv:
cryptKeyID: ""
exportLocation: ""
Expand All @@ -73,3 +72,23 @@ tolerations:
- key: "gpu-node"
operator: "Exists"
effect: "PreferNoSchedule"

livenessProbe:
httpGet:
path: "/v1/models"
port: http
scheme: HTTP
initialDelaySeconds: 5
timeoutSeconds: 5
periodSeconds: 10
successThreshold: 1
failureThreshold: 3

startupProbe:
httpGet:
path: "/v1/models"
port: http
scheme: HTTP
initialDelaySeconds: 30
periodSeconds: 10
failureThreshold: 60

0 comments on commit 80604a4

Please sign in to comment.