Skip to content

Commit

Permalink
Merge pull request #423 from Jiaqicao257/master
Browse files Browse the repository at this point in the history
Update RDMA NCCL installer to disable martian logging
  • Loading branch information
Jiaqicao257 authored Dec 12, 2024
2 parents 75fa18d + 5ee5ea0 commit 1ddee13
Showing 1 changed file with 16 additions and 0 deletions.
16 changes: 16 additions & 0 deletions gpudirect-rdma/nccl-rdma-installer.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -54,6 +54,22 @@ spec:
hostPath:
path: /home/kubernetes/bin/gib
initContainers:
- name: disable-log-martian
image: alpine:latest
command: ["/bin/sh"]
securityContext:
privileged: true
args:
- -c
- |
sysctl -w net.ipv4.conf.eth2.log_martians=0
sysctl -w net.ipv4.conf.eth3.log_martians=0
sysctl -w net.ipv4.conf.eth4.log_martians=0
sysctl -w net.ipv4.conf.eth5.log_martians=0
sysctl -w net.ipv4.conf.eth6.log_martians=0
sysctl -w net.ipv4.conf.eth7.log_martians=0
sysctl -w net.ipv4.conf.eth8.log_martians=0
sysctl -w net.ipv4.conf.eth9.log_martians=0
- image: us-docker.pkg.dev/gce-ai-infra/gpudirect-gib/nccl-plugin-gib:v1.0.2
name: nccl-rdma-installer
resources:
Expand Down

0 comments on commit 1ddee13

Please sign in to comment.