Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Smartswitch] Orchagent is crashing when the MGMT_VRF is enabled #19638

Open
vivekrnv opened this issue Jul 19, 2024 · 5 comments
Open

[Smartswitch] Orchagent is crashing when the MGMT_VRF is enabled #19638

vivekrnv opened this issue Jul 19, 2024 · 5 comments
Assignees
Labels
MSFT Triaged this issue has been triaged

Comments

@vivekrnv
Copy link
Contributor

Description

When the MGMT_VRF_CONFIG is present, zmq_bind fails in orchagent and aborts. Seen on smartswitch platforms

Steps to reproduce the issue:

  1. Add the MGMT_VRF config
"MGMT_VRF_CONFIG": {"vrf_global": {
       "mgmtVrfEnabled": "true"}}
  1. Do config reload or reboot

or run the following test test_syslog_config_work_after_reboot

Describe the results you received:

Jul  8 16:09:52.748231 r-smartswitch-03 ERR swss#orchagent: :- mqPollThread: zmq_bind failed on endpoint: tcp://10.210.25.4:8100, zmqerrno: 99

Describe the results you expected:

Output of show version:

(paste your output here)

Output of show techsupport:

(paste your output here or download and attach the file here )

Additional information you deem important (e.g. issue happens only occasionally):

@vivekrnv
Copy link
Contributor Author

@liuh-80 @Pterosaur @prsunny Please check

@vivekrnv
Copy link
Contributor Author

https://github.com/sonic-net/sonic-buildimage/blob/master/dockers/docker-orchagent/orchagent.sh#L87

@judyjoseph
Copy link
Contributor

@liuh-80 could you take a look at this issue

@judyjoseph judyjoseph added Triaged this issue has been triaged MSFT labels Jul 31, 2024
@liuh-80
Copy link
Contributor

liuh-80 commented Aug 12, 2024

will check and fix the issue soon

@liuh-80
Copy link
Contributor

liuh-80 commented Sep 24, 2024

The root cause of this issue is because currently ZMQ not bind to VRF by ZMQ_BINDTODEVICE when VRF enabled:
https://libzmq.readthedocs.io/en/latest/zmq_setsockopt.html
This need code change in sonic-swss-common/sonic-gnmi/sonic-buildimage.

Need 4 PRs to fix this issue:

Repo PR
sonic-swss-common sonic-net/sonic-swss-common#920
sonic-swss sonic-net/sonic-swss#3299
sonic-gnmi sonic-net/sonic-gnmi#290
sonic-buildimage #20345

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
MSFT Triaged this issue has been triaged
Projects
None yet
Development

No branches or pull requests

3 participants