Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

部署失败,本机的flow访问失败? #5725

Open
Yinbenxin opened this issue Oct 18, 2024 · 6 comments
Open

部署失败,本机的flow访问失败? #5725

Yinbenxin opened this issue Oct 18, 2024 · 6 comments

Comments

@Yinbenxin
Copy link

(venv) [root@fate_216 logs]# pipeline init --ip 10.248.202.216 --port 9380
Pipeline configuration succeeded.
(venv) [root@fate_216 logs]# flow test toy -gid 9999 -hid 9999
{
"code": 13,
"message": "Permission denied"
}
本机IP:10.248.202.216

@yx0090sh
Copy link

用的那个版本, 用的怎样的部署方式呢 ? 部署失败有具体的失败信息么, toy测试是报Permission denied 无法访问指定的服务器, 部署的时候填写的ip 是什么, 另外flow init 的IP 应该是和部署时用的ip 是一致的

@Yinbenxin
Copy link
Author

部署没有失败,就是测试toy时报这个错误,版本v2.0, allInone部署,ip:10.248.202.216。
[root@fate_216 aby_lr]# source ../projects/fate/bin/init_env.sh
(venv) [root@fate_216 aby_lr]# python test_lr.py
╭─────────────────────────────── Traceback (most recent call last) ────────────────────────────────╮
│ /data/projects/fate/common/python/venv/lib/python3.8/site-packages/fate_client-2.0.0-py3.8.egg/f │
│ ate_client/pipeline/utils/fateflow/fate_flow_job_invoker.py:78 in submit_job │
│ │
│ 75 │ │ try: │
│ 76 │ │ │ code = response["code"] │
│ 77 │ │ │ if code != 0: │
│ ❱ 78 │ │ │ │ raise ValueError(f"Return code {code}!=0") │
│ 79 │ │ │ │
│ 80 │ │ │ job_id = response["job_id"] │
│ 81 │ │ │ model_id = response["data"]["model_id"] │
╰──────────────────────────────────────────────────────────────────────────────────────────────────╯
ValueError: Return code 13!=0

During handling of the above exception, another exception occurred:

╭─────────────────────────────── Traceback (most recent call last) ────────────────────────────────╮
│ /home/bijianfeng/aby_lr/test_lr.py:40 in │
│ │
│ 37 │
│ 38 # compile and train │
│ 39 pipeline.compile() │
│ ❱ 40 pipeline.fit() │
│ 41 │
│ 42 │
│ 43 │
│ │
│ /data/projects/fate/common/python/venv/lib/python3.8/site-packages/fate_client-2.0.0-py3.8.egg/f │
│ ate_client/pipeline/pipeline.py:264 in fit │
│ │
│ 261 │ │ raise NotADirectoryError │
│ 262 │ │
│ 263 │ def fit(self) -> "Pipeline": │
│ ❱ 264 │ │ self._model_info = self._executor.fit(self._dag.dag_spec, │
│ 265 │ │ │ │ │ │ │ │ │ │ │ self.get_component_specs(), │
│ 266 │ │ │ │ │ │ │ │ │ │ │ local_role=self._local_role, │
│ 267 │ │ │ │ │ │ │ │ │ │ │ local_party_id=self._local_party_id, │
│ │
│ /data/projects/fate/common/python/venv/lib/python3.8/site-packages/fate_client-2.0.0-py3.8.egg/f │
│ ate_client/pipeline/executor/task_executor.py:33 in fit │
│ │
│ 30 │ │ flow_job_invoker = FATEFlowJobInvoker() │
│ 31 │ │ local_party_id = self.get_site_party_id(flow_job_invoker, dag_schema, local_role │
│ 32 │ │ │
│ ❱ 33 │ │ return self._run( │
│ 34 │ │ │ dag_schema, │
│ 35 │ │ │ local_role, │
│ 36 │ │ │ local_party_id, │
│ │
│ /data/projects/fate/common/python/venv/lib/python3.8/site-packages/fate_client-2.0.0-py3.8.egg/f │
│ ate_client/pipeline/executor/task_executor.py:67 in run │
│ │
│ 64 │ │ │ callback_handler: CallbackHandler, │
│ 65 │ │ │ event="fit") -> FateFlowModelInfo: │
│ 66 │ │ │
│ ❱ 67 │ │ job_id, model_id, model_version = flow_job_invoker.submit_job(dag_schema.dict(ex │
│ 68 │ │ │
│ 69 │ │ getattr(callback_handler, f"on
{event}_begin")( │
│ 70 │ │ │ job_info=dict( │
│ │
│ /data/projects/fate/common/python/venv/lib/python3.8/site-packages/fate_client-2.0.0-py3.8.egg/f │
│ ate_client/pipeline/utils/fateflow/fate_flow_job_invoker.py:85 in submit_job │
│ │
│ 82 │ │ │ model_version = response["data"]["model_version"] │
│ 83 │ │ │ return job_id, model_id, model_version │
│ 84 │ │ except BaseException: │
│ ❱ 85 │ │ │ raise ValueError(f"submit job is failed, response={response}") │
│ 86 │ │
│ 87 │ def query_job(self, job_id, role, party_id): │
│ 88 │ │ response = self._client.job.query(job_id, role, party_id) │
╰──────────────────────────────────────────────────────────────────────────────────────────────────╯
ValueError: submit job is failed, response={'code': 13, 'message': 'Permission denied'}

@yx0090sh
Copy link

猜测是flow 没有权限去写入机器文件, 运行的账号和部署的时候是不是有区分呢

@Yinbenxin
Copy link
Author

我看了一下错误时mysql没起来,又没单独重新安装mysql的方式

@yx0090sh
Copy link

可以先进入mysql的目录, 重启下MySQL, 看看报错的内容,根据报错内容解决

@yyf1986
Copy link
Contributor

yyf1986 commented Oct 21, 2024

是不是有依赖包没安装,导致命令行无法进入mysql,没完成初始化

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants