Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The issue of enjoy_husky_navigate after it is trained #99

Open
Berk035 opened this issue Aug 21, 2019 · 2 comments
Open

The issue of enjoy_husky_navigate after it is trained #99

Berk035 opened this issue Aug 21, 2019 · 2 comments

Comments

@Berk035
Copy link

Berk035 commented Aug 21, 2019

Hello everyone,

I study on navigate examples of Gibson. I am trying to reach target position and something is missing. There is a wrong with enjoy_husky_navigate_ppo1.py while I am trying to apply my trained model. The error is shown below:

Error in atexit._run_exitfuncs: Traceback (most recent call last): File "/home/deepsrv/anaconda3/envs/py35/lib/python3.5/site-packages/gym/utils/closer.py", line 67, in close closeable.close() File "/home/deepsrv/PycharmProjects/Gibson_Env/gibson/envs/env_modalities.py", line 490, in _close self.r_camera_rgb._close() AttributeError: 'NoneType' object has no attribute '_close'

Also I met with this error at fuse_policy function. My model consists 3000 timesteps per actorbatch. How can handle with this error?

Thanks in advance.

@Berk035
Copy link
Author

Berk035 commented Aug 22, 2019

I fixed the problem which is above. It is occurred due to 'out of memory' for CUDA. I closed some programs and it runs properly.

Unfortunately, I have a different problem with running model on enjoy_husky_navigate. I met with the error this time:

killing <subprocess.Popen object at 0x7fe1439ed8d0> File "/home/deepsrv/PycharmProjects/Gibson_Env/examples/train/enjoy_husky_navigate_ppo1.py", line 101, in <module> File "/home/deepsrv/PycharmProjects/Gibson_Env/examples/train/enjoy_husky_navigate_ppo1.py", line 88, in main File "/home/deepsrv/PycharmProjects/Gibson_Env/examples/train/enjoy_husky_navigate_ppo1.py", line 73, in train File "/home/deepsrv/PycharmProjects/Gibson_Env/gibson/utils/pposgd_simple.py", line 378, in enjoy File "/home/deepsrv/anaconda3/envs/py35/lib/python3.5/site-packages/tensorflow/python/training/saver.py", line 1560, in restore File "/home/deepsrv/anaconda3/envs/py35/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 895, in run File "/home/deepsrv/anaconda3/envs/py35/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 1124, in _run File "/home/deepsrv/anaconda3/envs/py35/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 1321, in _do_run File "/home/deepsrv/anaconda3/envs/py35/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 1340, in _do_call NotFoundError: Key pi/polfc1/kernel not found in checkpoint [[Node: save/RestoreV2_19 = RestoreV2[dtypes=[DT_FLOAT], _device="/job:localhost/replica:0/task:0/cpu:0"](_arg_save/Const_0_0, save/RestoreV2_19/tensor_names, save/RestoreV2_19/shape_and_slices)]]

What is wrong this time?

@Berk035
Copy link
Author

Berk035 commented Aug 23, 2019

I realized that training mode is different. So, it causes this error. But, I want to ask another thing.
How can I continue to training process with trained model file?
Is the code this one:
if reload name: saver = tf.train.Saver() saver.restore(tf.get_default_session(), reload name)

Thanks in advance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant