multi_agent_env #73

yasuohayashibara · 2023-09-18T21:49:17Z

multi-agentの学習を行う環境を作成する．
インターフェイスはPettingZooを参考にする．

yasuohayashibara · 2023-09-18T21:56:20Z

速度の検証から（訓練の時間に影響する要素であるため）
検証PC　Galleria　ノートPC　i7　RTX2070
簡易的なモデルで6体並べてシミュレーションすると最高16倍程度
https://youtu.be/o6aQ4tWirx0

yasuohayashibara · 2023-09-18T22:28:20Z

GankenKun_walkを6台で試した様子
0.25倍程度と非常に遅い
https://youtu.be/1n6tCZ5th2c

yasuohayashibara · 2023-09-18T22:38:26Z

c言語で書かれた非常にシンプルなコントローラにしても同様であった
https://youtu.be/FDsIXMfO7Wc

yasuohayashibara · 2023-09-18T22:54:05Z

チューイングをしたというのと，コントローラが別プロセスで動いているというのもあるが，strategy_devの方が処理が速い
1台のみの環境だと，2倍以上で動作する．

https://youtu.be/VnktNcgNRtE

yasuohayashibara · 2023-09-18T23:02:11Z

以下の設定の値を大きくすることで，速度が上昇
0.25 -> 0.6倍程度になる．

optimalThreadCount 10

yasuohayashibara · 2023-09-18T23:11:14Z

歩行した場合も0.5倍程度の速度となる．
ただし，スクリーンキャプチャしながらだと，速度が低下する．
https://youtu.be/_Zcngm6PI-s

yasuohayashibara · 2023-09-19T00:18:43Z

RTX4090のPCで1.5倍程度

yasuohayashibara · 2023-09-19T06:09:48Z

各ロボットにコマンドを送って歩行させる仕組みを入れた．

https://youtu.be/-T_NXYqWWvE

yasuohayashibara · 2023-09-19T13:12:03Z

RTX4090で実行したときの様子
4倍程度で実行できている．

yasuohayashibara · 2023-09-21T11:42:44Z

間を開けて前進するコマンドを与えたら直進しない．
キックのときに停止するので，対策することが必要である．

https://youtu.be/H5UZMzI1068

yasuohayashibara · 2023-09-21T20:36:49Z

歩行を調整した
１）歩行時に左右に位置がずれる現象が見られたのでその対策
２）足を上げるまでの時間を0.34sとした．（従来は0.68s）
３）停止の時間を0.34sとした．（従来は1.6s）

全てのタイミングを0.34とした．

yasuohayashibara · 2023-09-21T20:42:44Z

周期を0.34->0.32sに変更
webotsの計算の周期が0.008なので，40step分となり扱いやすいため．
挙動は0.34と変わらず

yasuohayashibara · 2023-09-26T00:08:20Z

キックも含めた動きの完成

yasuohayashibara · 2023-09-26T02:25:03Z

以下を参考に実装する

PPO
https://pettingzoo.farama.org/tutorials/sb3/kaz/

maddpg
https://pettingzoo.farama.org/main/tutorials/agilerl/MADDPG/

yasuohayashibara · 2023-09-26T14:52:21Z

学習環境の整備

yasuohayashibara · 2023-09-27T20:55:24Z

~/.local/lib/python3.8/site-packages/supersuit/vectorの
vector_constructors.pyの64行目をコメントアウト

    #vec_env = MakeCPUAsyncConstructor(num_cpus)(*vec_env_args(vec_env, num_vec_envs))

yasuohayashibara · 2023-09-29T21:51:15Z

学習できるようになった．
ロボット転倒時にプログラムが停止する問題があるようである．

yasuohayashibara · 2023-10-02T00:10:54Z

学習した結果

RTX4090

yasuohayashibara · 2023-10-03T04:23:35Z

相手側へのx軸のボールの速度を報酬としたときの様子
転倒時に最初の位置に戻す挙動を追加

Check single

yasuohayashibara · 2023-10-11T22:49:22Z

PPOを実行するときの環境設定

virtualenv -p python3.8 env
source env/bin/activate
pip install supersuit pettingzoo stable_baselines3 tensorboard control

以下のファイルを編集

code env/lib/python3.8/site-packages/supersuit/vector/vector_constructors.py

vector_constructors.pyの64行目をコメントアウト

    #vec_env = MakeCPUAsyncConstructor(num_cpus)(*vec_env_args(vec_env, num_vec_envs))

add multi_agent_env

0779180

optimalThreadCount

2c02ff6

control robot walking

511cafe

yasuohayashibara added 3 commits September 19, 2023 20:48

start to make gym

2747ffe

add step

44d1b34

control 6 robots

ec8701e

adjust walk

af06f70

period 0.32

3ad00d7

yasuohayashibara added 4 commits September 22, 2023 08:11

add kick

746062e

add play_motion.py

0bd1f8e

add kick

cba9184

6 robots

0d654a1

yasuohayashibara added 2 commits September 26, 2023 18:17

according pettingzoo

340e0d9

make environment

6f04fc1

PPO

77697c4

yasuohayashibara added 2 commits October 1, 2023 17:14

learnable

46affe6

local position

d021ca8

yasuohayashibara added 2 commits October 3, 2023 09:37

reposition

fa58c55

ball speed

740f3de

yasuohayashibara added 12 commits October 3, 2023 17:40

opposite teams obs

8765ceb

add random

e6fa329

add tensorboard

a7b2953

display reward

5c3fa60

save model

2f6e618

check single

011c106

fix goal position

7b8510b

fix parameters

7672898

add kick

ee06944

ppo

0cb523b

Merge pull request #74 from citbrains/check_single

d18d284

Check single

revert multi

cd9d33a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

multi_agent_env #73

multi_agent_env #73

yasuohayashibara commented Sep 18, 2023

yasuohayashibara commented Sep 18, 2023

yasuohayashibara commented Sep 18, 2023

yasuohayashibara commented Sep 18, 2023

yasuohayashibara commented Sep 18, 2023 •

edited

Loading

yasuohayashibara commented Sep 18, 2023

yasuohayashibara commented Sep 18, 2023

yasuohayashibara commented Sep 19, 2023

yasuohayashibara commented Sep 19, 2023

yasuohayashibara commented Sep 19, 2023

yasuohayashibara commented Sep 21, 2023

yasuohayashibara commented Sep 21, 2023

yasuohayashibara commented Sep 21, 2023

yasuohayashibara commented Sep 26, 2023

yasuohayashibara commented Sep 26, 2023

yasuohayashibara commented Sep 26, 2023

yasuohayashibara commented Sep 27, 2023 •

edited

Loading

yasuohayashibara commented Sep 29, 2023

yasuohayashibara commented Oct 2, 2023 •

edited

Loading

yasuohayashibara commented Oct 3, 2023

yasuohayashibara commented Oct 11, 2023 •

edited

Loading

multi_agent_env #73

Are you sure you want to change the base?

multi_agent_env #73

Conversation

yasuohayashibara commented Sep 18, 2023

yasuohayashibara commented Sep 18, 2023

yasuohayashibara commented Sep 18, 2023

yasuohayashibara commented Sep 18, 2023

yasuohayashibara commented Sep 18, 2023 • edited Loading

yasuohayashibara commented Sep 18, 2023

yasuohayashibara commented Sep 18, 2023

yasuohayashibara commented Sep 19, 2023

yasuohayashibara commented Sep 19, 2023

yasuohayashibara commented Sep 19, 2023

yasuohayashibara commented Sep 21, 2023

yasuohayashibara commented Sep 21, 2023

yasuohayashibara commented Sep 21, 2023

yasuohayashibara commented Sep 26, 2023

yasuohayashibara commented Sep 26, 2023

yasuohayashibara commented Sep 26, 2023

yasuohayashibara commented Sep 27, 2023 • edited Loading

yasuohayashibara commented Sep 29, 2023

yasuohayashibara commented Oct 2, 2023 • edited Loading

yasuohayashibara commented Oct 3, 2023

yasuohayashibara commented Oct 11, 2023 • edited Loading

yasuohayashibara commented Sep 18, 2023 •

edited

Loading

yasuohayashibara commented Sep 27, 2023 •

edited

Loading

yasuohayashibara commented Oct 2, 2023 •

edited

Loading

yasuohayashibara commented Oct 11, 2023 •

edited

Loading