- worked with update per trajectory without average on trajectory length
- worked with update over all trajectories with average on both trajectory length and experiences length (batch size)
- worked with update per trajectory with average on trajectory length