You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This happens for async RL or in general when mini_batch_size == train_batch_size - we unnecessarily do a fwd pass on the old policy model even though the policy loss may not use it.
This happens for async RL or in general when
mini_batch_size==train_batch_size- we unnecessarily do a fwd pass on the old policy model even though the policy loss may not use it.