[RLlib] Use config
(not self.config
) in Learner.compute_loss_for_module
to prepare these for multi-agent-capability.
#45053
Loading