DataPredict™ [Release 2.4] (Mature + Maintenance Mode) - Machine Learning And Deep Learning Library - 40+ Models + Deep Reinforcement Learning!


I wanted to try A2C for the troop agents.
I'm getting a gradient explosion/nil label value. Is there something wrong with this setup?

Above is the function that generates the Actor, below is the function that generates the Critic.

EDIT: It appears my inversedistance was becoming inf because the NPCs were inside each other (distance 0), so I was essentially dividing 1 by 0, which evaluates to inf. That inf then went into the model as an input.
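
For anyone hitting the same thing, here is a rough sketch of the kind of guard that avoids it. It is written in Python only for illustration, since my actual code is Luau and isn't reproduced here. The names `safe_inverse_distance`, `is_finite_feature_vector`, and the `EPSILON` value are made up for this example and are not part of DataPredict. Also note that Python raises an error on `1 / 0` where Luau quietly returns inf, but the clamp works the same way in both.

```python
import math

# Hypothetical minimum distance; tune it to the scale of your world units.
EPSILON = 1e-3


def safe_inverse_distance(distance, epsilon=EPSILON):
    """Return 1 / distance, with distance clamped to at least epsilon.

    Clamping keeps the feature bounded (at most 1 / epsilon) when two
    NPCs overlap and the raw distance is 0.
    """
    return 1.0 / max(distance, epsilon)


def is_finite_feature_vector(features):
    """Return True only if no value is inf, -inf, or NaN."""
    return all(math.isfinite(value) for value in features)


# Overlapping NPCs (distance 0) now give a large but finite feature.
features = [safe_inverse_distance(0.0), safe_inverse_distance(4.0)]
print(is_finite_feature_vector(features))
```

In Luau the same clamp would be `1 / math.max(distance, epsilon)`. Checking the feature vector for non-finite values before passing it to the model should also make this kind of bug much easier to spot than waiting for the gradients to blow up.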