I wanted to try A2C for the troop agents
I get a gradient explosion/nil label value, is there something wrong with this setup?
Above is the function that generates the Actor, below is the function that generates the Critic.
EDIT: it appears my inversedistance was becoming inf due to the fact that the npcs were inside eachother and essentially I was dividing 1 by 0 which is technically inf.