There’s a problem: every player’s ping/latency is different, so you can’t easily sync up their singing once there are more than 2 players at a time. With 3 players, for example, the other 2 players reach you with slightly different latencies, so whichever delay you compensate for, at least one of them will be desynced. If their pings are similar enough, though, you can compensate using the average, and the leftover desync can potentially be small enough to be “invisible” to the ear.
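
To make the averaging idea concrete, here is a rough sketch (not anyone's actual implementation). It assumes you already have a one-way latency estimate in milliseconds for each of the other players. The names `residual_desync`, `is_masked`, and `TOLERANCE_MS` are made up for illustration, and the tolerance value is just a placeholder you would tune by ear.

```python
from statistics import mean

# Hypothetical tolerance: the largest leftover offset (in ms) we accept as
# "close enough". This number is an assumption for illustration only.
TOLERANCE_MS = 20.0


def residual_desync(latencies_ms):
    """Compensate by the average latency and return what is left over.

    latencies_ms maps a player name to that player's latency in ms.
    Returns (average latency, {player: latency - average}).
    """
    avg = mean(latencies_ms.values())
    residuals = {name: lat - avg for name, lat in latencies_ms.items()}
    return avg, residuals


def is_masked(latencies_ms, tolerance_ms=TOLERANCE_MS):
    """True if every player's leftover offset is within the tolerance."""
    _, residuals = residual_desync(latencies_ms)
    return all(abs(r) <= tolerance_ms for r in residuals.values())


# The two *other* players as seen from a third player's machine.
others = {"player_b": 40.0, "player_c": 60.0}
avg, residuals = residual_desync(others)
print(f"compensate by {avg} ms, leftover offsets: {residuals}")
print("close enough:", is_masked(others))
```

With similar pings (40 ms and 60 ms) each player ends up only 10 ms off the average, which passes the placeholder tolerance. With very different pings (say 20 ms and 120 ms) each would be 50 ms off, and averaging no longer hides the desync.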