https://www.quora.com/What-are-the-differences-between-maximum-likelihood-and-cross-entropy-as-a-loss-function https://rdipietro.github.io/friendly-intro-to-cross-entropy-loss/