先不做分类了, 等多了再分类. MDN A Hitchhiker’s Guide to Mixture Density NetworksMixture Density Networks with TensorFlow NLP BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding