How to Make a Hierarchical Chart from Three Metrics
Hierarchical machine learning models are one top-notch trick. As discussed in previous posts, considering the natural taxonomy of the data when designing our models can be well worth our while. Instead of flattening out and ignoring those inner hierarchies, we’re able to use them, making our models smarter and more accurate.
“More accurate”, I say. Are they, though? How can we tell? We are people of science, after all, and we expect bold claims to be supported by the data. This is why we have performance metrics. Whether it’s precision, f1-score, or any other lovely metric we’ve got our eye on, if using hierarchy in our models improves their performance, the metrics should show it.
Problem is, if we use regular performance metrics — the ones designed for flat, one-level classification — we go back to ignoring that natural taxonomy of the data.
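To make that concrete, here is a minimal sketch of how a flat metric can be blind to the taxonomy. The labels, the `PARENT` mapping, and the `flat_accuracy` helper are all made up for illustration; they are not from any particular library or from the models discussed in earlier posts.

```python
# Hypothetical toy taxonomy: each leaf label maps to its parent category.
PARENT = {
    "husky": "dog",
    "poodle": "dog",
    "siamese": "cat",
}


def flat_accuracy(y_true, y_pred):
    """Fraction of exact label matches; the taxonomy is never consulted."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


y_true = ["husky", "husky"]
near_miss = ["husky", "poodle"]   # wrong breed, but still a dog
far_miss = ["husky", "siamese"]   # wrong breed and wrong parent category

acc_near = flat_accuracy(y_true, near_miss)
acc_far = flat_accuracy(y_true, far_miss)

# The taxonomy says the two mistakes are different in kind...
near_shares_parent = PARENT["poodle"] == PARENT["husky"]
far_shares_parent = PARENT["siamese"] == PARENT["husky"]

# ...but the flat metric gives both sets of predictions the same score.
print(acc_near, acc_far, near_shares_parent, far_shares_parent)
```

Both prediction sets get the same flat accuracy, even though one mistake stays inside the right branch of the taxonomy and the other does not. That gap is what a hierarchy-aware metric is meant to close.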
If we do hierarchy, let’s do it all the way. If we’ve decided to celebrate our data’s taxonomy and build our model in its image, this needs to also be a part of measuring its performance.
How do we do this? The answer lies below.
Before We Dive In
This post is about measuring the performance of machine learning models designed for hierarchical classification. It kind of assumes you know what all those words mean. If you don’t, check out my previous posts on the topic. Especially the one introducing the subject. Really. You’re gonna want to know what hierarchical classification is before learning how to measure it. That’s kind of an obvious one.