Three Data Types in Data Analysis
Nominal data consists of categories with no natural order or rank, whereas ordinal data consists of categories with a predetermined or natural order. Numerical (quantitative) data, by contrast, is always a number that can be measured.
In the example mentioned earlier, the presence or absence of pain would be considered nominal data, while the severity of pain, represented by categories such as none, mild, moderate, or severe, would be ordinal data. If pain were measured on a Visual Analog Scale from 0 to 100, where 100 is twice as painful as 50, this would be numerical data.
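To make the distinction concrete, here is a minimal Python sketch of the pain example. The patient records and the variable names are invented for illustration and do not come from any real dataset. The point is only that each data type supports different operations: counting for nominal data, ranking for ordinal data, and arithmetic for numerical data.

```python
from collections import Counter
from statistics import mean

# Hypothetical records for five patients (illustrative values only).
has_pain = ["yes", "no", "yes", "yes", "no"]               # nominal
severity = ["mild", "none", "severe", "moderate", "none"]  # ordinal
vas_score = [20, 0, 90, 55, 0]                             # numerical (0-100 scale)

# Nominal: categories have no order, so counting is the meaningful operation.
pain_counts = Counter(has_pain)

# Ordinal: categories can be ranked, so a median category is meaningful.
SEVERITY_ORDER = ["none", "mild", "moderate", "severe"]
ranks = sorted(SEVERITY_ORDER.index(s) for s in severity)
median_severity = SEVERITY_ORDER[ranks[len(ranks) // 2]]

# Numerical: values are measured numbers, so arithmetic such as the mean applies.
mean_vas = mean(vas_score)

print(pain_counts, median_severity, mean_vas)
```

Averaging the severity labels directly would not be meaningful, because the distance between "mild" and "moderate" is not defined; only their order is.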
Whether a variable is typed semantically or non-semantically depends on whether the meaning of the variable name itself is understood. Take the Gender attribute in the following table. If the meaning of Gender is not understood, its values 1 and 2 look like ordinary numbers, and the system defaults to numerical data. Once the semantics are understood, Gender denotes a category and 1 and 2 are simply codes for two categories, so the type is adjusted to nominal data.
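The Gender case can be sketched in a few lines of Python. This is a hypothetical illustration, not the behavior of any particular tool: `infer_type`, the sample table, and the `semantic_overrides` mapping are all assumed names and values. The sketch first guesses a type from the values alone, then applies a correction supplied by someone who knows what the column means.

```python
def infer_type(values):
    """Non-semantic guess: inspect only the values, never the column name."""
    if all(isinstance(v, (int, float)) and not isinstance(v, bool) for v in values):
        return "numerical"
    return "nominal"

# Hypothetical table: Gender is stored as the codes 1 and 2.
table = {
    "Gender": [1, 2, 2, 1, 2],
    "Age": [34, 51, 28, 45, 39],
}

# Without semantics, both columns look numerical.
inferred = {name: infer_type(column) for name, column in table.items()}

# With semantics, Gender is known to be a category, so its type is overridden.
semantic_overrides = {"Gender": "nominal"}
adjusted = {**inferred, **semantic_overrides}

print(inferred)
print(adjusted)
```

After the adjustment, operations such as averaging the Gender codes are no longer appropriate, while counting how often each code occurs still is.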
Note: Dates are numeric data by default!