对于特征分析与特征提取,主要从两个角度挖掘特征和两个维度区分目标,从而总结了四类特征。
首先从站点角度分析,每日客流变化跟前一同类型的日期大体相同,但是什么造成了波动。其次,从用户角度分析,大部分用户的行为轨迹应该具有重复性,即客流稳定性。对于两个维度,时间上,区分节假日、高峰时段带来的客流影响。空间上,区分地理位置不同、功能属性不同带来的客流影响。通过上述的两个角度和维度,总结出了四类特征,即基本特征,时间类特征,历史时间类特征和地理信息类特征。
To build the characteristics of passenger flow prediction under the framework of passenger flow machine learning prediction, it is necessary to analyze its changing characteristics and influencing factors on the basis of fully understanding the passenger flow data. Construct some characteristics with physical meaning from the original and available data, It makes its input play a certain role in the prediction of subway passenger flow.
Feature analysis overview
Generally speaking, there is no established theory and standard for feature construction, so we need to have a deep understanding and grasp of the problem. The purpose of feature construction is to build features related to the prediction target. As the basic set of subsequent feature selection, this link should build more features, and the features that affect the prediction result should be included as much as possible, so as to leave room for subsequent feature selection.
(1)Basic features: common features are those used for passenger flow forecasting in non-special cases. If the construction of the conventional features in this paper is to use the features often used in the passenger flow prediction problem in the current study and judge the passenger flow prediction based on experience Effectively select relevant passenger flow data and quantify relevant qualitative features.
(2)Time features: commuting characteristics of regular passenger flow. Include date type and peak hours. The data types include weekdays, weekends and holidays, and peak hours include peacetime, peak hours and special peak hours.
(3)Historical passenger flow features: continuous correlation of passenger flow changes. The change in passenger flow is a dynamic and continuous process in time. The state of passenger flow at any moment is the change of the state of passenger flow at the previous moment.
(4) Geographic information features: Mainly describe
the factors related to geographic information that may affect passenger flow.