测试数据包含以下字段
Data Fields
- fullVisitorId- A unique identifier for each user of the Google Merchandise Store.
- channelGrouping - The channel via which the user came to the Store.
- date - The date on which the user visited the Store.
- device - The specifications for the device used to access the Store.
- geoNetwork - This section contains information about the geography of the user.
- sessionId - A unique identifier for this visit to the store.
- socialEngagementType - Engagement type, either "Socially Engaged" or "Not Socially Engaged".
- totals - This section contains aggregate values across the session.
- trafficSource - This section contains information about the Traffic Source from which the session originated.
- visitId - An identifier for this session. This is part of the value usually stored as the _utmb cookie. This is only unique to the user. For a completely unique ID, you should use a combination of fullVisitorId and visitId.
- visitNumber - The session number for this user. If this is the first session, then this is set to 1.
- visitStartTime - The timestamp (expressed as POSIX time).
- hits - This row and nested fields are populated for any and all types of hits. Provides a record of all page visits.
- customDimensions - This section contains any user-level or session-level custom dimensions that are set for a session. This is a repeated field and has an entry for each dimension that is set.
- totals - This set of columns mostly includes high-level aggregate data.
print(train.describe())
date visitId visitNumber visitStartTime
count 9.036530e+05 9.036530e+05 903653.000000 9.036530e+05
mean 2.016589e+07 1.485007e+09 2.264897 1.485007e+09
std 4.697698e+03 9.022124e+06 9.283735 9.022124e+06
min 2.016080e+07 1.470035e+09 1.000000 1.470035e+09
25% 2.016103e+07 1.477561e+09 1.000000 1.477561e+09
50% 2.017011e+07 1.483949e+09 1.000000 1.483949e+09
75% 2.017042e+07 1.492759e+09 1.000000 1.492759e+09
max 2.017080e+07 1.501657e+09 395.000000 1.501657e+09