User Behavior Paths (Part 1)
Data Cleaning
The cleaning step is implemented in HQL on Hive.
The code is as follows:
USE src_dat_zuma;

-- Drop any previous copy so the table is rebuilt from scratch.
DROP TABLE IF EXISTS 20170329_log_tycx;

-- Raw log table: every field is kept as a string, one tab-separated record per line.
CREATE TABLE IF NOT EXISTS 20170329_log_tycx (
    SERVER_DT string,
    SP_CD string,
    EVENTS string,
    S_ID string,
    U_ID string,
    TEL string,
    WEIXIN_ID string,
    EVENT_NM string,
    EVENT_PARA string,
    EXPLANATION string,
    UNIKEY string,
    NET_TP string,
    UNIKEY_STAT string,
    VST_URL string,
    SRC_URL string,
    IP string,
    Browser_INFO string
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'
STORED AS TEXTFILE;
load data inpath "oss://20170329_tycx.log" into table src_dat
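
Once data has been loaded into the table created above, a quick sanity check can confirm that the tab-delimited fields were parsed as expected. The query below is only a minimal sketch and not part of the original script. It assumes the table lives in src_dat_zuma, and the aliases total_rows and missing_u_id are hypothetical names chosen for illustration. The table name is wrapped in backticks as a precaution because it starts with a digit.

```sql
-- Sketch only: assumes 20170329_log_tycx in src_dat_zuma already holds the loaded log.
USE src_dat_zuma;

-- Count all rows and the rows whose U_ID is NULL or empty.
-- A high missing_u_id share may point to a delimiter or column-order mismatch.
SELECT
    COUNT(*) AS total_rows,
    SUM(CASE WHEN U_ID IS NULL OR U_ID = '' THEN 1 ELSE 0 END) AS missing_u_id
FROM `20170329_log_tycx`;
```

If total_rows is far from the line count of the source log, or most rows have an empty U_ID, the field delimiter in the table definition is the first thing worth rechecking.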