DIN, Alibaba's 2018 Ad Click-Through Rate Prediction Model: A TensorFlow 2.0 Implementation, with GitHub Code

Preface

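I recently read the paper "Deep Interest Network for Click-Through Rate Prediction", which Alibaba published at KDD 2018, and wanted to reproduce it. The open-source code on GitHub referenced by the paper targets TF 1.4 and has very few comments, so parts of it were hard for me to follow [I still have a lot to learn]. I therefore decided to write a simple reproduction of the model in TF 2.0, using the original code as a reference. If anything here is inconsistent or wrong, please point it out, thanks. [I do not have a server at the moment, so I did not run the full 50 epochs that the open-source code does.]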

Data Analysis

1. The dataset is the Amazon Dataset used in the paper. Download and decompress it:

wget -c http://snap.stanford.edu/data/amazon/productGraph/categoryFiles/reviews_Electronics_5.json.gz
gzip -d reviews_Electronics_5.json.gz
wget -c http://snap.stanford.edu/data/amazon/productGraph/categoryFiles/meta_Electronics.json.gz
gzip -d meta_Electronics.json.gz

Here reviews_Electronics_5.json contains the user behavior data, and meta_Electronics.json contains the metadata of the ads (items).
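As a rough sketch of how these two files could be read in Python, the helper below yields one dict per line. The names parse_record and iter_records are hypothetical and not taken from the original code. It assumes each file stores one record per line, and that a line which is not strict JSON is a Python dict literal (single-quoted keys), which is why it falls back to ast.literal_eval; check both assumptions against the files you actually downloaded.

```python
import ast
import json


def parse_record(line):
    """Parse one line of a dump into a dict (hypothetical helper)."""
    try:
        return json.loads(line)
    except json.JSONDecodeError:
        # Assumption: a non-strict line is a Python dict literal,
        # so it can be evaluated safely as a literal.
        return ast.literal_eval(line)


def iter_records(path):
    """Yield one dict per non-empty line of the file at `path`."""
    with open(path, "r", encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:
                yield parse_record(line)
```

Reading lazily with a generator avoids holding a whole file in memory; wrap the call in list(...) only if everything is needed at once, for example list(iter_records("reviews_Electronics_5.json")).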

A single sample from reviews looks like this:

{
  "reviewerID": "A2SUAM1J3GNN3B",
  "asin": "0000013714",
  "reviewerName": "J. McDonald",
  "helpful": [2, 3],
  "reviewText": "I bought this for my husband who plays the piano.  He is having a wonderful time playing these old hymns.  The music  is at times hard to read because we think the book was published for singing from more than playing from.  Great purchase though!",
  "overall": 5.0,
  "summary": "Heavenly Highway Hymns",
  "unixReviewTime": 1252800000,
  "reviewTime": "09 13, 2009"
}

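The fields are:

  • reviewerID: user ID;
  • asin: item ID;
  • reviewerName: user name;
  • helpful: helpfulness rating of the review, e.g. 2/3 in the sample above;
  • reviewText: text of the review;
  • overall: rating given to the item;
  • summary: summary of the review;
  • unixReviewTime: timestamp of the review (Unix time);
  • reviewTime: time of the review.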
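To make the field descriptions concrete, here is a small sketch that interprets the sample record shown earlier (trimmed to the fields it uses). The function summarize_review and its output keys are hypothetical and only for illustration. It reads helpful as [helpful votes, total votes] and treats unixReviewTime as seconds since the epoch in UTC.

```python
import json
from datetime import datetime, timezone

# The sample review from above, trimmed to the fields used here.
sample = json.loads("""
{
  "reviewerID": "A2SUAM1J3GNN3B",
  "asin": "0000013714",
  "helpful": [2, 3],
  "overall": 5.0,
  "unixReviewTime": 1252800000,
  "reviewTime": "09 13, 2009"
}
""")


def summarize_review(review):
    """Extract user, item, rating, helpfulness ratio and review date."""
    helpful_votes, total_votes = review["helpful"]
    # Guard against reviews that received no votes at all.
    ratio = helpful_votes / total_votes if total_votes else None
    # Assumption: the timestamp is in seconds and interpreted in UTC.
    date = datetime.fromtimestamp(review["unixReviewTime"], tz=timezone.utc).date()
    return {
        "user": review["reviewerID"],
        "item": review["asin"],
        "rating": review["overall"],
        "helpful_ratio": ratio,
        "review_date": date.isoformat(),
    }


summary = summarize_review(sample)
print(summary["review_date"])  # 2009-09-13, matching reviewTime "09 13, 2009"
```

The date derived from unixReviewTime agrees with the reviewTime string, so either field can be used to order a user's behavior history; the integer timestamp is the more convenient one to sort on.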

A sample from meta looks like this:

{
  "asin": "0000031852",
  "title": "Girls Ballet Tutu Zebra Hot Pink",
  "price": 3.17,
  "imUrl": "http://ecx.images-amazon.com/images/I/51fAmVkTbyL._SY300_.jpg",
  "related":
  {
    "also_bought": ["B00JHONN1S", "B002BZX8Z6", "B00D2K1M3O", "0000031909", "B00613WDTQ", "B00D0WDS9A", "B00D0GCI8S", "0000031895", "B003AVKOP2", "B003AVEU6G", "B003IEDM9Q", "B002R0FA24", "B00D23MC6W", "B00D2K0PA0", "B00538F5OK", "B00CEV86I6", "B002R0FABA", "B00D10CLVW", "B003AVNY6I", "B002GZGI4E", "B001T9NUFS", "B002R0F7FE", "B00E1YRI4C", "B008UBQZKU", "B00D103F8U", "B007R2RM8W"],
    "also_viewed": ["B002BZX8Z6", "B00JHONN1S", "B008F0SU0Y", "B00D23MC6W", "B00AFDOPDA", "B00E1YRI4C", "B002GZGI4E", "B003AVKOP2", "B00D9C1WBM", "B00CEV8366", "B00CEUX0D8", "B0079ME3KU", "B00CEUWY8K", "B004FOEEHC", "0000031895", "B00BC4GY9Y", "B003XRKA7A", "B00K18LKX2", "B00EM7KAG6", "B00AMQ17JA", "B00D9C32NI", "B002C3Y6WG", "B00JLL4L5Y", "B003AVNY6I", "B008UBQZKU", "B00D0WDS9A", "B00613WDTQ", "B00538F5OK", "B005C4Y4F6", "B004LHZ1NY", "B00CPHX76U", "B00CEUWUZC", "B00IJVASUE", "B00GOR07RE", "B00J2GTM0W", "B00JHNSNSM", "B003IEDM9Q", "B00CYBU84G", "B008VV8NSQ", "B00CYBULSO"