张博208-CSDN博客

原创推荐系统中的排序学习

https://lumingdong.cn/learning-to-rank-in-recommendation-system.html

2021-03-15 11:08:22 117

转载 Beyond Predictive Models: The Causal Story Behind Hotel Booking Cancellations

Understanding why Hotel Bookings are cancelled using Microsoft DoWhy in PythonImage by authorWhy did Medium/Linkedin or any other platform recommend this post to you? More importantly, what piqued your interest and made you click on this post? Was.

2021-03-12 15:18:29 332

转载 Causal Inference: Trying to Understand the Question of Why

Why are you reading this article?Why did you choose to learn about causal inference? Why are you thinking that this is a really weird way to start an article? Who knows. A more interesting question to ask is why can we, as humans, think about and understa.

2021-03-12 15:03:33 618

原创 pandas通过loc生成新的列

pandas中一个很便捷的使用方法通过loc、iloc、ix等索引方式，这里记录一下：df.loc[条件,新增列] = 赋初始值如果新增列名为已有列名，则在原来的数据列上改变import pandas as pdimport numpy as npdata = pd.DataFrame(np.random.randint(0,100,40).reshape(10,4),columns=list('abcd'))print(data)data.loc[data.d >= 50,'.

2021-03-03 12:10:18 769 1

原创 if else 连写

def is_bad_fpd_7_handle(x): if x>7: return 1 elif x>0: return -1 elif x<=0: return 0 else : return np.NaN等价于：x=np.NaNb=1 if x > 7 else -1 if x>0 else 0 if x<=0 else np.NaN...

2021-03-02 15:38:02 289

转载 How to train a GAN model in keras?

https://medium.com/dive-into-ml-ai/using-kerass-model-fit-to-train-a-gan-model-a0f02ed6d39eIn this article, I present three different methods for training a Discriminator-generator (GAN) model using keras(v2.4.3)on a tensorflow(v2.2.0)backend. The...

2021-03-01 11:32:31 157

原创 tf.py_func()函数

tensorflow由于构建的是静态图，所以导致在tf.Session().run()之前是没有实际值的，因此，在网络搭建的时候，是不能对tensor进行判值操作的，即不能插入if…else…之类的代码。第二，相较于numpy array，Tensorflow中对tensor的操作接口灵活性并没有那么高，使得Tensorflow的灵活性减弱。在笔者使用Tensorflow的一年中积累的编程经验来看，扩展Tensorflow程序的灵活性，有一个重要的手段，就是使用tf.py_func接口。接口解析代

2021-02-26 11:19:45 129

原创 Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics

https://blog.csdn.net/cv_family_z/article/details/78749992https://blog.csdn.net/cdknight_happy/article/details/102618883https://zhuanlan.zhihu.com/p/146082763

2021-02-02 10:12:43 132 1

原创 How to use Learning Curves to Diagnose Machine Learning Model Performance

https://machinelearningmastery.com/learning-curves-for-diagnosing-machine-learning-model-performance/

2021-02-01 11:25:27 92

转载如何根据训练/验证损失曲线诊断我们的CNN

前言在关于训练神经网路的诸多技巧Tricks(完全总结版)这篇文章中，我们大概描述了大部分所有可能在训练神经网络中使用的技巧，这对如何提升神经网络的准确度是很有效的。然而在实际中，在方法几乎定型的时候，我们往往需要针对自己的任务和自己设计的神经网络进行debug才能达到不错的效果，这也就是一个不断调试不断改进的一个过程。(炼金何尝不是呢？各种配方温度时间等等的调整)那么到底如何去Debug呢？如何Debug以下的内容部分来自CS231n课程，以及汇总了自己在训练神经网络中遇到的很多

2021-02-01 11:22:33 5694 4

转载调参技巧

转自：https://zhuanlan.zhihu.com/p/56745640本期问题能否聊一聊深度学习中的调参技巧？我们主要从以下几个方面来讲.1. 深度学习中有哪些参数需要调？2. 深度学习在什么时候需要动用调参技巧？又如何调参？3. 训练网络的一般过程是什么？1. 深度学习有哪些需要们关注的参数呢？大家记住一点：需要用到调参技巧的参数都是超参数！！因此，这个问题还可以换成更专业一点：神经网络中有哪些超参数？主要从两个方面来看：和网络设计相关的参数：神经网络的.

2021-01-29 18:38:00 596

原创为什么深度神经网络验证集损失低于训练集

1. 在训练的过程中应用了正则化，但是在对验证集计算损失的时候没有采用正则化。比如在损失函数中加入了L1，L2等正则项，或者dropout。正则化会牺牲训练精度，但是可以通过提高验证集和测试集的精度防止过拟合。如果在验证集中也加入正则项，那么会改善验证集损失小于训练集损失这种情况。2. 在计算训练集的损失时，它是边训练边计算的，不是等训练完一轮（epoch）后再计算总的训练集损失的。实际上，我们的数据是一个batch一个batch的输入到模型中训练的。在一轮训练中，每训练完一个batch就计算一下该ba

2021-01-28 10:34:39 1393

转载金融风控里的WOE前的分箱一定要单调吗？

转：今天我们来讲讲一个金融风控里的“常识点”，就是那种我们习以为常但若要讲出个所以然来比较困难的点，正如标题所言：WOE前的分箱一定要单调吗？????✍️ 背景交代相信每一个在金融风控领域做过模型的人，应该对分箱满足badrate单调性有一定的认知，特别是在用逻辑回归做A卡的时候，老司机们会经常对我们说变量要满足单调性，当变量单调了，再进行WOE转换，然后作为LR的入参喂给模型，简单训练一下就收工。但作为一个合格的风控建模大师，仅仅知道这些套路还是不够的，我们需要进一步去思考一下当中的原理，或者

2021-01-11 16:38:24 2223 1

转载使用 Isotonic Regression 校准分类器

21 December 20151. 引言对有监督机器学习问题，通常的训练流程包括这样几步：先建立起模型，然后在训练集上训练模型，如果有超参数，还需要在验证集上应用交叉验证以确定超参数，总之最终会得到一个模型。在这样的流程下，不断优化模型，如果在测试集上取得了较高的准确率、召回率、F-score或者AUC后，那事情就结束了吗，模型的输出结果是符合需要的吗？这并不一定。当给定一个样本，大部分分类器能够输出该样本属于某类的分数，通常这个分数介于0到1之间，我们称之为概率，严格来讲，是后验概率，数学上

2021-01-11 15:18:55 490

转载风控模型中的概率分数校准

https://zhuanlan.zhihu.com/p/92958088

2021-01-11 15:04:26 329

转载 Netflix推荐系统模型的快速线上评估方法——Interleaving

这里是「王喆的机器学习笔记」的第十八篇文章，今天我们关注模型的评估和线上测试。有经验的算法工程师肯定非常清楚，在一个模型的开发周期中，占工作量大头的其实是特征工程和模型评估及上线的过程。在机器学习平台已经非常成熟的现在，模型结构的实现和调整反而仅仅是几行代码的事情。所以如果能够将模型评估和线上AB Test的效率提高，那一定是大大解放算法工程师效率的事情。今天这篇文章我们就介绍一下流媒体巨头Netflix的“独门线上评估秘笈”——Interleaving。周所周知，Netflix是美国的流媒体巨头，

2020-12-23 16:00:12 327

转载最全面的推荐系统评估方法介绍

编辑：子墨来源：《深度学习推荐系统》笔记，并进行补充和说明推荐系统覆盖于生活中的各个方面，无论是电商购物，还是内容咨询，都离不开它的身影，作为一名推荐算法从业者，深知做好推荐系统的必要性，那么做好推荐系统的评估就显得至关重要了，其主要体现在：推荐系统评估所采用的指标直接决定了推荐系统的优化方向是否客观合理推荐系统评估是机器学习团队与其他团队沟通合作的接口性工作推荐系统评估指标的选取直接选定了推荐系统是否符合公司的商业目标和发展愿景做好推荐系统的评估的前提是必须要

2020-12-23 15:27:41 1221

原创 LinUCB算法理解

https://blog.csdn.net/weixin_42944192/article/details/102863460

2020-12-22 16:25:53 620

原创在线优化算法 FTRL 的原理与实现

https://www.cnblogs.com/massquantity/p/12693314.htmlhttps://zhuanlan.zhihu.com/p/36410780https://blog.csdn.net/hellozhxy/article/details/82688983https://blog.csdn.net/china1000/article/details/51176654

2020-12-22 16:01:15 257

转载大数据处理中的Lambda架构和Kappa架构

首先我们来看一个典型的互联网大数据平台的架构，如下图所示：在这张架构图中，大数据平台里面向用户的在线业务处理组件用褐色标示出来，这部分是属于互联网在线应用的部分，其他蓝色的部分属于大数据相关组件，使用开源大数据产品或者自己开发相关大数据组件。你可以看到，大数据平台由上到下，可分为三个部分：数据采集、数据处理、数据输出与展示。数据采集将应用程序产生的数据和日志等同步到大数据系统中，由于数据源不同，这里的数据同步系统实际上是多个相关系统的组合。数据库同步通常用 Sqoop，日志同步可以选择

2020-12-21 17:27:39 174 1

原创 ESMM

https://zhuanlan.zhihu.com/p/42214716https://zhuanlan.zhihu.com/p/57481330https://zhuanlan.zhihu.com/p/101595226

2020-12-21 14:40:04 150

原创 DSSM

https://zhuanlan.zhihu.com/p/53326791https://blog.csdn.net/jokerxsy/article/details/107169406

2020-12-21 11:46:31 191

原创 Youtube推荐双塔模型——SBCNM

https://zhuanlan.zhihu.com/p/138551027

2020-12-21 11:45:19 945

转载 DSSM算法-计算文本相似度

导语在NLP领域，语义相似度的计算一直是个难题：搜索场景下query和Doc的语义相似度、feeds场景下Doc和Doc的语义相似度、机器翻译场景下A句子和B句子的语义相似度等等。本文通过介绍DSSM、CNN-DSSM、LSTM-DSSM等深度学习模型在计算语义相似度上的应用，希望给读者带来帮助。1. 背景以搜索引擎和搜索广告为例，最重要的也最难解决的问题是语义相似度，这里主要体现在两个方面：召回和排序。在召回时，传统的文本相似性如 BM25，无法有效发现语义类 query-Doc 结.

2020-12-21 11:35:45 726

转载推荐系统中不得不说的DSSM双塔模型

摘要：本篇主要介绍了项目中用于商业兴趣建模的DSSM双塔模型。作为推荐领域中大火的双塔模型，因为效果不错并且对工业界十分友好，所以被各大厂广泛应用于推荐系统中。通过构建user和item两个独立的子网络，将训练好的两个“塔”中的user embedding 和item embedding各自缓存到内存数据库中。线上预测的时候只需要在内存中计算相似度运算即可。DSSM双塔模型是推荐领域不中不得不会的重要模型。目录01 为什么要学习DSSM双塔模型02 DSSM模型理论知识03 推荐..

2020-12-21 11:32:16 1266 1

原创向量索引算法HNSW和NSG的比较

https://zhuanlan.zhihu.com/p/105594786?utm_source=wechat_session

2020-12-18 15:51:46 624 2

原创【Faiss】PQ和IVF介绍

https://blog.csdn.net/u013066730/article/details/106252573

2020-12-18 15:50:47 458 2

原创 DRN 模型

https://zhuanlan.zhihu.com/p/58280384https://zhuanlan.zhihu.com/p/38875317

2020-12-18 12:03:44 456

原创 DIEN模型

https://zhuanlan.zhihu.com/p/269162581https://blog.csdn.net/pearl8899/article/details/106304536https://zhuanlan.zhihu.com/p/109821378https://zhuanlan.zhihu.com/p/195705761https://www.jianshu.com/p/25446bbf0e49

2020-12-17 17:21:20 122

原创 DIN模型

https://zhuanlan.zhihu.com/p/103552262?utm_source=wechat_sessionhttps://zhuanlan.zhihu.com/p/271912732https://zhuanlan.zhihu.com/p/103092757https://zhuanlan.zhihu.com/p/139417423

2020-12-17 16:55:59 236

原创 AFM 模型

https://zhuanlan.zhihu.com/p/82299967https://zhuanlan.zhihu.com/p/94009156https://blog.csdn.net/wwwsctvcom/article/details/98038484

2020-12-16 18:11:58 399

原创 NFM理论与实践

https://zhuanlan.zhihu.com/p/92293407https://blog.csdn.net/qq_18293213/article/details/90439401

2020-12-16 17:38:37 98

原创 DeepFM

https://blog.csdn.net/maqunfi/article/details/99635620https://www.cnblogs.com/wkang/p/9881921.htmlhttps://www.jianshu.com/p/6f1c2643d31b

2020-12-16 17:32:53 66

原创 FNN模型

https://blog.csdn.net/rosefun96/article/details/108456353https://www.cnblogs.com/yinzm/p/11758595.htmlhttps://www.cnblogs.com/Jesee/archive/2004/01/13/11165309.html

2020-12-16 17:05:27 462

原创 deep&cross(DCN)

https://www.cnblogs.com/wmx24/p/10341332.htmlhttps://zhuanlan.zhihu.com/p/43364598https://zhuanlan.zhihu.com/p/140458768https://zhuanlan.zhihu.com/p/55234968

2020-12-16 16:25:29 209 1

原创 Wide&Deep理论与实践

https://www.cnblogs.com/yinzm/p/11878831.htmlhttps://www.jianshu.com/p/dbaf2d9d8c94

2020-12-16 15:59:00 82

原创推荐算法-PNN

https://blog.csdn.net/qq_18293213/article/details/90262378https://www.jianshu.com/p/be784ab4abc2https://zhuanlan.zhihu.com/p/56651241

2020-12-16 11:53:27 164

原创 Neural Collaborative Filtering (NeuralCF)

https://blog.csdn.net/qq_48314528/article/details/109400730https://blog.csdn.net/stalbo/article/details/79431662https://zhuanlan.zhihu.com/p/160158270

2020-12-16 11:28:13 171

原创 Deep Crossing理论与实践

https://www.cnblogs.com/yinzm/p/11827905.htmlhttps://www.jianshu.com/p/e1873e9a97ad

2020-12-16 11:05:28 120

转载 spark driver节点的搭建，在集群之外搭建一个节点用于提交spark程序到spark集群

好多人不知道怎么做，转载来的在集群之外搭建一个节点用于提交spark程序到spark集群说明:用于提交程序的节点ip: 192.168.1.188 spark集群Master节点ip:192.168.1.73(spark集群和hadoop集群是在一起的)1.保证该节点和集群的master节点是互通的,在该节点安装和集群同样版本的spark和hadoop程序，不需要启动,只用于提交作业时在driver端用于获取集群信息2.配置文件 core-site.xml 修改ip都改成spark集群Maste

2020-12-16 10:29:47 298

GPU-知识点资料合集

bank_conflicts coalescing

2023-08-03

Pro Go The Complete Guide -go语言学习最新书籍

Best-selling author Adam Freeman explains how to get the most from Go, starting from the basics and building up to the most advanced and sophisticated features. You will learn how Go builds on a simple and consistent type system to create a comprehensive and productive development experience that produces fast and robust applications that run across platforms 参见：https://www.amazon.com/Pro-Go-Complete-Programming-Efficient/dp/1484273540/ref=sr_1_1?crid=1K22H21ZB1EIZ&keywords=Pro+Go+The+Complete+G

2023-06-19

扩散模型讲义美国大学之一

2023-03-28

Advanced_Programming_in_the_UNIX_Environment，_3rd

Advanced_Programming_in_the_UNIX_Environment，_3rd_Edition very good book for unix user

2018-11-30

Pattern_Recognition_and_Big_Data

Pattern_Recognition_and_Big_Data 很好的资源，对于学习大数据的朋友来说

2018-09-07

图论引导中文

中文版本图论引导

2018-09-05

现代图论--------------

现代图论研究生教材适合大家学习与总结了

2018-09-05

Deep_Learning_Quick_Reference

Deep_Learning_Quick_Reference, a cookbook for deep learning

2018-09-01

Convex Optimization Algorithms

Convex Optimization Algorithms, understand convex optimization algorithms, this is good chances

2018-09-01

Guide.to.Medical.Image.Analysis.Methods.and.Algorithms

Guide.to.Medical.Image.Analysis.Methods.and.Algorithms very good book for computer vision

2018-09-01

machine learning algorithm

machine learning algorithm 想学习的可以好好学学了

2018-04-02

Hands-On Data Science and Python Machine Learning py

2018-03-27

Python Machine Learning Machine Learning and Deep Learning

Python Machine Learning Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow, 2nd Edition 很受推荐

2018-03-27

Data Structures and Algorithms Using Python and C++

Data Structures and Algorithms Using Python and C++ 数据结构与算法方面的书籍

2018-03-27

R_for_Data_Science

R_for_Data_Science_－_Import，_Tidy，_Transform，_Visualize_and_Model_Data.rar

2018-03-27

深度学习之Pytorch

国内少有的学习 pytorch的资料,适合初学者, 希望对大家有帮助,清晰版本

2018-03-27

Deep Learning with PyTorch pdf版本

Pdf 版本, 方便阅读而且操作, 如果需要代码,请到如下地址

2018-03-27

Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow

Table of Contents Giving Computers the Ability to Learn from Data Training Simple Machine Learning Algorithms for Classification A Tour of Machine Learning Classifiers Using Scikit-Learn Building Good Training Sets - Data Preprocessing Compressing Data via Dimensionality Reduction Learning Best Practices for Model Evaluation and Hyperparameter Tuning Combining Different Models for Ensemble Learning Applying Machine Learning to Sentiment Analysis Embedding a Machine Learning Model into a Web Application Predicting Continuous Target Variables with Regression Analysis Working with Unlabeled Data - Clustering Analysis Implementing a Multilayer Artificial Neural Network from Scratch Parallelizing Neural Network Training with TensorFlow Going Deeper - The Mechanics of TensorFlow Classifying Images with Deep Convolutional Neural Networks Modeling Sequential Data using Recurrent Neural Networks

2018-03-17

空空如也

TA创建的收藏夹 TA关注的收藏夹

TA关注的人

GPU-知识点资料合集

Pro Go The Complete Guide -go语言学习最新书籍

扩散模型讲义 美国大学之一

Advanced_Programming_in_the_UNIX_Environment，_3rd

Pattern_Recognition_and_Big_Data

图论引导 中文

现代图论--------------

Deep_Learning_Quick_Reference

Convex Optimization Algorithms

Guide.to.Medical.Image.Analysis.Methods.and.Algorithms

machine learning algorithm

Hands-On Data Science and Python Machine Learning py

Python Machine Learning Machine Learning and Deep Learning

Data Structures and Algorithms Using Python and C++

R_for_Data_Science

深度学习之Pytorch

Deep Learning with PyTorch pdf版本

Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow

Approximate.Dynamic.Programming.2011

计算群体智能基础

Swarm Intelligence Principles Advances and Applications

Neural_Network_Methods_in_Natural_Language_Processing

Reinforcement Learning With Open A TensorFlow and Keras Using Python.pdf

Fundamentals of Deep Learning完整非扫描版本2017

Data Wrangling with R

NLTK基础教程-用NLTK和Python库构建机器学习应用2017-06

Text Mining in Practice with R 2017.12

Text_Mining－From_Ontology_Learning_to_Automated_Text_Processing_Applications

Python Natural Language Processing最新版本

Mastering Scipy

Elegant SciPy

Tensorflow 机器学习参考手册2007

reinforcement learning An Introduction 第二版

Spark大数据处理技术 带标签 完整版

TensorFlow技术解析与实战 高清晰完整版- 2017新书

TENSORFLOW深度学习

模式分类11

集体编程智慧

敏捷软件开发：原则、模式与实践

面向对象方法原理与实践

空空如也

扩散模型讲义美国大学之一

图论引导中文

Spark大数据处理技术带标签完整版

TensorFlow技术解析与实战高清晰完整版- 2017新书