Blog (39)

[Translation] But What Is a Model?

The term model gets thrown around a lot. The word is ubiquitous to the point of lost meaning. The Wikipedia page alone shows the variety of usage of the word model, including statistics, astronomy, bi...

2020-10-11 00:11:15 436

[Translation] Beginner's Guide to R_A Beginner's Guide to Stopping Natural Language Processing

My job focuses almost exclusively on NLP. So I work with text. A lot. Text ML comes with its own challenges and tools. But one recurring problem is excessive dime...

2020-10-11 00:01:12 288

[Translation] EM Algorithm and GMM Algorithm_ML GMM EM Algorithm

GMM is a really popular clustering method you should know as a data scientist. K-means clustering is also a part of GMM. GMM can overcome the limitation of k-means clustering. In this post, ...

2020-10-10 23:52:08 571

[Translation] The Data Science of DJing

Introduction: Electronic Dance Music (EDM) is a unique genre of music in that live performances rarely involve instruments or singing. Rather, DJs excite the crowd through their mixing techniques a...

2020-10-10 23:43:05 248

[Translation] How to Automate Stock Data Extraction and Organization (Yahoo Finance API)

This past summer, my grandfather taught me the tips and tricks of investing in the stock market. We did detailed analyses of various companies, comparing profit margins, price to earnings, and other a...

2020-10-10 23:33:01 1472

[Translation] Everything from Scratch_Important Lessons Sparked in Six Months from Scratch

Getting Started: If you’re reading this article then perhaps, like me, you have just started a new tech job and are trying to leverage Spark & Databricks for big data opera...

2020-10-10 23:22:41 192

[Translation] MNIST Dataset Data_Topological Features Applied to the MNIST Dataset

Persistent homology is a fascinating mathematical tool that continues to be studied, developed, and applied. The purpose of this article is to give a friendly introduction on how to use the ...

2020-10-10 23:11:44 1010

[Translation] Python Sampling Methods_Probability Sampling Methods Explained with Python

PYTHON FOR PROBABILITY AND STATISTICS. Why do we need Sampling? Sampling is used when we try to draw a conclusion without knowing the population. Population refer...

2020-10-10 23:02:04 1206

[Translation] BERT NLP_NLP365 Day 122: NLP Paper Summary, Applying BERT to Birch Document Retrieval

INSIDE AI NLP365. Project #NLP365 (+1) is where I document my NLP learning journey every single day in 2020. Feel free to check out what I have been learning over the last 262 days...

2020-10-10 22:52:47 364

[Translation] How to Make Your Deep Learning Experiments Reproducible and Your Code Extensible

Note this is roughly based on a presentation I made back in February at the Boston Data Science Meetup Group. You can find the full slide deck here. I have also included some more recent experiences a...

2020-10-10 22:42:52 235

[Translation] HashMap from Start to Finish_Developing and Selling a Machine Learning App from Start to Finish

Getting Started: COVID-19 prediction end-to-end app. After developing and selling a Python API, I now want to expand the idea with a machine learning solution. So I deci...

2020-10-10 22:33:47 169

[Translation] Creating a Hypothesis Testing Workflow

A step-by-step guide on how to determine which hypothesis test is right for your situation. Hypothesis testing is a very common and useful form of data analysis in the world ...

2020-10-10 22:23:09 353

[Translation] Big Data End-to-End_End-to-End Data Analytics Performance

I came across an article from NVIDIA talking about their TPCx-BB benchmark results on A100. As a data scientist, I was immediately intrigued because I’m a big fan of the Transaction Processing ...

2020-10-10 22:13:25 695

[Translation] SQL Window (Analytic) Functions Explained in 4 Minutes

Table of Contents: Introduction; What are Window (Analytic) Functions?; Anatomy of a Window Function; Example of an Aggregate Function vs Window Function; Advantages of W...

2020-10-10 22:03:29 313

[Translation] Insurance Claims Kaggle_Feature Selection for the Kaggle Caravan Insurance Challenge in R

Recapping from the previous post, this post will explain the feature selection for the Kaggle caravan insurance challenge before we feed the features into machine learning algorithms (proba...

2020-10-10 21:53:56 691

[Translation] Hierarchical Data Modeling_Hierarchical Topic Modeling with the BigARTM Library

Topic modeling is a type of statistical modeling for discovering the abstract “topics” in a collection of documents. LDA (Latent Dirichlet Allocation) is one of the most popular and widely used ...

2020-10-10 21:44:40 655

[Translation] BERT Text Similarity Calculation_Calculating Document Similarity Using BERT and Other Models

Getting Started. Introduction: Document similarity is one of the most crucial problems of NLP. Finding similarity across documents is used in several domains such as recommending simi...

2020-10-10 21:35:20 7807

[Translation] Data Analysis Data Cleaning_Getting Started in Data Science by Cleaning Data

Introduction: Whenever we start doing analysis in Python, usually the first step after importing the necessary packages is to load the data into a Pandas DataFrame using for example read_c...

2020-10-10 21:25:44 221

[Translation] Data Analyst Organizational Structure_How to Plan and Organize a Data Science/Analytics Project

Conducting a data science/analytics project always takes time and has never been easy. A successful and comprehensive analytics project is way beyond coding. Instead, it involves sophisticat...

2020-10-10 21:14:55 455

[Translation] Local Outlier Factor (LOF)_Anomaly Detection with the Local Outlier Factor (LOF)

Today’s article is my 5th in a series of “bite-size” articles I am writing on different techniques used for anomaly detection. If you are interested, the following are the previous four articl...

2020-10-10 20:54:55 1374

[Translation] CPS Attribution_Imputation and Its Applications

TL;DR: Imputation is the process of inferring unknown data. It has useful applications in model validation, data preprocessing, and generative modeling. When dealing with sequential data, ...

2020-10-10 20:45:36 462

[Translation] How to Query When the Materialized View Log Is Not Materialized_On-Demand Materialized Views: A Scalable Solution for Graphs, Analysis, or Machine Learning w...

Aggregating data for graphs, analysis, portfolios, or even machine learning can be an arduous task and difficult to scale. In this article, I will go over MongoDB’s new(ish) $merge pipeli...

2020-10-10 20:36:14 190

[Translation] Gross National Product Pie Chart_Life Expectancy and GDP

This report visualizes the data of the life expectancy of the countries across the world. Also, it tries to establish a relationship between life expectancy and GDP per capita of the countrie...

2020-10-10 20:26:13 1149

[Translation] SQL Unit Testing Stored Procedures in SQL Server

Unit testing is an essential component of the database DevOps process. Its primary goal is to test the constituent parts of the database objects in order to identify any malfunctions or flaws early in...

2020-10-10 20:16:35 680

[Translation] Fastest Lookup in Python_The Fastest Way to Find an Item in a List in Python

If you want to find the first number that matches some criteria, what do you do? The easiest way is to write a loop that checks numbers one by one and returns when it finds the correct one...

2020-10-10 20:07:05 1024

[Translation] EPL2 Programming Guide_Fantasy EPL GW2 Recap and GW3 Algorithm Picks

If this is the first time you land on one of my Fantasy EPL Blogs, you might want to check out some of our original EPL blogs in my Medium archives to get familiar with how this project starte...

2020-10-10 19:56:31 1467

[Translation] Lasso Regression Ridge Regression_An Introduction to Ridge and Lasso Regression

Recently my class has been covering topics of regression and classification. We are now able to use the data that we have to make predictions, analyze the data better, and draw significant con...

2020-10-10 19:46:34 1306

[Translation] A Scalable Interactive Visualization Framework for Measuring Gender Bias in the News

Background: Over the last several months, I’ve been working at the Discourse Processing Lab at Simon Fraser University (under the leadership of Dr. Maite Taboada), where we’ve been actively develop...

2020-10-10 19:35:44 1311

[Translation] Common Interview Questions for Big Data Data Scientists_Want to Become a Data Scientist? A Simple Guide to Cracking the Data Science Interview...

Choose a job you love, and you will never have to work a day in your life. (Confucius) Introduction: An interview is a formal meeting which occurs between ...

2020-10-10 19:25:27 1120

[Translation] Difference Between ECG and PPG Heart Rate_Artifact Removal for PPG-Based Heart Rate Variability (HRV) Analysis

Artifact removal is probably the most important and (unfortunately) most overlooked step of the signal processing pipeline required to compute HRV features...

2020-10-10 19:15:34 5030

[Translation] How Sexy Is a Sexy Cigarette After Sex, Really?

I was reading an article from Pudding.cool talking about which singer has the most EMO song lyrics, rapper or band? The Pudding.cool defines EMO as sadness and fear emotion in song lyrics and rank 100...

2020-10-10 19:05:26 10167

[Translation] The Role of Dimensionality Reduction and Expansion in Prediction_Can We Predict Booking Cancellations at the Time of Booking?

As I write these lines on a sunny day from a little town on the island of Mallorca, news about lockdowns, prevention measures, social distancing, and economic recession have become common....

2020-10-10 18:54:46 370

[Translation] MLflow_MLflow Part 1: Getting Started with MLflow

Hello again friends! We’re back here with another quick tip, and because I do attempt to keep these posts quick, this is actually going to be part one in a series of tips related to MLFlow. In t...

2020-10-10 18:45:32 224

[Translation] Panda Burning Incense Source Code Analysis_Getting Started with Sports Analytics in Pandas

Sports analytics is a major subfield of data science. The advancements in data collection techniques and data analysis have made it more appealing to the teams to adapt strategies based on dat...

2020-10-10 18:34:48 493

[Translation] Machine Learning for the Caravan Insurance Challenge in R

Recapping from the previous two posts, this post will utilise machine learning algorithms to predict customers who are most likely to purchase a caravan policy based on 85 historic socio-demographic a...

2020-10-10 18:25:15 407

[Translation] Bits and Pieces of a Data Journey_A Data Science Journey

Introduction: By no means am I where I’d want to be in my career, yet I am enjoying every bit of my Journey. Previously, I stated that The Most Important Data Science Project consist of a ...

2020-10-10 18:15:57 139

[Translation] Binomial and Poisson Distributions

Binomial Distribution. What is a Binomial Distribution? It is a discrete distribution that describes the success or failure of an event, e.g. in an examination a student can either pass or fail,...

2020-10-10 18:05:50 1320

[Translation] What Is Perspective Warping: OpenCV and Python

Computer Vision: Computer vision is all abuzz now. People everywhere are working on some form of deep-learning-based computer vision projects. But before the advent of Deep Learning, image proce...

2020-10-10 17:55:28 147

[Translation] iOS Group by Time_Group by...

We’ve seen that even though PANDAS allows us to iterate over every row in a data frame, this is generally a slow way to accomplish a given task and it’s not very pandorable. For instance, if ...

2020-10-10 17:46:07 217
