data science
艳艳儿
[Getting and Cleaning data] Week 4
Week 4 · Editing text variables · Regular expressions · Working with dates. More details can be found in the HTML file here. Week 4 · Editing text variables · Important points about text in data sets · Names of variab
Original · 2016-03-17 09:25:19 · 998 views · 0 comments
[Machine Learning] XGBoost
1. Introduction to XGBoost. An XGBoost model is a combination of tree structures chained "in series"; the final prediction is determined jointly by multiple trees. Model formula: $y = f(z) = \sum_{k=1}^K f_k(z)$. Model prediction: $\hat y_i = \sum_{k=1}^K \hat f_k(x_i$...
Original · 2018-03-20 17:03:21 · 360 views · 0 comments
Reshape Data Form in R
Convert long-form data to wide form, or wide-form data to long form.
Original · 2016-06-16 04:48:28 · 616 views · 0 comments
[Getting and Cleaning data] Quiz 2
Question 1 · Question 2 · Question 3 · Question 4 · Question 5. For more detail, see the HTML file here. Question 1: Register an application with the GitHub API here (github application). Access the API to get info
Original · 2016-03-14 11:52:43 · 4965 views · 1 comment
[Getting and Cleaning data] Week 2
For more detail, see the HTML file here [http://download.csdn.net/detail/comeyan/9460707]. Week 2 · Reading data from MySQL · What is MySQL? SQL is short for Structured Query Language and MySQL is the w
Original · 2016-03-14 11:44:47 · 866 views · 0 comments
[R] How to install RMySQL package on Windows
1. Installing the RMySQL package under Windows · 2. Playing with RMySQL. 1. Installing the RMySQL package under Windows: This section follows the instructions given on the following website: http://www.ahschulz.de/2013/
Reposted · 2016-03-09 04:13:10 · 952 views · 0 comments
[Getting and Cleaning data] Quiz 1
Quiz 1 · Question 1 · Question 2 · Question 3 · Question 4 · Question 5. For more detail, you can download the HTML file here. Quiz 1 · Question 1: The American Community Survey distributes downloadable data about Unit
Original · 2016-03-14 01:16:51 · 2373 views · 0 comments
[Getting and Cleaning data] Week 1
Week 1. This course follows the sequence: Raw data → Processing script → Tidy data → Data analysis → Data communication. Raw Data vs. Processed Data. Raw Data: The ori
Original · 2016-03-13 23:18:40 · 1345 views · 0 comments
[Exploratory Data Analysis] Project 1
Project summary · Review criteria · Loading the data · Making Plots · Code. Project summary: This assignment uses data from the UC Irvine Machine Learning Repository, a popular repository for machine learning dat
Original · 2016-04-01 06:21:31 · 6519 views · 0 comments
[Exploratory Data Analysis] Week 1
Principles of analysis graphics · Exploratory Graphics · Plotting · Base Plotting System · Graphics Devices. Principles of analysis graphics. Principle 1: Show comparisons. Evidence for a hypothesis is always rel
Original · 2016-04-01 06:20:19 · 1938 views · 0 comments
[Getting and Cleaning data] swirl
Manipulating Data with dplyr Package · Grouping and Chaining with dplyr package · Tidying Data with tidyr package · Type one: column headers are values, not variable names · Type two: multiple variables are stored
Original · 2016-03-17 23:41:21 · 2337 views · 0 comments
[Getting and Cleaning data] Project
Project introduction · Project code. More details can be found here. Project introduction: The purpose of this project is to demonstrate your ability to collect, work with, and clean a data set. Review criter
Original · 2016-03-18 06:12:08 · 3350 views · 0 comments
[Getting and Cleaning data] Quiz 4
Question 1 · Question 2 · Question 3 · Question 4 · Question 5. More details can be found in the HTML file here. Question 1: The American Community Survey distributes downloadable data about United States communit
Original · 2016-03-17 23:23:36 · 3306 views · 0 comments
[Getting and Cleaning data] Quiz 3
Question 1 · Question 2 · Question 3 · Question 4 · Question 5. More details can be found in the HTML file here. Question 1: The American Community Survey distributes downloadable data about United States communit
Original · 2016-03-17 23:21:17 · 1692 views · 0 comments
[Getting and Cleaning data] Week 3
Week 3 · Subsetting and Sorting · Summarizing data · Creating new variables · Reshaping data · Managing data frames with dplyr package: Introduction · Managing data frames with dplyr package: Basic tools · Merging da
Original · 2016-03-15 03:16:25 · 1167 views · 0 comments
[code] Chap 10 for ESL
Figure 10.2. First, define the function to generate simulated data.
# function for generating data for figure 10.2
gen_eq_10_2_data <- function(N = 2000, p = 10){
  X <- matrix( rnorm(N*p), ...
Original · 2018-12-13 22:25:40 · 362 views · 0 comments