约翰霍普金斯大学 data science toolbox 笔记

data scientists toolbox

这门课是由生物学院的老师讲授的,选择的是R语言
statistics is the science of learning from data
需要学会用google和stackoverflow, ie 学会一些hacking skill
列出了这些参考书Specialization Textbooks

Elements of Data Analytic Style by Jeff Leek
R Programming for Data Science by Roger Peng
Exploratory Data Analysis with R by Roger Peng
Report Writing for Data Science in R by Roger Peng
Statistical Inference for Data Science by Brian Caffo
Regression Modeling for Data Science in R by Brian Caffo
Developing Data Products in R by Brian Caffo
In addition, to the above books, two additional books that are highly relevant to the Specialization are

The Art of Data Science by Roger Peng
How to Be A Modern Scientist by Jeff Leek
配套参考书
the elements of data science 电子版下了一本 98 页
另外有个更全的 相关书籍的整理
安装r语言,安装interface Rstudio
安装git,安装github
基本的命令行,基本的git command
基本的hack skill ,比如说如何google/stack overflow
这里写图片描述

git add 新的文件
git add -u 更改的文件
git add -a 以上所有
git push 将文件push到github上
git checkout -b branchname 创建分支
git checkout master 回到主版本
在github网站上执行 pull request 将改变merge到其他branch

基本的markdown

哎呀 发现没有字幕也能听懂

R package 从CRAN网站上下载
available.package() 寻找package
install.package(“name”) 安装package
install.package(c(“name1”,”name2”))一次安装多个
或者Studio -》 tools -》 install
library(name)导入 R
search() 查看所有package

data science希望解决哪些问题

descriptive analyses 主要是画图?
exploratory analysis 数据之间的关系?e.g. sloan digital sky survey
inference analysis从部分样本中总结一个对全体样本的结论?
prediction analysis 比较富有挑战性
causal analysis 因果分析
mechanistic analysis

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
Review, 'A must-read resource for anyone who is serious about embracing the opportunity of big data.', -- Craig Vaughan, Global Vice President at SAP, 'This book goes beyond data analytics 101. It's the essential guide for those of us (all of us?) whose businesses are built on the ubiquity of data opportunities and the new mandate for data-driven decision-making.', --Tom Phillips, CEO of Media6Degrees and Former Head of Google Search and Analytics, 'Data is the foundation of new waves of productivity growth, innovation, and richer customer insight. Only recently viewed broadly as a source of competitive advantage, dealing well with data is rapidly becoming table stakes to stay in the game. The authors' deep applied experience makes this a must read--a window into your competitor's strategy.', -- Alan Murray, Serial Entrepreneur; Partner at Coriolis Ventures, 'This timely book says out loud what has finally become apparent: in the modern world, Data is Business, and you can no longer think business without thinking data. Read this book and you will understand the Science behind thinking data.', -- Ron Bekkerman, Chief Data Officer at Carmel Ventures, 'A great book for business managers who lead or interact with data scientists, who wish to better understand the principles and algorithms available without the technical details of single-disciplinary books.', -- Ronny Kohavi, Partner Architect at Microsoft Online Services Division, About the Author, Foster Provost is Professor and NEC Faculty Fellow at the NYU Stern School of Business where he teaches in the MBA, Business Analytics, and Data Science programs. His award-winning research is read and cited broadly. Prof. Provost has co-founded several successful companies focusing on data science for marketing., Tom Fawcett holds a Ph.D. in machine learning and has worked in industry R&D for more than two decades for companies such as GTE Laboratories, NYNEX/Verizon Labs, and HP Labs. His published work has become standar

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值