数据科学作为一个新兴领域,植根于其他学科:统计推断、算法、统计模型、机器学习、实验设计、优化理论、概率论、人工智能、数据可视化和探索性数据分析等。每门学科都值得花好几门课或好几本书专门讲解,这正是写作本书面临的一个极大挑战,因此我们将这些补充阅读材料附在后面,希望读者在需要时可以参考。
数学
- Linear Algebra and Its Applications,Gilbert Strang著(Cengage Learning)
- Convex Optimization,Stephen Boyd和Lieven Vendenberghe著(Cambridge University Press)
- A First Course in Probability(Pearson)、Introduction to Probability Models(Academic Press),Sheldon Ross著
编程
- R in a Nutshell,Joseph Adler著(O'Reilly)
- Learning Python,Mark Lutz和David Ascher著(O'Reilly)
- R for Everyone: Advanced Analytics and Graphics,Jared Lander著(Addison-Wesley)
- The Art of R Programming: A Tour of Statistical Software Design,Norman Matloff著(No Starch Press)
- Python for Data Analysis,Wes McKinney著(O'Reilly)
数据分析与统计推断
- Statistical Inference,George Casella和Roger L. Berger著(Cengage Learning)
- Bayesian Data Analysis,Andrew Gelman等著(Chapman & Hall)
- Data Analysis Using Regression and Multilevel/Hierarchical Models,Andrew Gelman和Jennifer Hill著(Cambridge University Press)
- Advanced Data Analysis from an Elementary Point of View(http://goo.gl/udICRX),Cosma Shalizi著(Cambridge University Press)
- The Elements of Statistical Learning: Data Mining, Inference and Prediction,Trevor Hastie、Robert Tibshirani和Jerome Friedman著(Springer)
人工智能和机器学习
- Pattern Recognition and Machine Learning,Christopher Bishop著(Springer)
- Bayesian Reasoning and Machine Learning,David Barber著(Cambridge University Press)
- Programming Collective Intelligence,Toby Segaran著(O'Reilly)
- Artificial Intelligence: A Modern Approach,Stuart Russell和Peter Norvig著(Prentice Hall)
- Foundations of Machine Learning,Mehryar Mohri、Afshin Rostamizadeh和Ameet Talwalkar著(MIT Press)
- Introduction to Machine Learning (Adaptive Computation and Machine Learning),Ethem Alpaydim著(MIT Press)
实验设计
- Field Experiments,Alan S. Gerber和Donald P. Green著(Norton)
- Statistics for Experimenters: Design, Innovation, and Discovery,George E. P. Box等著(Wiley-Interscience)
可视化
- The Elements of Graphing Data,William Cleveland著(Hobart Press)
- Visualize This: The FlowingData Guide to Design,Visualization, and Statistics,Nathan Yau著(Wiley)
转载自:图灵社区
摘自:Doing Data Science by Rachel Schutt and Cathy O'Neil (O'Reilly). Copyright 2014 Rachel Schutt and Cathy O'Neil, 978-1-449-35865-5