How to Learn Python for Data Science in 5 Steps

Why Learn Python for Data Science?

Before we explore how to learn Python for data science, we should briefly answer why you should learn Python in the first place.

In short, understanding Python is one of the valuable skills you need for a data science career.

Though it hasn’t always been, Python is now the programming language of choice for data science. Here’s a brief history:

  • In 2016, it overtook R on Kaggle, the premier platform for data science competitions.
  • In 2017, it overtook R in KDnuggets’ annual poll of data scientists’ most used tools.
  • In 2018, 66% of data scientists reported using Python daily, making it the number one tool for analytics professionals.

Data science experts expect this trend to continue as the Python ecosystem keeps developing. And while your journey to learn Python programming may be just beginning, it’s nice to know that employment opportunities are abundant (and growing) as well.

According to Indeed, the average salary for a Data Scientist is $127,918.

The good news? That number is only expected to increase. The experts at IBM predicted a 28% increase in demand for data scientists by the year 2020.

So, the future is bright for data science, and Python is just one piece of the proverbial pie. Fortunately, learning Python and other programming fundamentals is as attainable as ever. We’ll show you how in five simple steps.

But remember: just because the steps are simple doesn’t mean you won’t have to put in the work. If you apply yourself and dedicate meaningful time to learning Python, you can not only pick up a new skill but also bring your career to a new level.

How to Learn Python for Data Science

First, you’ll want to find the right course to help you learn Python programming. Dataquest’s courses are specifically designed for you to learn Python for data science at your own pace.

In addition to learning Python in a course setting, your journey to becoming a data scientist should also include soft skills. Plus, there are some complementary technical skills we recommend you learn along the way.

Step 1: Learn Python Fundamentals

Everyone starts somewhere. This first step is where you’ll learn Python programming basics. You’ll also want an introduction to data science.

One of the important tools you should start using early in your journey is Jupyter Notebook, which comes prepackaged with Python libraries to help you learn these two things.

Kickstart your learning by: Joining a community

By joining a community, you’ll surround yourself with like-minded people and increase your opportunities for employment. According to the Society for Human Resource Management, employee referrals account for 30% of all hires.

Create a Kaggle account, join a local Meetup group, and participate in Dataquest’s members-only Slack discussions with current students and alums.

Related skills: Try the Command Line Interface

The Command Line Interface (CLI) lets you run scripts more quickly, allowing you to test programs faster and work with more data.
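To make the CLI idea concrete, here is a minimal sketch of a script meant to be run from a terminal. The file name `sum_cli.py`, the `--round` option, and the function names are all hypothetical choices for illustration; only the standard-library `argparse` module is used.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Describe the arguments this (hypothetical) script accepts."""
    parser = argparse.ArgumentParser(description="Add up numbers given on the command line.")
    # Zero or more numbers; with none given, the sum is simply 0.
    parser.add_argument("numbers", nargs="*", type=float, help="numbers to add")
    parser.add_argument("--round", dest="digits", type=int, default=2,
                        help="decimal places to keep (default: 2)")
    return parser


def main(argv=None) -> float:
    """Parse the arguments, print the rounded total, and return it."""
    args = build_parser().parse_args(argv)
    total = round(sum(args.numbers), args.digits)
    print(total)
    return total


if __name__ == "__main__":
    main()
```

Saved as `sum_cli.py`, it could be run as `python sum_cli.py 1.5 2.25 3`, which prints 6.75. Rerunning a script with different arguments like this is often quicker than editing values inside the code.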

Step 2: Practice Mini Python Projects

We truly believe in hands-on learning. You may be surprised by how soon you’ll be ready to build small Python projects.

Try programming things like calculators for an online game, or a program that fetches the weather in your city from Google. Building mini projects like these will help you learn Python. Programming projects like these are standard for all languages, and a great way to solidify your understanding of the basics.
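As one possible starting point for a calculator project, the sketch below evaluates simple expressions such as "6 * 7". The input format (two numbers separated by an operator and spaces) and the names `OPERATIONS` and `calculate` are assumptions made for this example, not part of any course material.

```python
import operator

# Map each supported symbol to the function that implements it.
OPERATIONS = {
    "+": operator.add,
    "-": operator.sub,
    "*": operator.mul,
    "/": operator.truediv,
}


def calculate(expression: str) -> float:
    """Evaluate an expression of the form '<number> <operator> <number>'."""
    parts = expression.split()
    if len(parts) != 3:
        raise ValueError("expected input like '6 * 7'")
    left, symbol, right = parts
    if symbol not in OPERATIONS:
        raise ValueError(f"unsupported operator: {symbol}")
    return OPERATIONS[symbol](float(left), float(right))


print(calculate("6 * 7"))
print(calculate("1 / 4"))
```

A natural next exercise is to extend it: read input in a loop, handle division by zero gracefully, or add operators specific to the game you have in mind.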

You should start to build your experience with APIs and begin web scraping. Beyond helping you learn Python programming, web scraping will be useful for gathering data later.
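The core of web scraping is pulling structured pieces out of HTML. The sketch below does that with the standard-library `html.parser` module on a small inline page, so it runs without a network connection. The sample HTML and the names `LinkCollector` and `extract_links` are made up for illustration; real projects often fetch pages first and use third-party libraries such as requests and Beautiful Soup, which the article does not cover.

```python
from html.parser import HTMLParser

# A tiny stand-in for a downloaded page (hypothetical content).
SAMPLE_HTML = """
<html><body>
  <h1>Forecast</h1>
  <a href="/today">Today</a>
  <a href="/week">This week</a>
  <a>No link here</a>
</body></html>
"""


class LinkCollector(HTMLParser):
    """Collect the href attribute of every <a> tag that has one."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href is not None:
                self.links.append(href)


def extract_links(html: str) -> list:
    collector = LinkCollector()
    collector.feed(html)
    return collector.links


print(extract_links(SAMPLE_HTML))
```

Before scraping a real site, check whether it offers an API and what its terms of use allow; an API usually returns cleaner data than parsed HTML.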

Kickstart your learning by: Reading

Enhance your coursework and find answers to the Python programming challenges you encounter. Read guidebooks, blog posts, and even other people’s open source code to learn Python and data science best practices, and to get new ideas.

Automate the Boring Stuff with Python by Al Sweigart is an excellent and entertaining resource.

Related skills: Work with databases using SQL

SQL is used to talk to databases to alter, edit, and reorganize information. SQL is a staple in the data science community, as 40% of data scientists report consistently using it.*
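A low-friction way to practice SQL from Python is the standard-library `sqlite3` module with an in-memory database. The sketch below creates a table, inserts rows, updates one, and queries the result. The table, the names, and the scores are invented for this example.

```python
import sqlite3


def top_scores(min_score: int) -> list:
    """Build a small in-memory table, edit it, and query it with SQL."""
    conn = sqlite3.connect(":memory:")  # nothing is written to disk
    conn.execute("CREATE TABLE scores (name TEXT, score INTEGER)")
    conn.executemany(
        "INSERT INTO scores VALUES (?, ?)",
        [("ana", 91), ("ben", 78), ("cy", 85)],
    )
    # Alter existing information: give ben five bonus points.
    conn.execute("UPDATE scores SET score = score + 5 WHERE name = ?", ("ben",))
    # Reorganize: filter and sort the rows.
    rows = conn.execute(
        "SELECT name, score FROM scores WHERE score >= ? ORDER BY score DESC",
        (min_score,),
    ).fetchall()
    conn.close()
    return rows


print(top_scores(80))
```

The `?` placeholders let the database driver fill in values safely, which is a good habit to adopt from the start instead of building SQL strings by hand.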

Step 3: Learn Python Data Science Libraries

Unlike some other programming languages, in Python, there is generally a best way of doing something. The three best and most important Python libraries for data science are NumPy, Pandas, and Matplotlib.

NumPy and Pandas are great for exploring and playing with data. Matplotlib is a data visualization library that makes graphs like the ones you’d find in Excel or Google Sheets.
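To give a feel for how the three libraries fit together, here is a small sketch: NumPy holds the raw numbers, Pandas groups them by label, and Matplotlib draws the result. It assumes all three libraries are installed; the temperature data, the column names, and the `plot_by_city` helper are invented for illustration.

```python
import numpy as np
import pandas as pd

# NumPy: a fast array of numbers (made-up temperatures in Celsius).
temps = np.array([21.0, 23.5, 19.0, 25.5])
mean_temp = temps.mean()

# Pandas: the same numbers with labels, ready for grouping.
df = pd.DataFrame({"city": ["Oslo", "Lima", "Oslo", "Lima"], "temp_c": temps})
by_city = df.groupby("city")["temp_c"].mean()


def plot_by_city(series, path="by_city.png"):
    """Matplotlib: save a bar chart of the grouped means to an image file."""
    import matplotlib
    matplotlib.use("Agg")  # render to a file; no display window needed
    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    series.plot(kind="bar", ax=ax)
    ax.set_ylabel("Mean temperature (°C)")
    fig.savefig(path)
    plt.close(fig)


print(mean_temp)
print(by_city)
```

Calling `plot_by_city(by_city)` writes the chart to `by_city.png`. In a Jupyter Notebook you would more commonly let the plot display inline instead of saving it.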

Kickstart your learning by: Asking questions

You don’t know what you don’t know!

Python has a rich community of experts who are eager to help you learn Python. Resources like Quora, Stack Overflow, and Dataquest’s Slack are full of people excited to share their knowledge and help you learn Python programming. We also have an FAQ for each mission to help with questions you encounter throughout your programming courses with Dataquest.

Related skills: Use Git for version control

Git is a popular tool that helps you keep track of changes made to your code, which makes it much easier to correct mistakes, experiment, and collaborate with others.

Step 4: Build a Data Science Portfolio as You Learn Python

For aspiring data scientists, a portfolio is a must.

The projects in your portfolio should include several different datasets and should leave readers with interesting insights that you’ve gleaned. Your portfolio doesn’t need a particular theme; find datasets that interest you, then come up with a way to put them together.

Displaying projects like these gives fellow data scientists something to collaborate on and shows future employers that you’ve truly taken the time to learn Python and other important programming skills.

One of the nice things about data science is that your portfolio doubles as a resume while highlighting the skills you’ve learned, like Python programming.

Kickstart your learning by: Communicating, collaborating, and focusing on technical competence

During this time, you’ll want to cultivate the soft skills required to work with others, while also making sure you really understand the inner workings of the tools you’re using.

Related skills: Learn beginner and intermediate statistics

While learning Python for data science, you’ll also want to get a solid background in statistics. Understanding statistics will give you the mindset you need to focus on the right things, so you’ll find valuable insights (and real solutions) rather than just executing code.
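A short example of why the statistical mindset matters: the same data can tell different stories depending on which summary you pick. The sketch below uses the standard-library `statistics` module on made-up salary figures with one extreme value.

```python
import statistics

# Hypothetical salaries; the last one is an outlier.
salaries = [52_000, 55_000, 58_000, 61_000, 250_000]

mean_salary = statistics.mean(salaries)      # pulled upward by the outlier
median_salary = statistics.median(salaries)  # the middle value, robust to it
spread = statistics.stdev(salaries)          # sample standard deviation

print("mean:", mean_salary)
print("median:", median_salary)
print("stdev:", round(spread, 2))
```

Here the mean (95,200) is far above what four of the five people earn, while the median (58,000) is not. Knowing which summary to trust for a given question is exactly the kind of judgment that code alone does not provide.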

Step 5: Apply Advanced Data Science Techniques

Finally, aim to sharpen your skills. Your data science journey will be full of constant learning, but there are advanced courses you can complete to ensure you’ve covered all the bases.

You’ll want to be comfortable with regression, classification, and k-means clustering models. You can also step into machine learning: bootstrapping models and creating neural networks using scikit-learn.
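To show what a regression model actually computes, here is a minimal sketch of simple linear regression by ordinary least squares, written in plain Python. It is deliberately not scikit-learn: in practice you would likely reach for that library, but working through the arithmetic once makes its output easier to interpret. The function names and the data points are invented for this example.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (unnormalized).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(x, slope, intercept):
    """Apply the fitted line to a new input."""
    return slope * x + intercept


# Made-up data that lies exactly on y = 2x + 1.
hours = [1, 2, 3, 4]
scores = [3, 5, 7, 9]

slope, intercept = fit_line(hours, scores)
print(slope, intercept)
print(predict(5, slope, intercept))
```

With real, noisy data the points will not fall exactly on a line, and the fitted slope and intercept become estimates whose quality you need statistics to judge.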

At this point, programming projects can include creating models using live data feeds. Machine learning models of this kind adjust their predictions over time.
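The idea of a model that adjusts as data arrives can be illustrated with the simplest possible case: a running mean that is updated one observation at a time, without storing past values. This is only a stand-in for a real online machine learning model, and the class name and the sample feed are assumptions made for this sketch.

```python
class RunningMean:
    """Predict the next value as the mean of everything seen so far."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0

    def update(self, value: float) -> float:
        """Fold one new observation into the estimate and return it."""
        self.count += 1
        # Move the estimate toward the new value by 1/count of the gap.
        self.mean += (value - self.mean) / self.count
        return self.mean


model = RunningMean()
feed = [10, 20, 30]  # stand-in for values arriving from a live feed
estimates = [model.update(value) for value in feed]
print(estimates)
```

Each new value shifts the prediction a little, which is the same pattern that more capable online models follow with more parameters.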

Remember to: Keep learning!

Data science is an ever-growing field that spans numerous industries.

At the rate that demand is increasing, opportunities to learn are growing quickly. Continue reading, collaborating, and conversing with others, and you’re sure to maintain interest and a competitive edge over time.

How Long Will It Take to Learn Python?

After reading these steps, the most common question people ask us is: “How long does all this take?”

There are a lot of estimates for the time it takes to learn Python. For data science specifically, estimates range from 3 months to a year of consistent practice.

We’ve watched some people move through our courses at lightning speed and others take them much more slowly.

Really, it all depends on your desired timeline, the free time you can dedicate to learning Python programming, and the pace at which you learn.

Dataquest’s courses are created for you to go at your own speed. Each path is full of missions, hands-on learning, and opportunities to ask questions so that you can get an in-depth mastery of data science fundamentals.

Get started for free. Learn Python with our Data Scientist path and start mastering a new skill today.

Resources and studies cited:

Translated from: https://www.pybloggers.com/2019/02/how-to-learn-python-for-data-science-in-5-steps/
