by Dan Stern

Teach Yourself Data Science: the learning path I used to get an analytics job at Jet.com

How can you go from zero programming skills to a job in technology or analytics?

If you’re interested in learning these skills, whether for fun or for a career change, what’s the best way to go about it?

Countless lists of the best online courses exist, but how can you forge your own learning path with all of the noise?

I personally never thought I’d learn any practical skills around programming, data analysis, machine learning, or technology in general. As a finance major, I always assumed I’d be the “business guy.” Yet somehow, I taught myself Python and SQL, and found myself working in analytics at Jet.com, using one of these languages every day.

Why Python and SQL, you might ask?

Python is one of the fastest-growing programming languages out there, and for good reason. It has an enormous number of libraries that you can use for machine learning applications, data analysis, visualization, web apps, API integrations, and much more. Plus, it’s one of the easier languages to pick up. As for SQL, databases power technology companies, and SQL allows you to better understand, explore, and make use of the troves of data they collect.

Below, I outline the path I took in learning these languages that brought me into analytics. To be clear, this path was incredibly challenging; I spent countless evenings feeling frustrated and confused. Many nights I wanted to just throw in the towel and settle for being the business guy.

But your motivation remains the key to pushing forward through the obstacles you’ll inevitably face. Whether you want to move into a data analysis or data science type role, or just want to have a better grasp on programming and technology for the fun of it (which it does become fun!), you have to figure out how to stay motivated and disciplined if you want to actually learn these skills.

For me, setting aside specific amounts of time almost every day (about 90 minutes to 2 hours) to learn or practice immediately after I got home from work allowed me to develop consistent habits and hammer home concepts I found confusing.

Here’s the path that I took; hopefully it can help you get started on your own.

The Core Foundation

1. Learn Python the Hard Way

This is one of the best courses I’ve ever taken, period. It’s self-directed and challenging, but the author, Zed Shaw, provides enough detail and guidance for you to actually begin programming in Python. He makes programming feel accessible, and week after week the material gives you the confidence that you can effectively learn Python.

2. Mode Analytics: Pandas

Mode Analytics provides an awesome introduction to Python and includes tutorials on one of its most powerful data structures: the Pandas DataFrame. This is perfect for learning the basics of data analysis once you have the fundamentals of Python down.

3. Mode Analytics: SQL

The other Mode Analytics tutorial on SQL is fantastic too. You can learn all of the key concepts and create a strong SQL foundation here. They even have their own SQL editor and data you can play around with.

Alongside Mode Analytics, W3Schools can help answer most SQL questions that come up as you make your way through the tutorials.
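
To give a feel for the kind of query these SQL tutorials build toward, here is a minimal sketch that uses Python's built-in `sqlite3` module, so both languages appear together. The `orders` table, its columns, and the sample rows are all made up for illustration; they do not come from Mode Analytics or any real dataset.

```python
import sqlite3

# Hypothetical sample data: an in-memory database with a tiny "orders" table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, category TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [
        ("ana", "grocery", 20.0),
        ("ana", "home", 35.0),
        ("ben", "grocery", 12.5),
        ("cho", "grocery", 40.0),
        ("cho", "home", 10.0),
    ],
)

# Total spend per customer, keeping only customers who spent more than 30,
# sorted from the largest total to the smallest.
query = """
    SELECT customer, SUM(amount) AS total
    FROM orders
    GROUP BY customer
    HAVING SUM(amount) > 30
    ORDER BY total DESC
"""
totals = conn.execute(query).fetchall()

for customer, total in totals:
    print(customer, total)

conn.close()
```

The same `SELECT ... GROUP BY ... HAVING ... ORDER BY` pattern should carry over to most SQL databases, although the details of connecting to them will differ.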

Diving Right Into Machine Learning

Before I fully had a strong grasp of Python, I took a shot and applied for Udacity’s self-driving car nanodegree. I knew it was completely over my head, but I thought, why not try?

It’s easier to motivate yourself to learn Python and machine learning when you’re fascinated by the practical applications.

I had about a month before the class began, so I took as many classes around data science and machine learning as possible.

Here were the best free introductory courses I found that were incredibly helpful:

Yes, you can see I think quite highly of Udacity.

While not free, I’d also highly recommend checking out the Grokking Deep Learning book. It provides extremely clear and relatable examples on the fundamentals of machine learning.

TensorFlow, developed by Google, is an open source library for machine learning that you can use from Python. It’s incredibly powerful, and absolutely worth becoming familiar with.

Check out the MNIST exercise for a fantastic introduction to the framework.

I found the Stanford CS231n class to be a useful resource too; it covers convolutional neural networks (the kind of model used in image and facial recognition software) extensively, which I had read would be incredibly helpful for the self-driving car Nanodegree. If you’re at all interested in using machine learning with images or video, you won’t find much better than this course.

Finally, after using these resources to build a solid foundation, I began the Udacity Self Driving Car Nanodegree.

I’m not going to talk about it too much since there are already great write ups of the course here and here. What I will say is that, to my own shock, despite being the most challenging course I’ve ever taken, I was able to understand most of the content. Armed with the right base knowledge, you’d be surprised at how deep your understanding of a complex topic can be.

Continued Analytics and Data Science Learning

After diving intensely into machine learning for a few months, it was helpful to take a step back and reinforce my understanding of practical analytics and data science principles.

I started with Data Science, Deep Learning, & Machine Learning with Python, a fantastic course on Udemy. While touching upon machine learning, it completely covers principles in analytics, data science, and statistics, particularly around different data mining techniques and practical scenarios to deploy them.

The book Data Science for Business also explains incredibly well how and why certain models work when solving problems in a specific context; it hammers into you an analytical framework and mindset that can be applied to any situation revolving around data problems. It’s the best resource I’ve found for connecting different analytical approaches to specific business situations and problems.

Of course, if you’re interested in pursuing a career in analytics or data science, you should always be honing old skills or adding new skills into your toolkit. FreeCodeCamp and Hackernoon publish informative articles and tutorials on all things data science and software engineering. My favorite article recently was a well-written tutorial on writing your own blockchain.

You want to know the best way to continue learning though?

Build something. Anything. Explore a dataset. Find a practical problem that you or your company faces, and try to solve it.

Even if you don’t have access to high-quality data at your company, there are plenty of open source datasets that you can play around and practice with. I bet you’ll learn just as much, if not more, from working on your own data projects as from taking any course or reading any book.

Finally, meeting and learning from people who have the skills you want to acquire is hugely beneficial. I highly recommend using Meetup to find groups of analytics or software professionals in your area. Many of these groups run free tutorials or study sessions, and you’ll meet plenty of insanely smart people who can share tips and tricks to accelerate your learning.

In New York City, some of the groups that have helped me tremendously are:

Have fun learning, and let me know how your own journey goes!

— — — — — — — — — — — — — — — — — — — — — — — — — — — — — — —

UPDATE: Udacity just released a new Data Scientist Nanodegree program. I’ve looked through the materials, and it looks like an incredibly useful resource! The projects include building a recommendation engine with IBM data, and classifying customers into segments. I haven’t taken it yet, but check it out here: Data Scientist Nanodegree program.

Translated from: https://www.freecodecamp.org/news/a-path-for-you-to-learn-analytics-and-data-skills-bd48ccde7325/
