github星最高_哪些语言在2016年获得GitHub最高星?

github星最高

by Jose Aguinaga

通过Jose Aguinaga

哪些语言在2016年获得GitHub最高星? (Which languages got the most GitHub stars in 2016?)

A few weeks ago I decided to build an application to find which programming languages I star the most in GitHub.

几周前,我决定构建一个应用程序,以查找我在GitHub中使用最多的编程语言。

Why? Because lately I had been starring projects about Machine Learning, Data Science, and Artificial Intelligence. I wanted to see whether my increased interest would show up in my starred projects timeline in any way. And what better way to discover this by using a little bit of Data Science on its own?

为什么? 因为最近我一直在主持有关机器学习,数据科学人工智能的项目。 我想看看我增加的兴趣是否会以任何方式出现在我加星标的项目时间表中。 还有什么更好的方法通过单独使用一点数据科学来发现这一点?

The experiment consisted in obtaining the information from GitHub, cleaning it up, and displaying it in a visualization. To try it yourself, go to the following webpage.

该实验包括从GitHub获取信息,对其进行清理并将其以可视化方式显示。 要自己尝试,请访问以下网页。

What languages got the most GitHub stars in 2016?A streamgraph of github starred languages on 2016.starred.jjperezaguinaga.com

哪些语言在2016年获得GitHub最多的星? github上已加注星标语言的流图于2016年 。starred.jjperezaguinaga.com

After trying it yourself, give me a moment to explain how it works and show you some interesting examples.

自己尝试之后,请让我花一点时间解释它的工作原理,并向您展示一些有趣的示例。

检索和分析数据 (Retrieving and analyzing the data)

For better or worse, GitHub doesn’t provide an easy way to consume this information. You need to go through all your starred projects on github.com, then click through many pages to find them all. Depending in how many repositories you have starred, it would take you a few minutes before you can see all the projects across a specific timespan.

不管是好是坏,GitHub都不提供一种简单的方式来使用此信息。 您需要浏览github.com上所有已加星标的项目,然后单击许多页面以查找全部。 根据您已加注星标的存储库的数量,您可能需要花费几分钟,才能看到特定时间范围内的所有项目。

The good news are that GitHub has a starring activity API, which I then used to write a JavaScript utility to fetch all my starred projects through the year. GitHub allows you to pass a flag to see the date when you first starred a project, which let me get only the projects I starred in 2016.

好消息是,GitHub有一个starring活动API ,然后我用它编写了一个JavaScript实用程序,以获取一年中我所有已加星标的项目。 GitHub允许您传递标记以查看您首次加星标的日期,这让我仅获得我在2016年加星标的项目。

With the data retrieved, I proceeded to filter it based on the language GitHub has assigned to them. Ramda was particularly useful to map and reduce this data.

检索到数据后,我便根据GitHub分配给他们的语言对其进行过滤。 Ramda对映射和减少此数据特别有用。

Then, to visualize this information, I decided to display the frequency of each repository programming language through a chart known as a streamgraph. Aggregating each language instance per month, I could see the increase and decrease of interest over time.

然后,可视化该信息,我决定通过已知为一图表,显示每个存储库的编程语言的频率streamgraph 每月汇总每个语言实例,我可以看到随着时间的推移兴趣的增加和减少。

As we can see in the graph, I starred 142 projects in 2016. There were more than 15 languages across my starred repositories, but I’m only displaying the top 7, as the frequency per language drops after this number. The top language is JavaScript, which doesn’t surprise me, as I work as a Front-End Engineer on a daily basis.

如图所示,我在2016年为142个项目加注了星标。加注星标的存储库中有超过15种语言,但我只显示前7种,因为每种语言的频率在此数字之后下降。 顶级语言是JavaScript,这对我来说并不奇怪,因为我每天都担任前端工程师。

The second and third programming languages are Python and Go, which most likely relate to projects about artificial intelligence / deep learning I mentioned earlier. Python made sense, since it was recently considered the most popular language for Machine Learning.

第二和第三种编程语言是PythonGo,它们很可能与我前面提到的有关人工智能/深度学习的项目有关。 Python很有意义,因为最近它被认为是机器学习中最受欢迎的语言

每个人都得到一个图。 (Everyone gets a graph.)

As part of the development of the tool, I tested the application with other developers. This produced a series of interesting graphs.

作为该工具开发的一部分,我与其他开发人员一起测试了该应用程序。 这产生了一系列有趣的图。

The following is a list of a few famous developers, grouped by the languages they’ve starred the most.

以下是一些知名开发人员的列表,按他们最常出演的语言进行分组。

JavaScript开发人员 (Javascript Developers)
Golang开发人员 (Golang Developers)
Python开发人员 (Python Developers)
斯威夫特 (Swift,R)

关于数据的事情 (The thing about data)

I had a lot of fun from this experiment, and learned two important lessons:

我从这个实验中获得了很多乐趣,并且学到了两个重要的教训:

  • Data can be beautiful. Not everything needs to have a deep meaning in order to be interesting. For instance, the cover for this article is the product of overlapping a series of streamgraphs from various datasets. I liked it so much I even copyrighted it.

    数据可以很漂亮 。 并非所有事物都必须具有深层含义才能变得有趣。 例如,本文的封面是重叠来自各种数据集的一系列流图的产物 。 我非常喜欢它,甚至拥有版权

  • Our data identify us. Given enough starred projects, the chances of having two individuals with the exact same starred repositories at the exact same time are negligible*. Thus, if we analyze the starring patterns on a developer, we could identify them by seeing their data. This is an example of Behavioral Analytics, used in the past to identify users by mobile app usage.

    我们的数据可以识别我们。 如果有足够的已加星标项目,则在完全相同的时间让两个人拥有完全相同的已加星标存储库的机会可以忽略*。 因此,如果我们分析开发人员的主打模式,则可以通过查看其数据来识别它们。 这是行为分析的示例,过去曾用于通过移动应用使用情况识别用户

By the end of this experiment, I’m was more interested in exploring the usages of Data Visualization and Machine Learning than before**. I’ll continue to expand my knowledge in the area to create more experiments like this in the future.

在本实验结束时,我对探索数据可视化和机器学习的用途比以前更加感兴趣**。 我将继续在该领域中扩展我的知识,以在将来创建更多类似的实验。

请在家尝试一下 (Please do try this at home)

If you’re curious about the code, you can see it on Github.

如果您对代码感到好奇,可以在Github上看到它。

jjperezaguinaga/github-patternsgithub-patterns - ? What languages got the most GitHub stars in 2016?github.com

jjperezaguinaga / github-patterns github-patterns-? 2016年哪些语言在GitHub上获得最多星光?g ithub.com

Bear in mind the code is very dirty, so errors might occur (for example, the GitHub rate limit timeout error isn’t caught), so don’t take it as a reference for any real production projects. Feel free to change, expand or fork the code as you wish.

请记住,代码非常脏,因此可能会发生错误(例如,未捕获GitHub速率限制超时错误),因此请勿将其用作任何实际生产项目的参考。 随意更改,扩展或添加代码

*Not negligible, but very unlikely. A person would need to star the same project at the same second to share the same pattern. There are 31557600 seconds in an astronomical year and are around 20M repositories in GitHub by the end of 2016, and around 5.8M active users in GitHub. You tell me what are the odds of two people with 10 starred projects to have the same pattern.

*不可忽略,但极不可能。 一个人需要在相同的时间为同一项目加注星标,以共享相同的模式。 天文学年中有31557600秒,到2016年底GitHub上有大约2000万个存储库,GitHub上有580万活跃用户。 您告诉我,两个拥有10个加星标项目的人具有相同模式的几率是多少?

**Udacity released this weekend a new nanodegree about Deep Learning foundations. I’ve enrolled myself and will post an overview after I finish it.

** Udacity本周末发布了有关深度学习基础的新纳米学位 我已经报名参加,完成后会发表概述。

翻译自: https://www.freecodecamp.org/news/data-visualization-what-languages-got-the-most-github-stars-in-2016-a4e3908a9532/

github星最高

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值