r包设置地址rstudio_您必须知道的十大最佳R软件包

本文介绍了用于数据分析和可视化的十个最受欢迎的R软件包,包括dplyr、ggplot2、tidyr等。这些包简化了数据操作、图形绘制、日期处理和建模等工作,是R语言中不可或缺的工具。
摘要由CSDN通过智能技术生成

r包设置地址rstudio

There are hundreds of free R packages and libraries that are constantly being developed and improved by a rich and diverse developer community. But today we’ll look at the most popular R packages that you must know of.

有数百个免费的R程序包和库,它们由丰富多样的开发人员社区不断开发和改进。 但是今天,我们将介绍您必须知道的最受欢迎的R软件包。

最受欢迎的R软件包列表 (List of the Most Popular R Packages )

This article aims to introduce you to some of the most popular R packages that are used for data analytics and visualization. If you’re short on time, here’s a quick list of the most popular R packages that you must know.

本文旨在向您介绍一些用于数据分析和可视化的最受欢迎的R软件包。 如果时间紧迫,这里是您必须了解的最受欢迎R软件包的快速列表。

A quick list of the best, most popular R packages:

最佳,最受欢迎的R软件包的快速列表

  1. dpylr – This is the best R package that makes data operations easier by designing them into actions known as verbs.

    dpylr –这是最好的R包,它通过将数据设计为称为动词的动作来使数据操作更容易。
  2. ggplot2 – This is the most popular R package for plotting beautiful looking graphs. The letters gg stand for the grammar of graphics.

    ggplot2 –这是最流行的R软件包,用于绘制漂亮的图形。 字母gg代表图形语法。
  3. tidyr – Tidyr is one of the best R packages for giving your data a tidy appearance when performing data munging.

    tidyr – Tidyr是最好的R软件包之一,可以在执行数据整理时为您的数据提供整洁的外观。
  4. lubridate – One of the best data manipulation R packages, exclusively meant to simplify working with date and time formats

    lubridate –最好的数据处理R软件包之一,专门用于简化日期和时间格式的工作
  5. tibble – This R package makes working with large data sets very easy by providing us options to subset the data before displaying

    tibble –该R包通过向我们提供在显示之前对数据进行子集化的选项,使处理大型数据集变得非常容易
  6. stringr – The stringr R package aims to build a more cohesive set of these string operations and provides capabilities for several string operations

    stringr – stringr R软件包旨在为这些字符串操作建立更紧密的结合,并为多个字符串操作提供功能
  7. RMarkDown – This R package is designed to help you with designing and processing reports using markdown

    RMarkDown –此R软件包旨在帮助您使用markdown设计和处理报告
  8. Shiny – This package is an alternative to RMarkdown to generate web-based interactive apps to communicate your data science findings

    闪亮 –此软件包是RMarkdown的替代产品,可以生成基于Web的交互式应用程序来传达您的数据科学发现
  9. modelr – This is one of the best R packages that assists you with building models using your data

    modelr –这是最好的R软件包之一,可帮助您使用数据构建模型
  10. mlr – The mlr package is one of the most popular R packages for implementing machine learning algorithms in R

    mlr – mlr软件包是最受欢迎的R软件包之一,用于在R中实现机器学习算法

1. dpylr ( 1. dpylr)

This R package was developed to solve the data manipulation challenges from beginner to expert level. The package makes your data operations easier by designing them into actions known as the verbs. These verbs perform a different function each.

开发此R包的目的是为了解决从初学者到专家级别的数据处理难题。 该软件包通过将它们设计为称为动词的动作,使您的数据操作更加容易。 这些动词各自执行不同的功能。

  • filter() – This is used to filter through huge data frames and obtain only relevant observations.

    filter() –用于过滤巨大的数据帧并仅获取相关的观察结果。
  • arrange() – Organize and re-order the rows by specific conditions.

    range() –根据特定条件对行进行组织和重新排序。
  • select() – Makes selections from the data frame by a condition specified.

    select() –根据指定的条件从数据帧中进行选择。
  • mutate() – Modify the existing variables to create new ones.

    mutate() –修改现有变量以创建新变量。
  • summarize() – Derive a single value by summarizing multiple variables.

    summary() –通过汇总多个变量来得出单个值。
  • sample_n() and sample_frac() – To obtain random samples from the data.

    sample_n()sample_frac() –从数据中获取随机样本。

These seemingly simple looking operations can be used to perform complex data manipulations when grouped together.

这些看似简单的操作在组合在一起时可以用来执行复杂的数据操作。

Official Website: Dplyr

官方网站: Dplyr

2. ggplot2 (2. ggplot2 )

Ggplot2
Ggplot2 Plot example
Ggplot2绘图示例

R is preferred by several data scientists and statisticians for its beautifully formatted graphics. In addition to the inbuilt graphics package in R, ggplot2 is another very popular graphics suite. The letters gg stand for the grammar of graphics which allows you to create aesthetic looking graphics in a declarative manner.

R因其格式精美的图形而受到数位数据科学家和统计人员的青睐。 除了R中的内置图形包之外 ,ggplot2是另一个非常流行的图形套件。 字母gg代表图形语法,可让您以声明性的方式创建美观的图形。

Each aspect of the graphics such as the data, aesthetic mapping, object scale, coordinates, etc is treated as separate building blocks of the graphic in ggplot2. Thus, it is very flexible to create different graphics using ggplot2 due to this abstraction.

图形的各个方面,例如数据,美学映射,对象比例,坐标等,在ggplot2中被视为图形的单独构建块。 因此,由于这种抽象,使用ggplot2创建不同的图形非常灵活。

Ggplot2 is one of the R packages that also gives a far more polished look to the graphics as it takes care of many issues in appearance and allows you to customize the graphics by themes.

Ggplot2是R软件包之一,它还处理外观上的许多问题,并允许您按主题自定义图形,从而使图形外观更加优美。

Ggplot2 is included in one of the most popular package collection called the tidyverse.

Ggplot2包含在最流行的软件包tidyverse之一中

Official Website: GGPlot2

官方网站: GGPlot2

3.提迪尔 (3. tidyr)

Tidyr Gif
Tidyr in action – Source: Official Tidyr Github
Tidyr的行动–资料来源:Tidyr Github官方

As the name indicates, this R package is all about giving your data a tidy appearance when performing data munging. The rows here are saved as observations and columns are each an exclusive variable.

顾名思义,此R软件包用于在执行数据处理时使您的数据整洁。 此处的行另存为观察值 ,列分别为排他变量

This creates a neat and tidy representation of data that can be manipulated in an organized way. Tidyr like dplyr has a few major verbs to perform these actions.

这样可以创建整洁的数据表示形式,可以以有组织的方式进行操作。 像dplyr这样的Tidyr有一些主要的动词来执行这些动作。

  • gather() – To convert columns to rows with key and value pairs.

    collect() –使用键和值对将列转换为行。
  • spread() – To convert rows to columns.

    spread() –将行转换为列。
  • separate() – To separate a single column to multiple columns.

    split() –将单个列分隔为多个列。
  • unite() – To combine multiple columns into a single column.

    unite() –将多个列合并为一个列。

Official Website: Tidyr

官方网站: Tidyr

4.润滑 (4. lubridate)

This is one of the data manipulation R packages, exclusively meant to simplify working with date and time formats. This R package can handle dates written in multiple formats and separators.

这是数据处理R包之一,专用于简化日期和时间格式的工作。 这个R包可以处理以多种格式和分隔符编写的日期。

Lubridate makes it easy to extract specific information from date objects such as day, month, year, weekday, etc. It also has support for several time zones so that you can avoid inconsistency in representing the same moment of time. Adding and subtracting dates is also much more straightforward with lubridate.

Lubridate使得从日期对象(例如日,月,年,周日等)中提取特定信息变得容易。它还支持多个时区,因此您可以避免表示同一时刻的不一致。 使用lubridate添加和减去日期也更加简单。

Official Website: Lubridate

官方网站: Lubridate

5.小声 (5. tibble)

Tibble is the name given for the data frame alternative in the tidyverse group of R packages. Tibbles behave in a manner very much similar to data frames but have some extra functionality that makes working with nibbles much more convenient.

Tibble是R包的tidyverse组中为数据帧替代名称的名称。 Tibble的行为与数据帧非常相似,但是具有一些额外的功能,使得使用Nibble更加方便。

Tibbles employs a novel print method that displays the first 10 rows of a data frame instead of the entire frame. This makes working with large data frames much easier. Similarly, subsetting is also much easier with tibbles compared to data frames.

Tibbles采用一种新颖的打印方法,该方法显示数据帧的前10行而不是整个帧。 这使得处理大型数据帧变得更加容易。 同样,与数据帧相比,带有小标题的子集也容易得多。

Official Website: Tibble

官方网站: Tibble

6.纵梁 (6. stringr)

Strings play a major role in data cleaning and preparation tasks. However, traditional string handling methods in R can become very clumsy due to their inconsistency.

字符串在数据清理和准备任务中起主要作用。 但是,R中的传统字符串处理方法由于不一致而变得非常笨拙。

The stringr package aims to build a more cohesive set of these string operations and provides capabilities for several string operations such as:

stringr软件包旨在为这些字符串操作建立更紧密的结合,并为几种字符串操作提供功能,例如:

  • Finding length – str_length()

    查找长度– str_length()
  • Concatenate two strings – str_c()

    连接两个字符串– str_c()
  • Extract substrings – str_sub()

    提取子字符串– str_sub()
  • Duplicate string – str_dup()

    字符串重复– str_dup()
  • Match a pattern – str_match()

    匹配模式– str_match()

Official Website: Stringr

官方网站: Stringr

7. RMarkDown (7. RMarkDown)

R Markdown is a package designed to help you with designing and processing reports using markdown. R Markdown provides you with a notebook format for integrating your data science code and provides results along with commentary.

R Markdown是一个软件包,旨在帮助您使用markdown设计和处理报告。 R Markdown为您提供了一种笔记本格式,用于集成数据科学代码,并提供结果和注释。

These beautifully generated reports can be used to communicate your work to the decision-makers and collaborate with other data scientists on the web.

这些生成的精美报告可用于将您的工作传达给决策者,并与网络上的其他数据科学家进行协​​作。

Official Website: RMarkDown

官方网站: RMarkDown

8.闪亮 (8. Shiny)

Shiny In R Demo
Shiny In R Demo
闪亮的R演示

This R package is another alternative to RMarkdown to generate web-based interactive apps to communicate your data science findings with decision-makers and programmers.

此R包是RMarkdown的另一种选择,可以生成基于Web的交互式应用程序,以与决策者和程序员交流您的数据科学发现。

The Shiny package in R doesn’t require you to have any web development skills and is totally compatible with RMarkdown. The generated web applications can be stand-alone or be embedded within other webpages using HTML, CSS, and javascript.

R中的Shiny软件包不需要您具备任何Web开发技能,并且与RMarkdown完全兼容。 生成的Web应用程序可以是独立的,也可以使用HTML,CSS和javascript嵌入到其他网页中。

Official Website: Shiny in R

官方网站: Shiny in R

9.建模者 (9. modelr)

Modelr is also a tidyverse component package with a large number of helper functions to assist you with building models using your data.

Modelr还是一个tidyverse组件包,具有大量的辅助功能,可以帮助您使用数据构建模型。

A model family can be a simple linear equation, a quadratic function, or any other kind of function that captures the data.

模型族可以是简单的线性方程,二次函数或任何其他种类的捕获数据的函数。

The model is fit to the data by changing its parameters to correspond to the data closely. Modelr offers functions for sampling data, generating models, evaluating models using quality metrics, and also interact with models using new unseen data.

通过更改其参数以使其与数据紧密对应,该模型适合数据。 Modelr提供以下功能:采样数据,生成模型,使用质量指标评估模型,以及使用新的看不见的数据与模型进行交互。

Official Website: ModelR

官方网站: ModelR

10. 毫升 (10. mlr)

If you are keen on implementing machine learning algorithms using R and are looking for a package that provides an infrastructure to do so, mlr is the right package for you.

如果您热衷于使用R来实现机器学习算法,并且正在寻找提供基础架构的软件包,那么mlr是适合您的软件包。

Mlr is one of the R packages which has several functions to build classification, regression, clustering and survival models in R. The latest version of mlr known as the mlr3 has even more advanced features to build ML models to suit the current day needs.

Mlr是R软件包之一,它具有在R中构建分类,回归,聚类和生存模型的几种功能。最新版本的mlr被称为mlr3,它具有更高级的功能来构建ML模型以适应当今的需求。

Official Website: MLR

官方网站: MLR

翻译自: https://www.journaldev.com/36488/top-best-r-packages

r包设置地址rstudio

  • 1
    点赞
  • 3
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值