新兴科技趋势报告_大数据:IT部门的新兴趋势

新兴科技趋势报告

If you haven’t heard all about the excitement around Big Data, then I must say you are not really paying attention. The IT industries are the fastest growing technology markets in the world and inset to significantly transform the way we perceive different aspects of life. To grab a good job at IT sector or to have a good grip in the industry, for those who are currently working, we must always remain updated about ongoing and upcoming trends.

如果您还没有完全了解Big Data带来的兴奋,那么我必须说您并没有真正注意。 IT行业是世界上发展最快的技术市场,并且将极大地改变我们对生活各个方面的看法。 为了在IT部门中抢占先机或在行业中牢牢掌控,对于那些目前在工作的人,我们必须始终保持最新的发展趋势。

The fact is that big data analytics has become one of the most valuable parts of any modern business and will surely have a prominent future in IT sector.

事实是, 大数据分析已成为任何现代业务中最有价值的部分之一,并且必将在IT领域拥有广阔的未来。

什么是大数据? (What is big data?)

As the name itself suggests us that Big Data means a huge amount of data, which is high volume, high velocity along with a huge variety. Big data requires new technologies to capture, store and analyze them.

顾名思义, 大数据意味着海量的数据,即大数据 ,高速度以及种类繁多的数据。 大数据需要新技术来捕获,存储和分析它们。

In simple language, we can define big data as examining huge amounts of data in order to discover the hidden patterns, correlations, sharpness, prescience market trend etc.

用简单的语言,我们可以将大数据定义为检查大量数据,以发现隐藏的模式,相关性,敏锐度,先行市场趋势等。

Big data are so voluminous, messed up, varied and complex that the traditional software that we use for data processing is inadequate to deal with.

大数据是如此庞大,混乱,多样化和复杂,以至于我们用于数据处理的传统软件不足以应对。

The general consensus of the day is that there are specific attributes that define BIG DATA. In the most data circles, these are known as 4 v’s:

当今的普遍共识是,存在定义BIG DATA的特定属性。 在大多数数据圈中,这些被称为4 v:

  1. Volume

  2. Variety

    品种

  3. Velocity

    速度

  4. Veracity

    真实性

big data analysis

Figure: Big data analysis

图:大数据分析

挑战性 (Challenges)

Some common challenges faced in Big data analysis are:

大数据分析面临的一些常见挑战是:

  • Dealing with data

    处理数据

  • Generating insights in a timely manner

    及时产生见解

  • Recruiting and retaining big data

    招募和保留大数据

  • Integrating disparate data sources

    集成不同的数据源

  • Validating data

    验证数据

  • Storing data

    储存资料

  • Securing big data etc

    保护大数据等

使用的技术或软件 (Technologies or softwares used)

Technologies:

技术:

  • Different techniques used for analyzing big data are as A/B testing, machine learning, and natural language processing.

    用于分析大数据的不同技术包括A / B测试,机器学习和自然语言处理。

  • Cloud computing, Artificial intelligence, Database management are also used in big data analysis.

    云计算,人工智能,数据库管理也用于大数据分析。

  • Charts, graphs etc. of data are used in the analysis process.

    在分析过程中使用数据的图表,图形等。

Softwares or tools used:

使用的软件或工具:

  • Hadoop: In simple language, we can say that Apache Hadoop is an open source framework which allows us to implement Big Data. Hadoop can be also defined as a distributed data processing system which stores the data and then it allows us to use or process this data in a distributed manner.

    Hadoop:用简单的语言,我们可以说Apache Hadoop是一个开放源代码框架,允许我们实现大数据。 Hadoop也可以定义为存储数据的分布式数据处理系统,然后它允许我们以分布式方式使用或处理此数据。

    Download link: http://archive.apache.org/dist/hadoop/common/hadoop-2.6.2/hadoop-2.6.2.tar.gz

    下载链接: http : //archive.apache.org/dist/hadoop/common/hadoop-2.6.2/hadoop-2.6.2.tar.gz

    Note: It's only the download link for Hadoop, you will have to install Java 8 as well as an eclipse in order to run Hadoop. (** will brief and demonstrate the whole process in upcoming articles).

    注意:这只是Hadoop的下载链接,您必须安装Java 8和eclipse才能运行Hadoop。 (**将在以后的文章中简要介绍并演示整个过程)。

  • Mapreduce: It is a programming paradigm that allows for massive scalability of unstructured data across hundreds or thousands of commodity clusters servers in an Apache Hadoop cluster. It is also called "Heart of the Apache Hadoop".

    Mapreduce:这是一种编程范例,可在Apache Hadoop集群中的数百或数千个商品集群服务器上实现非结构化数据的大规模可伸缩性。 它也被称为“ Apache Hadoop的心脏”

    Note: There are several others like Apache Spark, NoSQL etc. that are being used for big data analysis.

    注意:还有其他一些诸如Apache Spark,NoSQL等用于大数据分析

大数据课程的前提条件 (Prerequisites for big data courses)

顶级认证 (Top Certifications)

  • Cloudera Certified Administrator for apache Hadoop (CCAH).

    Cloudera Apache Hadoop(CCAH)的认证管理员。

  • Cloudera Certified professional: Data Scientist (CCP:DS).

    Cloudera认证的专业人员:数据科学家(CCP:DS)。

  • EMC Data Scientist Associate (EMCDSA).

    EMC数据科学家协会(EMCDSA)。

  • HP Vertica Big Data Accredited solutions Expert (ASE).

    HP Vertica大数据认证解决方案专家(ASE)。

结论 (Conclusion)

Till now, we have known what is big data, what are its features, technologies used, certifications that we can do in this field. We will discuss in brief about the difference between data mining and big data, future scopes, more about big data and upcoming trends. So, stay connected, it will be a great fun to learn and discover together. Stay healthy and keep learning!

到现在为止,我们已经知道什么是大数据 ,它在该领域可以做的功能,使用的技术和认证。 我们将简要讨论数据挖掘与大数据之间的区别,未来范围,更多有关大数据和未来趋势的信息。 因此,保持联系,一起学习和发现将是一个很大的乐趣。 保持健康并继续学习!

翻译自: https://www.includehelp.com/big-data/big-data-an-emerging-trend-on-it-sector.aspx

新兴科技趋势报告

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值