Hadoop

最新推荐文章于 2022-07-13 14:20:42 发布

hrdzkj

最新推荐文章于 2022-07-13 14:20:42 发布

阅读量412

点赞数 1

分类专栏：研发

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/hrdzkj/article/details/74931871

版权

研发专栏收录该内容

61 篇文章 0 订阅

订阅专栏

Class InputFormat<K,V>

Map-Reduce framework :Split-up the input file(s) into logical InputSplits, each of which is then assigned to an individual Mapper.

Map-Reduce framework 分割输入文件到逻辑的InputSplits，每一个InputSplit都被赋值给个人的Mapper.

RecordReader implementation to be used to glean input records from the logical InputSplit for processing by the Mapper

RecordReader实现成为了Mapper处理，用于从逻辑的InputSplit收集记录

the FileSystem blocksize of the input files is treated as an upper bound for input splits. A lower bound on the split size can be set via mapreduce.input.fileinputformat.split.minsize.

输入文件的最大系统块是分割的上线，下线可以通过mapreduce.input.fileinputformat.split.minsize设置

JOB

It allows the user to configure the job, submit it, control its execution, and query the state

允许用户配置作业，提交他，控制它的执行，和查询状态

Java抽象类org.apache.hadoop.fs.FileSystem定义了hadoop的一个文件系统接口

FileCopyWithProgress---Copies a local file to a Hadoop filesystem 展现如何拷贝本地文件到Hadoop文件系统

FileSystemCat /FileSystemDoubleCat--Displays files from a Hadoop filesystem on standard output by using the FileSystem directly 通过直接使用文件系统显示hadoop文件系统的文件到标准输出上。

URLCat--- Displays files from a Hadoop filesystem on standard output using a URLStreamHandler. 使用URLStreamHandler显示hadoop 文件系统的文件到标准输出上。

Hadoop中的FileStatus类可以用来查看HDFS中文件或者目录的元信息

FileStatus[] status = fs.listStatus(paths);

关注

1
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。