hadoop hdfs2 Basic Filesystem Operations

最新推荐文章于 2024-04-13 12:27:35 发布

zhuyu4839

最新推荐文章于 2024-04-13 12:27:35 发布

阅读量929

点赞数

分类专栏：一步一步学习hadoop2.2 文章标签： hadoop2.2 hadoop2.2.0 hadoop hdfs2

本文链接：https://blog.csdn.net/zhuyu4839/article/details/21489217

版权

一步一步学习hadoop2.2 专栏收录该内容

15 篇文章 0 订阅

订阅专栏

一) 使用File shell方式,主要包括hadoop fs [command]和hdfs dfs [command]操作,具体的示例hadoop的referenece

二) 使用java api操作:

目前hadoop提供的文件操作的入口类主要有FileSystem和FileContext

步骤:

1. 获取org.apache.hadoop.conf.Configuration对象实例

2. 获取FileSystem或者FileContext实例

3. 通过FileSystem或者FileContext实例操作hadoop文件系统

示例1. create path in hadoop hdfs

Configeration conf = new Configeration();

FileSystem hdfs = FileSystem.get(conf);

Path path = new Path("pathName"); // ex: hdfs://localhost:9000/test and file:///filepath

hdfs.create(path);

hdfs.close();

示例2. create path in hadoop hdfs2

Configeration conf = new Configeration();

FileContext hdfs2 = FileContext.getFileContext(conf);
Path path = new Path("pathName");// ex: hdfs://localhost:9000/test and file:///filepath
// 这里搞不懂为什么要用EnumSet,纠结了半天还是没明白,直接用Enum方便多了

hdfs2.create(path, EnumSet.of(CreateFlag.CREATE), Options.CreateOpts.createParent());

//the same as operation delete

查看api可以知道FileContext提供了一个静态的方法:getLocalFSFileConext()来获取本地的FileConext对象.

hadoop1.x与hadoop2.x的HDFS的文件操作api全部由FileSystem转到了FileContext,当然向下兼容性可以让我们使用FileSystem操作.

需要注意的是,目前hadoop的配置上,hdfs的访问全是ALL,并且文件的权限也全是rw;这安全性以后研究.

获取一个目录下的文件列表:

FileStatus status = FileContext.getLocalFSFileContext().getFileStatus(path);

if (status.isDirectory()) {

RemoteIterator<FileStatus> fileList = FileContext.getLocalFSFileContext().listStatus(path);

while (fileList.hasNext()) {
FileStatus statu = fileList.next();
System.out.println(statu.getPath().getName());
}

}

使用FSDataInputStream于FSDataOutputStream对文件数据读写操作.

示例3. FSDataInputStream与FSDataOutputStream(分别继承java.io.DataInputStream与java.io.DataOutputStream)进行数据的操作.

下面完成了在hdfs文件系统中文件的拷贝:

FSDataInputStream in = null;
FSDataOutputStream out = null;
try {
in = FileContext.getLocalFSFileContext().open(path);
out = FileContext.getLocalFSFileContext().create(dest, EnumSet.of(CreateFlag.CREATE), Options.CreateOpts.donotCreateParent());

byte[] data = new byte[4096];
int length = 0;
while(0 < (length = in.read(data, 0, data.length))) {
out.write(data, 0, length);
}
} catch (AccessControlException e) {
e.printStackTrace();
} catch (FileNotFoundException e) {
e.printStackTrace();
} catch (UnsupportedFileSystemException e) {
e.printStackTrace();
} catch (IOException e) {
e.printStackTrace();
} finally {
try {
if (null != in)
in.close();
if (null != out)
out.close();
} catch (IOException e) {
e.printStackTrace();
}
}

Hadoop获取文件对象数据:FileStatus实例,可以参考api

zhuyu4839

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
hadoop hdfs2 Basic Filesystem Operations

1. create path in hadoop hdfsConfigeration conf = new Configeration();FileSystem hdfs = FileSystem.get(conf);Path path = new Path("pathName");hdfs.create(path);hdfs.close();2. create p
复制链接

扫一扫