hadoop学习笔记（一）

最新推荐文章于 2024-05-15 17:41:21 发布

a_victory

最新推荐文章于 2024-05-15 17:41:21 发布

阅读量832

点赞数 2

分类专栏：大数据学习 hadoop 文章标签： hadoop

大数据学习同时被 2 个专栏收录

4 篇文章 0 订阅

订阅专栏

hadoop

1 篇文章 0 订阅

订阅专栏

Hadoop之FileSystem文件的操作

//读数据
Hadoop中的IOUtils类的两个静态方法：
1）IOUtils.copyBytes()，其中in表示拷贝源，System.out表示拷贝目的地（也就是要拷贝到标准输出中去），4096表示用来拷贝的buffer大小，false表明拷贝完成后我们并不关闭拷贝源可拷贝目的地（因为System.out并不需要关闭，in可以在finally语句中被关闭）。
2）IOUtils.closeStream()，用来关闭一个流

public class FileSystemDoubleCat {
       public static void main(String[] args) throws Exception {
          String uri = args[0];
          Configuration conf = new Configuration();
          FileSystem fs = FileSystem.get(URI.create(uri), conf);
          FSDataInputStream in = null;
     try {
           in = fs.open(new Path(uri));
           IOUtils.copyBytes(in, System.out, 4096, false);
           in.seek(0); // go back to the start of the file
           IOUtils.copyBytes(in, System.out, 4096, false);
     } finally {
           IOUtils.closeStream(in);
     }
   }
 }

//写数据

public class FileCopyWithProgress {
      public static void main(String[] args) throws Exception {
          String localSrc = args[0];
          String dst = args[1];
          InputStream in = new BufferedInputStream(new FileInputStream(localSrc));
          Configuration conf = new Configuration();
          FileSystem fs = FileSystem.get(URI.create(dst), conf);
          OutputStream out = fs.create(new Path(dst), new Progressable() {
              public void progress() {
                 System.out.print(".");
             }
         });
         IOUtils.copyBytes(in, out, 4096, true);
     }
 }

//getFileStatus()方法提供了获取某个给定文件或目录的FileStatus对象的途径（略，见http://www.tuicool.com/articles/NvQf6b）

//创建

fs.mkdirs(newPath(DIR_PATH));

//删除

fs.delete(newPath(FILE_PATH), true);

hadoop yarn web service
假设你有一个application_1388830974669_1540349作业，并且运行完了。可以通过下面的命令得到这个作业的一些信息：
$ curl –compressed -H”Accept: application/json”-X \
GET “http://master:8088/ws/v1/cluster/apps/application_1388830974669_1540349“会以json格式返回数据信息

a_victory

关注

2
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
hadoop学习笔记（一）

Hadoop之FileSystem文件的操作//读数据 Hadoop中的IOUtils类的两个静态方法： 1）IOUtils.copyBytes()，其中in表示拷贝源，System.out表示拷贝目的地（也就是要拷贝到标准输出中去），4096表示用来拷贝的buffer大小，false表明拷贝完成后我们并不关闭拷贝源可拷贝目的地（因为System.out并不需要关闭，in可以在finally语
复制链接

扫一扫

专栏目录