博客
关于我
Hadoop之HDFS:Hadoop分布式文件系统--具体知识点
阅读量:79 次
发布时间:2019-02-25

本文共 3353 字,大约阅读时间需要 11 分钟。

HDFS?Hadoop???????

HDFS?Hadoop Distributed File System??Hadoop????????????????????????????????????????????????????????????????????????????Java API?????????HDFS???????


??????

??VMware 15???????????CentOS 7?????????Apache Hadoop 3.1.4???????Windows?Mac???????IDEA?IntelliJ IDEA????????????????????????Hadoop?????????????????


??????

  • Hadoop?????
  • Hadoop????????
  • Hadoop?????
  • HDFS????

  • ??????

  • ????????????
  • ??HDFS????????
  • ????HDFS?????
  • ??HDFS???????

  • ??????

    1. Hadoop?????

    Hadoop????????Apache Nutch???????????????????????????????????????Nutch???????????????2003??????????????????????????GFS?Google File System??MapReduce???????????Hadoop???????????

    2004??2005??Nutch?????????GFS?MapReduce?????????????????Hadoop???2008?1??Hadoop????Apache??????????????????

    Hadoop?????????????????????????????????????????

    2. Hadoop????

    Hadoop?????????????

    • 0.x???Hadoop?????????????????
    • 1.x????????0.x??????bug???????????
    • 2.x??????YARN??????????????????????????????????
    • 3.x????2.x????????HDFS????????????????????

    3. Hadoop????????

    Hadoop??????????????Apache Hadoop?Cloudera?Hortonworks?MapR?

    • Apache Hadoop?????????????????
    • Cloudera????????????????Impala???????????
    • Hortonworks????????Windows??????????
    • MapR????????Hadoop??????????????

    ??Hadoop??????

    Hadoop??????????

  • HDFS??????????????????????
  • MapReduce???????????????????????
  • YARN?Hadoop 2.x????????????????????????
  • 1. HDFS??

    HDFS????????

    • NameNode????????????????????????????????
    • DataNode??????????????????????
    • Secondary NameNode???NameNode????????????????

    2. YARN??

    YARN????????????

    • ResourceManager?????????????????
    • NodeManager??????????MapReduce???

    ??HDFS????

    1. ?????????

    HDFS???????????????128MB?????????????????????????????3????????????????

    • ???????hdfs-site.xml???dfs.blocksize??????????
    • ???????????dfs.replication?????????????????????????

    2. ?????????

    • ??????????????HDFS??????????????????????
    • ????????????????????????????
    • ??????????????????????????

    3. HDFS??

    HDFS????NameNode?DataNode?Secondary NameNode?

    • NameNode??????????????????
    • DataNode????????????????????
    • Secondary NameNode???NameNode??????????????????????

    4. ?????

    • ?????????????????????
    • ?????????Unix???????????????

    ??HDFS????

    HDFS?????????hadoop fs?hdfs dfs?????????????????????????????????

    • ?????
      hdfs dfs -mkdir /path
    • ?????
      hdfs dfs -put local_file hdfs_path
    • ?????
      hdfs dfs -get hdfs_path local_file
    • ?????
      hdfs dfs -rm hdfs_path

    ??HDFS????

    HDFS?????????????????????????????????????????????????

    • ????????????????99.9%?????????????
    • ????????????????????hdfs dfsadmin -safemode enter | leave | get | wait?

    ??Java API??

    1. ????

    • Windows?????Hadoop??????????????????
    • Mac??????????????Hadoop???

    2. ?????????

    ??Maven???????Hadoop?????

    org.apache.hadoop
    hadoop-client
    ${hadoop.version}

    3. Java API????

    @Testpublic void createDirectory() throws IOException {    Configuration configuration = new Configuration();    configuration.set("fs.defaultFS", "hdfs://node01:8020");    FileSystem fileSystem = FileSystem.get(configuration);    Path dirPath = new Path("/kaikeba/dir1");    FsPermission permission = new FsPermission(FsAction.ALL, FsAction.READ, FsAction.WRITE);    fileSystem.mkdirs(dirPath, permission);    fileSystem.close();}

    ?????????

  • ?????HDFS?????????????????????
  • ???????HDFS??????????????????
  • ?????HDFS???????Hive?Spark??????

  • ???????????????HDFS????????????????

    转载地址:http://fg.baihongyu.com/

    你可能感兴趣的文章
    Objective-C实现AlphaNumericalSort字母数字排序算法(附完整源码)
    查看>>
    Objective-C实现alternate disjoint set不相交集算法(附完整源码)
    查看>>
    Objective-C实现alternative list arrange备选列表排列算法(附完整源码)
    查看>>
    Objective-C实现An Armstrong number阿姆斯特朗数算法(附完整源码)
    查看>>
    Objective-C实现anagrams字谜算法(附完整源码)
    查看>>
    Objective-C实现ApproximationMonteCarlo蒙特卡洛方法计算pi值算法 (附完整源码)
    查看>>
    Objective-C实现area under curve曲线下面积算法(附完整源码)
    查看>>
    Objective-C实现argmax函数功能(附完整源码)
    查看>>
    Objective-C实现arithmetic算术算法(附完整源码)
    查看>>
    Objective-C实现armstrong numbers阿姆斯壮数算法(附完整源码)
    查看>>
    Objective-C实现articulation-points(关键点)(割点)算法(附完整源码)
    查看>>
    Objective-C实现atoi函数功能(附完整源码)
    查看>>
    Objective-C实现average absolute deviation平均绝对偏差算法(附完整源码)
    查看>>
    Objective-C实现average mean平均数算法(附完整源码)
    查看>>
    Objective-C实现average median平均中位数算法(附完整源码)
    查看>>
    Objective-C实现average mode平均模式算法(附完整源码)
    查看>>
    Objective-C实现avl 树算法(附完整源码)
    查看>>
    Objective-C实现AvlTree树算法(附完整源码)
    查看>>
    Objective-C实现backtracking Jump Game回溯跳跃游戏算法(附完整源码)
    查看>>
    Objective-C实现BACKTRACKING 方法查找集合的幂集算法(附完整源码)
    查看>>