Hadoop HDFS: The Hadoop Distributed File System, Key Topics in Detail
Published: 2019-02-25


HDFS: The Hadoop Distributed File System

HDFS?Hadoop Distributed File System??Hadoop????????????????????????????????????????????????????????????????????????????Java API?????????HDFS???????


??????

The environment uses VMware 15 virtual machines running CentOS 7 with Apache Hadoop 3.1.4. Development is done on Windows or Mac with IDEA (IntelliJ IDEA).


??????

  • Hadoop?????
  • Hadoop????????
  • Hadoop?????
  • HDFS????

  • ??????

  • ????????????
  • ??HDFS????????
  • ????HDFS?????
  • ??HDFS???????

  • ??????

    1. History of Hadoop

    Hadoop originated in Apache Nutch, an open-source web search project. From 2003 onward, Google published papers describing GFS (Google File System) and MapReduce, which laid the groundwork for Hadoop.

    In 2004 and 2005, the Nutch developers implemented their own versions of GFS and MapReduce, which later became Hadoop. In January 2008, Hadoop became a top-level Apache project.

    Hadoop?????????????????????????????????????????

    2. Hadoop Versions

    Hadoop releases fall into several version lines:

    • 0.x: the earliest Hadoop releases.
    • 1.x: the second generation, mainly fixing bugs in 0.x.
    • 2.x: introduced YARN.
    • 3.x: builds on 2.x, with further changes to HDFS.

    3. Hadoop Distributions

    Hadoop is available in several distributions: Apache Hadoop, Cloudera, Hortonworks, and MapR.

    • Apache Hadoop?????????????????
    • Cloudera????????????????Impala???????????
    • Hortonworks????????Windows??????????
    • MapR????????Hadoop??????????????

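    Hadoop Architecture Modules

    Hadoop is made up of the following modules:

  • HDFS: the distributed file system, responsible for storing data.
  • MapReduce: the distributed computation framework, responsible for processing data.
  • YARN: introduced in Hadoop 2.x, responsible for resource management and job scheduling.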
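    1. HDFS Modules

    HDFS has the following components:

    • NameNode: the master node, which manages the file system metadata.
    • DataNode: the worker nodes, which store the data blocks.
    • Secondary NameNode: assists the NameNode with managing its metadata.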

    2. YARN Modules

    YARN has the following components:

    • ResourceManager: manages and allocates cluster resources.
    • NodeManager: runs tasks on each node, such as MapReduce tasks.

    ??HDFS????

    1. Blocks and Replicas

    HDFS splits each file into blocks, 128 MB by default, and stores every block as several replicas, 3 by default.

    • Block size: set through dfs.blocksize in hdfs-site.xml.
    • Replica count: set through dfs.replication.
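The effect of the two settings can be worked through with simple arithmetic. The following is only an illustrative sketch, not Hadoop code: the class name `BlockMath`, its methods, and the 300 MB file size are made up for this example, while the 128 MB block size and the replica count of 3 are the defaults mentioned above.

```java
// Illustrative arithmetic only: how a file maps onto HDFS blocks and replicas.
// BlockMath and its methods are hypothetical helpers, not part of the Hadoop API.
final class BlockMath {
    static final long MB = 1024L * 1024L;

    /** Number of blocks needed for a file: ceiling of fileSize / blockSize. */
    static long blockCount(long fileSizeBytes, long blockSizeBytes) {
        if (fileSizeBytes < 0 || blockSizeBytes <= 0) {
            throw new IllegalArgumentException("sizes must be non-negative / positive");
        }
        return (fileSizeBytes + blockSizeBytes - 1) / blockSizeBytes;
    }

    /** Raw bytes stored across the cluster: each byte is kept once per replica. */
    static long rawStorageBytes(long fileSizeBytes, int replication) {
        return fileSizeBytes * replication;
    }

    public static void main(String[] args) {
        long fileSize = 300 * MB;   // hypothetical file
        long blockSize = 128 * MB;  // default dfs.blocksize
        int replication = 3;        // default dfs.replication

        System.out.println("blocks: " + blockCount(fileSize, blockSize));
        System.out.println("raw MB: " + rawStorageBytes(fileSize, replication) / MB);
    }
}
```

Under these assumptions the 300 MB file occupies three blocks (128 MB, 128 MB, and a final 44 MB block) and about 900 MB of raw disk space across the cluster, since the last block only takes up as much space as the data it holds.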

    2. ?????????

    • ??????????????HDFS??????????????????????
    • ????????????????????????????
    • ??????????????????????????

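    3. HDFS Architecture

    HDFS consists of the NameNode, DataNodes, and the Secondary NameNode.

    • NameNode: manages the file system namespace and metadata.
    • DataNode: stores the actual data blocks.
    • Secondary NameNode: assists the NameNode by helping maintain its metadata.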

    4. ?????

    • ?????????????????????
    • ?????????Unix???????????????

    HDFS Commands

    HDFS is operated from the command line through hadoop fs or hdfs dfs. Common operations include:

    • Create a directory:
      hdfs dfs -mkdir /path
    • Upload a file:
      hdfs dfs -put local_file hdfs_path
    • Download a file:
      hdfs dfs -get hdfs_path local_file
    • Delete a file:
      hdfs dfs -rm hdfs_path

    HDFS Safe Mode

    Safe mode is a protective state of HDFS in which the file system is read-only and cannot be modified.

    • Leaving automatically: once 99.9% of blocks meet the minimum replica requirement, HDFS leaves safe mode on its own.
    • Manual control: hdfs dfsadmin -safemode enter | leave | get | wait.
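The 99.9% exit condition can be sketched as a small check. This is a simplified illustration, not the NameNode's real implementation, which also takes other conditions into account. The class `SafeModeSketch` and its method are hypothetical names, and integer per-mille arithmetic is used here only to avoid floating-point rounding.

```java
// Simplified sketch of the safe mode exit threshold. Hypothetical helper, not Hadoop code.
final class SafeModeSketch {
    /** Threshold in per-mille: 999 corresponds to the 99.9% mentioned above. */
    static final long THRESHOLD_PER_MILLE = 999;

    /**
     * Returns true when the share of blocks that meet the minimum replica
     * requirement has reached the threshold, so safe mode could be left.
     */
    static boolean thresholdReached(long safeBlocks, long totalBlocks) {
        if (safeBlocks < 0 || totalBlocks < 0 || safeBlocks > totalBlocks) {
            throw new IllegalArgumentException("invalid block counts");
        }
        // An empty file system trivially satisfies the condition.
        return safeBlocks * 1000 >= totalBlocks * THRESHOLD_PER_MILLE;
    }

    public static void main(String[] args) {
        System.out.println(thresholdReached(998, 1000));
        System.out.println(thresholdReached(999, 1000));
    }
}
```

With 1000 blocks in total, the check fails at 998 reported blocks and passes at 999. On a real cluster, `hdfs dfsadmin -safemode get` reports the current state.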

    ??Java API??

    1. ????

    • Windows?????Hadoop??????????????????
    • Mac??????????????Hadoop???

    2. ?????????

    Add the Hadoop dependency to the Maven project:

    <dependency>
        <groupId>org.apache.hadoop</groupId>
        <artifactId>hadoop-client</artifactId>
        <version>${hadoop.version}</version>
    </dependency>

    3. Java API????

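    import java.io.IOException;
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.fs.permission.FsAction;
    import org.apache.hadoop.fs.permission.FsPermission;
    import org.junit.Test;

    @Test
    public void createDirectory() throws IOException {
        Configuration configuration = new Configuration();
        // Point the client at the NameNode.
        configuration.set("fs.defaultFS", "hdfs://node01:8020");
        FileSystem fileSystem = FileSystem.get(configuration);
        Path dirPath = new Path("/kaikeba/dir1");
        // Requested permissions: user rwx, group r--, other -w-.
        FsPermission permission = new FsPermission(FsAction.ALL, FsAction.READ, FsAction.WRITE);
        fileSystem.mkdirs(dirPath, permission);
        fileSystem.close();
    }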
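The three arguments passed to FsPermission in the test above are the user, group, and other permissions, in that order. The sketch below shows how FsAction.ALL, FsAction.READ, and FsAction.WRITE translate into the familiar Unix-style notation. It is plain Java with no Hadoop dependency, and `PermissionSketch` and its methods are hypothetical helpers written for this illustration.

```java
// Hypothetical helper showing how three rwx triplets map to Unix-style notation.
// Bit values follow the usual convention: 4 = read, 2 = write, 1 = execute.
final class PermissionSketch {
    static final int ALL = 7;    // rwx, corresponds to FsAction.ALL
    static final int READ = 4;   // r--, corresponds to FsAction.READ
    static final int WRITE = 2;  // -w-, corresponds to FsAction.WRITE

    /** Renders one 3-bit value as an rwx triplet. */
    static String triplet(int bits) {
        if (bits < 0 || bits > 7) {
            throw new IllegalArgumentException("bits must be in 0..7");
        }
        return ((bits & 4) != 0 ? "r" : "-")
             + ((bits & 2) != 0 ? "w" : "-")
             + ((bits & 1) != 0 ? "x" : "-");
    }

    /** Symbolic form for user, group, other, e.g. rwxr---w-. */
    static String symbolic(int user, int group, int other) {
        return triplet(user) + triplet(group) + triplet(other);
    }

    /** Octal form for user, group, other, e.g. 742. */
    static String octal(int user, int group, int other) {
        triplet(user); triplet(group); triplet(other); // range checks only
        return "" + user + group + other;
    }

    public static void main(String[] args) {
        System.out.println(symbolic(ALL, READ, WRITE));
        System.out.println(octal(ALL, READ, WRITE));
    }
}
```

So the test asks for mode 742, or rwxr---w-. The mode that actually ends up on the directory may be narrower, since a umask configured for the client or cluster can be applied on top of the requested permissions.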

    ?????????

  • ?????HDFS?????????????????????
  • ???????HDFS??????????????????
  • ?????HDFS???????Hive?Spark??????

  • ???????????????HDFS????????????????

    Reprinted from: http://fg.baihongyu.com/
