Hadoop HDFS: Hadoop Distributed File System -- Detailed Notes
Published: 2019-02-25


HDFS: Hadoop Distributed File System

HDFS (Hadoop Distributed File System) is Hadoop's distributed file system. These notes cover its background and architecture, its core concepts, command-line operations, safe mode, and programming against it with the Java API.


??????

The environment uses VMware 15 as the virtualization software, CentOS 7 as the guest operating system, and Apache Hadoop 3.1.4. Development is done on Windows or Mac with IDEA (IntelliJ IDEA).


??????

  • History of Hadoop
  • Hadoop distribution vendors
  • Hadoop architecture
  • HDFS????

  • ??????

  • ????????????
  • ??HDFS????????
  • ????HDFS?????
  • ??HDFS???????

  • ??????

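    1. History of Hadoop

    Hadoop grew out of Apache Nutch, an open-source web search engine. Nutch had to store and process very large volumes of crawled data, and starting in 2003 Google published papers describing GFS (Google File System) and MapReduce, which became the design basis for Hadoop.

    Between 2004 and 2005, the Nutch developers implemented their own versions of GFS and MapReduce, and this work was later split out and named Hadoop. In January 2008, Hadoop became a top-level Apache project.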

    Hadoop?????????????????????????????????????????

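    2. Hadoop versions

    Hadoop has gone through several major version lines:

    • 0.x: the earliest open-source releases of Hadoop.
    • 1.x: mainly fixed bugs found in the 0.x line.
    • 2.x: introduced YARN, which separates cluster resource management from the computation framework.
    • 3.x: builds on 2.x, with further improvements to HDFS and the rest of the platform.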

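    3. Hadoop distribution vendors

    The main Hadoop distributions come from Apache Hadoop, Cloudera, Hortonworks, and MapR.

    • Apache Hadoop: the original open-source release that the others build on.
    • Cloudera: a commercial distribution whose ecosystem includes Impala.
    • Hortonworks: a distribution that also offers support for Windows.
    • MapR: a commercial distribution that ships its own modified version of Hadoop.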

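    Hadoop Architecture

    Hadoop consists of the following core modules:

  • HDFS: the distributed file system, responsible for storing large volumes of data.
  • MapReduce: the distributed computing framework, responsible for processing that data in parallel.
  • YARN: the resource management and job scheduling framework introduced in Hadoop 2.x.

    1. HDFS architecture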

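    HDFS is made up of the following roles:

    • NameNode: the master node. It manages the file system namespace and metadata, including which blocks make up each file and where those blocks are stored.
    • DataNode: the worker nodes. They store the actual data blocks and serve read and write requests.
    • Secondary NameNode: assists the NameNode by periodically merging its metadata image with the edit log.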

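    2. YARN architecture

    YARN is made up of two main roles:

    • ResourceManager: allocates and schedules resources for the whole cluster.
    • NodeManager: manages the resources of a single node and runs the tasks assigned to it, such as MapReduce tasks.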

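    HDFS Core Concepts

    1. Blocks and replicas

    HDFS splits each file into fixed-size blocks, 128 MB by default, and spreads the blocks across the DataNodes. Each block is stored as 3 replicas by default, so the loss of a single node does not lose data.

    • Block size: set with the dfs.blocksize property in hdfs-site.xml.
    • Replication factor: set with the dfs.replication property. Raising it improves fault tolerance at the cost of more storage.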
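
The block and replica arithmetic can be made concrete with a small Java sketch. It assumes the defaults mentioned in this section (128 MB blocks, 3 replicas). The class and method names (BlockMath, blockCount, lastBlockSize, rawStorage) are made up for illustration and are not part of the Hadoop API.

```java
/** Illustrative helper, not a Hadoop class: how a file maps onto blocks and replicas. */
final class BlockMath {
    static final long MB = 1024L * 1024L;

    /** Number of blocks a file occupies: ceil(fileSize / blockSize). An empty file has no blocks. */
    static long blockCount(long fileSizeBytes, long blockSizeBytes) {
        if (fileSizeBytes < 0 || blockSizeBytes <= 0) {
            throw new IllegalArgumentException("sizes must be non-negative and block size positive");
        }
        return (fileSizeBytes + blockSizeBytes - 1) / blockSizeBytes;
    }

    /** Size of the final block. It holds only the remaining bytes, not a full block. */
    static long lastBlockSize(long fileSizeBytes, long blockSizeBytes) {
        if (fileSizeBytes == 0) {
            return 0;
        }
        long remainder = fileSizeBytes % blockSizeBytes;
        return remainder == 0 ? blockSizeBytes : remainder;
    }

    /** Raw bytes consumed across the cluster once every block is replicated. */
    static long rawStorage(long fileSizeBytes, int replication) {
        return fileSizeBytes * replication;
    }

    public static void main(String[] args) {
        long blockSize = 128 * MB; // assumed dfs.blocksize
        int replication = 3;       // assumed dfs.replication
        long fileSize = 300 * MB;  // hypothetical file

        System.out.println("blocks       = " + blockCount(fileSize, blockSize));
        System.out.println("last block   = " + lastBlockSize(fileSize, blockSize) / MB + " MB");
        System.out.println("raw storage  = " + rawStorage(fileSize, replication) / MB + " MB");
    }
}
```

With these assumptions, a 300 MB file occupies 3 blocks (128 MB, 128 MB, and 44 MB) and consumes 900 MB of raw disk space across the cluster.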

    2. ?????????

    • ??????????????HDFS??????????????????????
    • ????????????????????????????
    • ??????????????????????????

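    3. HDFS architecture

    HDFS consists of the NameNode, the DataNodes, and the Secondary NameNode.

    • NameNode: manages the file system metadata.
    • DataNode: stores the data blocks and reports them to the NameNode.
    • Secondary NameNode: helps the NameNode by merging the metadata image and the edit log. It is not a hot standby.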
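
The division of labor between the NameNode and the DataNodes can be illustrated with a toy in-memory model. This is only a sketch of the idea (metadata on one side, block storage on the other). ToyNameNode and its methods are hypothetical names, and the real NameNode is far more involved. The host names node02 and node03 are likewise invented for the example.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.NoSuchElementException;
import java.util.Set;
import java.util.TreeSet;

/** Toy model: the NameNode holds only metadata, never file contents. */
final class ToyNameNode {
    // file path -> ordered block IDs that make up the file
    private final Map<String, List<Long>> fileToBlocks = new HashMap<>();
    // block ID -> DataNodes that have reported a replica of it
    private final Map<Long, Set<String>> blockToDataNodes = new HashMap<>();
    private long nextBlockId = 1;

    /** Registers a file and allocates block IDs for it. */
    List<Long> createFile(String path, int blockCount) {
        List<Long> ids = new ArrayList<>();
        for (int i = 0; i < blockCount; i++) {
            long id = nextBlockId++;
            ids.add(id);
            blockToDataNodes.put(id, new TreeSet<>());
        }
        fileToBlocks.put(path, ids);
        return ids;
    }

    /** Called when a DataNode reports that it stores a replica of a block. */
    void reportReplica(String dataNode, long blockId) {
        Set<String> holders = blockToDataNodes.get(blockId);
        if (holders == null) {
            throw new IllegalArgumentException("unknown block " + blockId);
        }
        holders.add(dataNode);
    }

    /** What a client asks before reading: which DataNodes hold each block of the file. */
    Map<Long, Set<String>> locate(String path) {
        List<Long> ids = fileToBlocks.get(path);
        if (ids == null) {
            throw new NoSuchElementException(path);
        }
        Map<Long, Set<String>> result = new LinkedHashMap<>();
        for (long id : ids) {
            result.put(id, blockToDataNodes.get(id));
        }
        return result;
    }

    public static void main(String[] args) {
        ToyNameNode nameNode = new ToyNameNode();
        nameNode.createFile("/kaikeba/a.txt", 2);
        nameNode.reportReplica("node01", 1L);
        nameNode.reportReplica("node02", 1L);
        nameNode.reportReplica("node03", 2L);
        System.out.println(nameNode.locate("/kaikeba/a.txt"));
    }
}
```

In this model the client reads the block contents from the DataNodes listed in the answer, and the NameNode stays out of the data path.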

    4. ?????

    • ?????????????????????
    • ?????????Unix???????????????

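    HDFS Command-Line Operations

    HDFS is operated from the shell with hadoop fs or hdfs dfs, whose subcommands resemble the familiar Linux file commands. Common operations:

    • Create a directory:
      hdfs dfs -mkdir /path
    • Upload a file:
      hdfs dfs -put local_file hdfs_path
    • Download a file:
      hdfs dfs -get hdfs_path local_file
    • Delete a file:
      hdfs dfs -rm hdfs_path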

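    HDFS Safe Mode

    HDFS safe mode is a protective, read-only state of the NameNode. While it is active, clients can read from the file system, but operations that modify it (creating, writing, or deleting files) are rejected.

    • Automatic: when the cluster starts, the NameNode stays in safe mode until 99.9% of the blocks have been reported by the DataNodes, and then leaves safe mode on its own.
    • Manual: safe mode can be controlled with hdfs dfsadmin -safemode enter | leave | get | wait.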
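
The startup rule can be sketched as a simple threshold check. This is a simplified illustration, not Hadoop's actual implementation. It assumes the 99.9% figure is the fraction of known blocks that DataNodes must have reported, and it expresses the threshold in per-mille so the comparison stays in exact integer arithmetic. SafeModeCheck and canLeaveSafeMode are hypothetical names.

```java
/** Simplified illustration of the safe-mode exit condition (not Hadoop code). */
final class SafeModeCheck {
    /**
     * @param totalBlocks       blocks the NameNode knows about from its metadata
     * @param reportedBlocks    blocks that DataNodes have reported so far
     * @param thresholdPerMille required fraction in per-mille, e.g. 999 for 99.9%
     * @return true if enough blocks have been reported to leave safe mode
     */
    static boolean canLeaveSafeMode(long totalBlocks, long reportedBlocks, int thresholdPerMille) {
        if (totalBlocks < 0 || reportedBlocks < 0 || reportedBlocks > totalBlocks) {
            throw new IllegalArgumentException("invalid block counts");
        }
        if (thresholdPerMille < 0 || thresholdPerMille > 1000) {
            throw new IllegalArgumentException("threshold must be between 0 and 1000");
        }
        // reported / total >= threshold / 1000, rearranged to avoid floating point
        return reportedBlocks * 1000 >= totalBlocks * thresholdPerMille;
    }

    public static void main(String[] args) {
        System.out.println(canLeaveSafeMode(1000, 998, 999));
        System.out.println(canLeaveSafeMode(1000, 999, 999));
    }
}
```

With 1000 known blocks, 998 reported blocks are not enough to leave safe mode, while 999 are. An empty file system (0 blocks) satisfies the condition immediately.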

    Using the Java API

    1. ????

    • Windows?????Hadoop??????????????????
    • Mac??????????????Hadoop???

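    2. Project dependencies

    Add the Hadoop client dependency to the Maven project:

    <dependency>
        <groupId>org.apache.hadoop</groupId>
        <artifactId>hadoop-client</artifactId>
        <version>${hadoop.version}</version>
    </dependency>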

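    3. Java API example

    import java.io.IOException;
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.fs.permission.FsAction;
    import org.apache.hadoop.fs.permission.FsPermission;
    import org.junit.Test; // assumes JUnit 4; the original does not say which JUnit version is used

    // Place this method inside a test class.
    @Test
    public void createDirectory() throws IOException {
        Configuration configuration = new Configuration();
        // Point the client at the NameNode
        configuration.set("fs.defaultFS", "hdfs://node01:8020");
        // try-with-resources closes the FileSystem even if mkdirs throws
        try (FileSystem fileSystem = FileSystem.get(configuration)) {
            Path dirPath = new Path("/kaikeba/dir1");
            // owner: rwx, group: r--, others: -w-
            FsPermission permission = new FsPermission(FsAction.ALL, FsAction.READ, FsAction.WRITE);
            fileSystem.mkdirs(dirPath, permission);
        }
    }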

    ?????????

  • ?????HDFS?????????????????????
  • ???????HDFS??????????????????
  • ?????HDFS???????Hive?Spark??????

  • ???????????????HDFS????????????????

    Reposted from: http://fg.baihongyu.com/
