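Contents
- Hadoop Overview
- Hadoop Core Component: HDFS
- Hadoop Core Component: MapReduce
- Hadoop Core Component: YARN
- Advantages of Hadoop
- History of Hadoop
- The Hadoop Ecosystem
- Choosing a Hadoop Distribution
- Using the OOTB Environment
Hadoop Overview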
The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing.
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.
- Open source
- Distributed storage and computation
- Distributed
Modules
- Hadoop Common: The common utilities that support the other Hadoop modules.
- Hadoop Distributed File System (HDFS™): A distributed file system that provides high-throughput access to application data.
- Hadoop YARN: A framework for job scheduling and cluster resource management.
- Hadoop MapReduce: A YARN-based system for parallel processing of large data sets.
- Hadoop Ozone: An object store for Hadoop.
Translate the passage above.
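Hadoop Core Component: HDFS
Origin
- Based on Google's GFS (Google File System) paper
- An open-source clone of GFS
Features
- Scalable
- Fault tolerant
- Stores massive amounts of data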
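The fault-tolerance point can be illustrated with a small simulation. This is only a sketch, not HDFS code: the function names, the node names, and the round-robin placement are assumptions made for illustration (real HDFS placement is rack-aware and uses much larger blocks).

```python
def split_into_blocks(data: bytes, block_size: int) -> list[bytes]:
    """Cut a byte string into fixed-size blocks (the last one may be shorter)."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def place_replicas(num_blocks: int, nodes: list[str], replication: int) -> dict[int, list[str]]:
    """Assign each block to `replication` distinct nodes, round-robin.

    Round-robin is a simplification chosen for this sketch.
    """
    if replication > len(nodes):
        raise ValueError("replication factor exceeds number of nodes")
    return {
        b: [nodes[(b + r) % len(nodes)] for r in range(replication)]
        for b in range(num_blocks)
    }


def readable_blocks(placement: dict[int, list[str]], failed: set[str]) -> set[int]:
    """Return the blocks that still have at least one replica on a live node."""
    return {
        b for b, holders in placement.items()
        if any(node not in failed for node in holders)
    }


# Hypothetical four-node cluster, three replicas per block.
blocks = split_into_blocks(b"x" * 1000, block_size=256)
placement = place_replicas(len(blocks), ["node1", "node2", "node3", "node4"], replication=3)

# Losing a single node leaves every block readable.
assert readable_blocks(placement, failed={"node2"}) == set(range(len(blocks)))
```

With three replicas, a block becomes unreadable only if all three nodes holding it fail at the same time, which is why the file system can tolerate individual machine failures.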
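Hadoop Core Component: MapReduce
Origin
- Based on Google's MapReduce paper
- An open-source clone of Google MapReduce
Features
- Scalable
- Fault tolerant
- Offline (batch) processing of massive data sets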
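The programming model itself is small enough to sketch in a few lines. The example below is a single-process word count written in plain Python; it does not use the Hadoop API, and the function names are illustrative. It only shows the three steps (map, shuffle, reduce) that the framework distributes across a cluster.

```python
from collections import defaultdict


def map_phase(line: str):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group all emitted values by key, as happens between the two phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reducer: sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence on a list of lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


counts = word_count(["hadoop stores data", "hadoop processes data"])
print(counts)  # {'hadoop': 2, 'stores': 1, 'data': 2, 'processes': 1}
```

Because each mapper call sees only one line and each reducer call sees only one key, both phases can run in parallel on different machines, and a failed call can simply be rerun.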
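Hadoop Core Component: YARN
- Yet Another Resource Negotiator
- Manages and schedules the resources of the entire cluster
Features
- Scalable
- Fault tolerant
- Unified resource scheduling across multiple frameworks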
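As a rough illustration of unified scheduling, the toy resource manager below hands out containers from one shared pool to jobs from different frameworks. Everything in it is hypothetical: the class, the first-fit policy, the job names, and the resource figures are assumptions for this sketch and do not reflect the YARN API or its actual schedulers.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    free_mem_mb: int
    free_vcores: int


class ToyResourceManager:
    """Hypothetical, heavily simplified stand-in for a cluster resource manager."""

    def __init__(self, nodes):
        self.nodes = list(nodes)
        self.allocations = []  # (app, node name, mem_mb, vcores)

    def allocate(self, app: str, mem_mb: int, vcores: int):
        """Grant a container on the first node with enough free resources (first fit)."""
        for node in self.nodes:
            if node.free_mem_mb >= mem_mb and node.free_vcores >= vcores:
                node.free_mem_mb -= mem_mb
                node.free_vcores -= vcores
                self.allocations.append((app, node.name, mem_mb, vcores))
                return node.name
        return None  # no capacity; a real scheduler would queue the request


rm = ToyResourceManager([Node("node1", 4096, 4), Node("node2", 4096, 4)])
first = rm.allocate("mapreduce-job", 3072, 2)  # fits on node1
second = rm.allocate("spark-job", 2048, 2)     # node1 has only 1024 MB left, so node2
third = rm.allocate("flink-job", 4096, 1)      # no node has 4096 MB free, so None
print(first, second, third)  # node1 node2 None
```

The point of the sketch is that all three jobs draw from the same pool of memory and cores, regardless of which framework submitted them.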
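Advantages of Hadoop
- Data storage: every data block is stored as multiple replicas
- Data computation: failed jobs are rescheduled and recomputed
- Machine scaling: machines can be added to scale linearly, and a cluster can contain thousands of nodes
- Lower cost: removes the dependence on IOE (IBM servers, Oracle databases, EMC storage)
- Mature ecosystem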
History of Hadoop
The Hadoop Ecosystem
Features
- Open source, with an active community
- Mature
- Covers most areas of big data
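Choosing a Hadoop Distribution
- Apache community release
- Third-party distributions (such as CDH, HDP, and MapR)
  - Pros: based on the Apache license and 100% open source; clear version management; better compatibility, security, and stability than Apache Hadoop. Third-party distributions have usually been through extensive testing and validation, have many deployments, and run in a wide range of production environments.
  - Cons: some components are not open source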
Using the OOTB Environment
//switch to root
$ sudo -i
# cd /etc/sysconfig/network-scripts/
# ls
//delete the ifcfg-lo file
# rm -f ifcfg-lo
//test network connectivity
# ping baidu.com
PING baidu.com (220.181.38.148) 56(84) bytes of data.
64 bytes from 220.181.38.148 (220.181.38.148): icmp_seq=1 ttl=46 time=42.7 ms
64 bytes from 220.181.38.148 (220.181.38.148): icmp_seq=2 ttl=46 time=42.0 ms
64 bytes from 220.181.38.148 (220.181.38.148): icmp_seq=3 ttl=46 time=45.0 ms
64 bytes from 220.181.38.148 (220.181.38.148): icmp_seq=4 ttl=46 time=44.4 ms