Course catalog: Administrator Training for Apache Hadoop
Course outline:

1: HDFS (17%)
Describe the function of HDFS daemons
Describe the normal operation of an Apache Hadoop cluster, both in data storage and in data processing
Identify current features of computing systems that motivate a system like Apache Hadoop.
Classify the major goals of HDFS design
Given a scenario, identify an appropriate use case for HDFS Federation
Identify the components and daemons of an HDFS HA-Quorum cluster
Analyze the role of HDFS security (Kerberos)
Determine the best data serialization choice for a given scenario
Describe file read and write paths
Identify the commands to manipulate files in the Hadoop File System Shell
2: YARN and MapReduce version 2 (MRv2) (17%)
Understand how upgrading a cluster from Hadoop 1 to Hadoop 2 affects cluster settings
Understand how to deploy MapReduce v2 (MRv2 / YARN), including all YARN daemons
Understand basic design strategy for MapReduce v2 (MRv2)
Determine how YARN handles resource allocations
Identify the workflow of a MapReduce job running on YARN
Determine which files you must change, and how, in order to migrate a cluster from MapReduce version 1 (MRv1) to MapReduce version 2 (MRv2) running on YARN
3: Hadoop Cluster Planning (16%)
Identify the principal points to consider when choosing the hardware and operating systems to host an Apache Hadoop cluster
Analyze the choices in selecting an OS
Understand kernel tuning and disk swapping
Given a scenario and workload pattern, identify a hardware configuration appropriate to the scenario
Given a scenario, determine the ecosystem components your cluster needs to run in order to fulfill the SLA
Cluster sizing: given a scenario and frequency of execution, identify the specifics for the workload, including CPU, memory, storage, disk I/O
Disk sizing and configuration, including JBOD versus RAID, SANs, virtualization, and disk sizing requirements in a cluster
Network topologies: understand network usage in Hadoop (for both HDFS and MapReduce) and propose or identify key network design components for a given scenario
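As a rough illustration of the cluster sizing arithmetic this unit covers, the following minimal sketch estimates raw disk capacity and DataNode count from a data volume. The function name `estimate_datanodes`, the 25% reserve for intermediate data, and the 24 TB of usable disk per node are hypothetical values chosen for the example, not figures from the course; the replication factor of 3 is the HDFS default.

```python
import math


def estimate_datanodes(raw_data_tb, replication=3, temp_overhead=0.25,
                       disk_per_node_tb=24.0):
    """Return (required raw capacity in TB, number of DataNodes).

    temp_overhead and disk_per_node_tb are illustrative assumptions.
    """
    if not 0 <= temp_overhead < 1:
        raise ValueError("temp_overhead must be in [0, 1)")
    # HDFS stores every block `replication` times.
    replicated_tb = raw_data_tb * replication
    # Keep a fraction of the disks free for intermediate (shuffle/temp) data.
    required_tb = replicated_tb / (1 - temp_overhead)
    # Round up: a fractional node still means one more machine.
    nodes = math.ceil(required_tb / disk_per_node_tb)
    return required_tb, nodes


if __name__ == "__main__":
    required, nodes = estimate_datanodes(100)
    print(f"{required:.0f} TB raw capacity, {nodes} DataNodes")
```

A real sizing exercise would also account for data growth, compression, and the CPU, memory, and disk I/O needs of the workload, which this sketch leaves out.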
4: Hadoop Cluster Installation and Administration (25%)
Given a scenario, identify how the cluster will handle disk and machine failures
Analyze a logging configuration and logging configuration file format
Understand the basics of Hadoop metrics and cluster health monitoring
Identify the function and purpose of available tools for cluster monitoring
Be able to install all the ecosystem components in CDH 5, including (but not limited to): Impala, Flume, Oozie, Hue, Cloudera Manager, Sqoop, Hive, and Pig
Identify the function and purpose of available tools for managing the Apache Hadoop file system
5: Resource Management (10%)
Understand the overall design goals of each of the Hadoop schedulers
Given a scenario, determine how the FIFO Scheduler allocates cluster resources
Given a scenario, determine how the Fair Scheduler allocates cluster resources under YARN
Given a scenario, determine how the Capacity Scheduler allocates cluster resources
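To make the idea of fair allocation in this unit concrete, here is a simplified sketch of weighted max-min fair sharing of a single resource among queues. It is a teaching model under stated assumptions, not the Fair Scheduler's actual implementation: the real scheduler also handles minimum shares, preemption, hierarchical queues, and other scheduling policies. The function name `fair_shares` and the queue names are hypothetical.

```python
def fair_shares(capacity, demands, weights=None):
    """Weighted max-min fair allocation of one resource.

    demands maps queue name -> requested amount; returns queue -> allocation.
    Simplified model only (no min shares, preemption, or nested queues).
    """
    weights = weights or {q: 1.0 for q in demands}
    alloc = {q: 0.0 for q in demands}
    active = {q for q, d in demands.items() if d > 0}
    remaining = float(capacity)
    while active and remaining > 1e-9:
        total_weight = sum(weights[q] for q in active)
        # Queues whose demand fits inside their weighted share of what is left.
        satisfied = {q for q in active
                     if demands[q] <= remaining * weights[q] / total_weight}
        if not satisfied:
            # Every remaining queue wants more than its share: split by weight.
            for q in active:
                alloc[q] = remaining * weights[q] / total_weight
            break
        # Fully serve the small demands, then redistribute the leftover.
        for q in satisfied:
            alloc[q] = float(demands[q])
            remaining -= demands[q]
        active -= satisfied
    return alloc


if __name__ == "__main__":
    print(fair_shares(100, {"A": 20, "B": 50, "C": 80}))
```

With equal weights, queue A's small demand is met in full and the unused part of its share is split evenly between B and C.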
6: Monitoring and Logging (15%)
Understand the functions and features of Hadoop’s metric collection abilities
Analyze the NameNode and JobTracker Web UIs
Understand how to monitor cluster daemons
Identify and monitor CPU usage on master nodes
Describe how to monitor swap and memory allocation on all nodes
Identify how to view and manage Hadoop’s log files
Interpret a log file
