お問い合わせを送信いただきありがとうございます!当社のスタッフがすぐにご連絡いたします。
予約を送信いただきありがとうございます!当社のスタッフがすぐにご連絡いたします。
コース概要
Introduction
- Introduction to Cloud Computing and Big Data solutions
- Overview of Apache Hadoop Features and Architecture
Setting up Hadoop
- Planning a Hadoop cluster (on-premise, cloud, etc.)
- Selecting the OS and Hadoop distribution
- Provisioning resources (hardware, network, etc.)
- Downloading and installing the software
- Sizing the cluster for flexibility
Working with HDFS
- Understanding the Hadoop Distributed File System (HDFS)
- Overview of HDFS Command Reference
- Accessing HDFS
- Performing Basic File Operations on HDFS
- Using S3 as a complement to HDFS
Overview of the MapReduce
- Understanding Data Flow in the MapReduce Framework
- Map, Shuffle, Sort and Reduce
- Demo: Computing Top Salaries
Working with YARN
- Understanding resource management in Hadoop
- Working with ResourceManager, NodeManager, Application Master
- Scheduling jobs under YARN
- Scheduling for large numbers of nodes and clusters
- Demo: Job scheduling
Integrating Hadoop with Spark
- Setting up storage for Spark (HDFS, Amazon, S3, NoSQL, etc.)
- Understanding Resilient Distributed Datasets (RDDs)
- Creating an RDD
- Implementing RDD Transformations
- Demo: Implementing a Text Search Program for Movie Titles
Managing a Hadoop Cluster
- Monitoring Hadoop
- Securing a Hadoop cluster
- Adding and removing nodes
- Running a performance benchmark
- Tuning a Hadoop cluster to optimizing performance
- Backup, recovery and business continuity planning
- Ensuring high availability (HA)
Upgrading and Migrating a Hadoop Cluster
- Assessing workload requirements
- Upgrading Hadoop
- Moving from on-premise to cloud and vice-versa
- Recovering from failures
Troubleshooting
Summary and Conclusion
要求
- System administration experience
- Experience with Linux command line
- An understanding of big data concepts
Audience
- System administrators
- DBAs
35 時間
お客様の声 (5)
The live examples
Ahmet Bolat - Accenture Industrial SS
コース - Python, Spark, and Hadoop for Big Data
very interactive...
Richard Langford
コース - SMACK Stack for Data Science
実践的な内容で、講師の知識が豊富です
Chris Tan
コース - A Practical Introduction to Stream Processing
機械翻訳
Get to learn spark streaming , databricks and aws redshift
Lim Meng Tee - Jobstreet.com Shared Services Sdn. Bhd.
コース - Apache Spark in the Cloud
実践的な面だけでなく、アジェイによる理論的側面もよく説明されました。
Dominik Mazur - Capgemini Polska Sp. z o.o.
コース - Hadoop Administration on MapR
機械翻訳