Syllabus of Cloudera CCA Admin Certification Online Training Course
Module 1: Introduction to Big Data, Hadoop and Cloudera
- 1. Big Data Definition, Examples, Characteristics & Hurdles
- 2. Apache Hadoop Definition, Ecosystem Components, Versions
- 3. Distributions
- 4. Cloudera Definition, Cloudera Manager, Components & Architecture
Module 2: Using Oracle VirtualBox
- 1. Setup Linux VM
- 2. Connecting via SSH
- 3. Transferring Files
Module 3: Using Cloudera QuickStart VM
- 1. Introduction to Cloudera QuickStart VM
- 2. Exploring Cloudera Manager
Module 4: Cloudera Hadoop Installation
- 1. Installing Cloudera Manager
- 2. Preparing Node for Hadoop
- 3. Hadoop Installation using Cloudera Manager
Module 5: Working with HDFS
- 1. Introduction to HDFS
- 2. HDFS Commands
- 3. HDFS Web UI
Module 6: Working with YARN
- 1. Introduction to YARN
- 2. Exploring YARN
- 3. Spark Integration with YARN
Module 7: Installing Sqoop
- 1. Introduction to Sqoop
- 2. Installing Sqoop Services using Cloudera Manager
Module 8: Ingesting Data Using Sqoop
- 1. Importing Data Using Sqoop
- 2. Exporting Data Using Sqoop
- 3. Sqoop Best Practices
Module 9: Working with Hive
- 1. Introduction to Hive
- 2. Working with Hive
Module 10: Working with Pig and Impala
- 1. Working with Pig
- 2. Working with Impala
Module 11: Flume & Hadoop REST Services
- 1. Working with Apache Flume
- 2. Exploring Hadoop REST Services
Module 12: Managing YARN Resources – Part 1
- 1. YARN Memory & CPU Settings
- 2. Working with Fair Scheduler
Module 13: Managing YARN Resources – Part 2
- 1. Dynamic Resource Pools
- 2. Configuring Control Groups
Module 14: Cloudera Manager Features
- 1. Exploring Cloudera Manager Features
- 2. Configuring Services, Logs, Ports, etc.
Module 15: Cluster Maintenance Operations
- 1. Exploring Cloudera Cluster Maintenance
- 2. HDFS Operations using Cloudera Manager
Module 16: Planning a Hadoop Cluster
- 1. Planning Considerations
- 2. Working with Hosts
- 3. Creating & Managing Users
Module 17: Integrating Kerberos with Hadoop
- 1. Introduction to Kerberos
- 2. Enabling Kerberos Authentication for Hadoop
Module 18: Advanced Cluster Management – Part 1
- 1. Cloudera Manager Health Tests
- 2. Cluster Backup & Recovery
Module 19: Advanced Cluster Management – Part 2
- 1. High Availability for Components
- 2. Working with HDFS via HttpFS
Module 20: Advanced Cluster Management – Part 3
- 1. Password-less SSH
- 2. Local Repositories
- 3. Data Encryption