Specialization in BigData
Basic Prerequisite Videos
AWS Account Creation (7:08)
AWS Red Hat - Linux Instance Launch (17:31)
AWS - Putty Linux Connect SetUp (18:39)
How To Download RedHat Linux @ No Cost (3:48)
Install RedHat OS RHEL 8 Using Virtualization Over Oracle Virtualbox (38:02)
How to Configure YUM / DNF in RedHat 8 (42:11)
Linux Basic Commands (44:14)
SPLUNK
Session 1 - 6th May - Source Types | Index | Data Inputs | Data Sources | Splunk CLI (415:55)
Session 1 - Summary
Session 2 - 7th May - Field Extraction | Parsing using Regular Expression | Parsing using Delimiters | Line Breaking (383:13)
Session 2 - Summary
Session 3 - 13th May - Lookups | Lookup Table | Splunk Universal Forwarder | Indexer | Types of Splunk (296:17)
Session 3 - Summary
Session 4 - 20th May - IAM | user authentication | RBAC | create custom role | inheritance | default index (164:49)
Session 4 - Summary
Session 5 - 26th May - Extra Session: Integrate with AWS Kinesis (121:28)
Session 5 - Summary
Kafka Session
Kafka Session - 28th May - Apache Kafka - low latency | run time | fault tolerance | high throughput | webserver | machine learning module | data transport | batch | network | storage | sources | tightly coupled | partition | kafka broker | kafka cluster | scala program | JVM | create kafka cluster (323:54)
Session 1 - Summary
Kafka Session - 3rd June - Apache Kafka Training Session - kafka connect | kafka broker | kafka cluster | bigdata | java | MongoDB | confluent hub | plugin | source connector | os image | container image | docker image | launch kafka broker in container | storage file (286:37)
Hadoop
Session 1 - 17th June - Big data | storage limitations | IO operations | Hadoop | framework | volume | velocity | Hadoop cluster configuration | EC2 instance | master node | data nodes | creating directories | transferring files | managing data (318:01)
Session 1 - Summary
Session 2 - 18th June - fault tolerance | SPOF | multiple blocks | High Availability | Replicating data | replication factor | master node | Data Nodes | block's name | data durability | Monitoring network activities | tcpdump tool | analyze packets | network communication (235:49)
Session 2 - Summary
Session 3 - 24th June - HDFS cluster | handling and analyzing large volumes | Ceph storage | S3 service (AWS) | managing data | compute unit | horizontal scaling | single point of failure | MapReduce clusters | MapReduce algorithm | JobTracker | TaskTrackers (298:38)
Session 3 - Summary
Session 4 - 2nd July - MapReduce clusters | compute cluster | HDFS cluster | create cluster with HDFS and MapReduce | data lake | mapping concept (139:15)
Amazon Redshift
Summary - Session 1
Summary - Session 2
Summary - Session 3
Summary - Session 4