This site is powered by
course builder. Create your online course today.
Start now
Create your course
with
Autoplay
Autocomplete
Previous Lesson
Complete and Continue
Specialization in BigData
Basic Prerequisite Videos
AWS Account Creation (7:08)
AWS Red Hat - Linux Instance Launch (17:31)
AWS - Putty Linux Connect SetUp (18:39)
How To Download RedHat Linux @ No Cost (3:48)
Install RedHat OS RHEL 8 Using Virtualization Over Oracle Virtualbox (38:02)
How to Configure YUM / DNF in RedHat 8 (42:11)
Linux Basic Commands (44:14)
SPLUNK
Session 1 - 6th May-Source Types | Index | Data Inputs |Data Sources | Splunk CLI (415:55)
Session 1 - Summary
Session 2 - 7th May-Field Extraction | Parsing using Regular Expression | Parsing using Delimiters | Line Breaking (383:13)
Session 2 - Summary
Session 3 - 13th May-Lookups | Lookup Table | Splunk Universal Forwarder | Indexer | Types of Splunk (296:17)
Session 3 - Summary
Session 4 - 20th May-IAM | user authentication | RBAC | create custom role | inheritance | default index (164:49)
Session 4 - Summary
Session 5 - 26th May-Extra Session_Integrate with AWS Kinesis (121:28)
Session 5 - Summary
Kafka Session
Kafka Session - 28th May-Apache Kafka-low latency | run time |fault tolerance |high throughput| webserver|machine learning module|data transport |batch |network |storage |sources|tightly coupled |partation|kafka broker|kaf cluster |scala program |JVM |create kafka cluster (323:54)
Session 1- Summary
Kafka Session - 3rd June-Apache Kafka Training Session-kafka connect |kafka broker |kafka cluster|bigdata |java|mongoDB |confluent hub | plugin |source connector |os image |container image| docker image |launch kafka broker in container |storage file| (286:37)
Hadoop
Session 1 - 17th June-Big data| storage limitations |IO operations |Hadoop |framework |volume |velocity | Hadoop cluster configuration |ec2 instance |master node |data nodes |creating directories |transferring files |managing data| (318:01)
Session 1 - Summary
Session 2 - 18th June-fault tolerance |SPOF|multiple blocks| High Availability |Replicating data |replication factor|master node|Data Nodes|block's name|data durability|Monitoring network activities| tcpdump tool |analyze packets|network communication (235:49)
Session 2 - Summary
Session 3 - 24th June-HDFS cluster| handling and analyzing large volumes |CEPH storage |S3 service (AWS)|managing data.|compute unit|horizontal scaling|single point of failure|MapReduce clusters|MapReduce algorithm |JobTracker|TaskTrackers| (298:38)
Session 3 - Summary
Session 4 - 2nd July-MapReduce clusters |compute cluster |HDFS cluster | create cluster with HDFS and MapReduce |data lake| maping concept (139:15)
Amazon Redshift
Summary - Session 1
Summary Session 2
Summary - Session 3
Summary Session 4
Session 3 - 24th June-HDFS cluster| handling and analyzing large volumes |CEPH storage |S3 service (AWS)|managing data.|compute unit|horizontal scaling|single point of failure|MapReduce clusters|MapReduce algorithm |JobTracker|TaskTrackers|
Lesson content locked
If you're already enrolled,
you'll need to login
.
Enroll in Course to Unlock