This course introduces you to the concept of DataOps—its origins, components, real-life applications, and ways to implement it.
WHAT YOU WILL LEARN
- Understand the process to establish a repeatable process that delivers rigor and repeatability
- Articulate the business value of any data sprint by capturing the KPI's the sprint will deliver
- Understand how to enable the organization's business, development and operations to continuously design, deliver and validate new data demands
Agenda of the Dataops Training Course
Agenda
====================================================
DataOps Concept and Foundation
The Problem with Datascience
- The knowledge Gap
- Lack of Support
- Challenges of of Data Analytics
Agile Collaboration for DataOps
- DataOps Manifesto
- DataOps Principles
- Data Science Life-Cycle
DevOps for DataOps
- Development and Operations
- Fast Flow from Continous Delivery
- Reproducible Environments
- Deployment Pipelines
- Continous Integration
- Automated Testing
Deployment and Release Processes
- Self-Service Deployments
- Release Processes
- DevOps Measurements
- Review Processes
- DevOps for Data Analytics
- The Data Conflict
- Data Pipeline Environments
- Data Pipeline Orchestration
- Data Pipeline Continous Integration
DataOps Technology
- Tools based on DataOps Values and Principles
- DataOps Technology Ecosystem
- The Assembly Line
- Data Integration
- Data Preparation
- Stream Processing
- Data Management
- Reproducibility, Deployment, Orchestration, and Monitoring
- Compute Infrastructure and Query Execution Engines
- Data Storage
- DataOps Platforms
- Data Analytics Tools
DataOps Tools Training
- Models & Architecture - DataOps Concept and Foundation
- Platform - Operating Systems - Centos/Ubuntu & VirtualBox & Vagrant
- Platform - Cloud - AWS
- Platform - Containers - Docker
- Planning and Designing - Jira & Confulence
- Programming Language - Python
- Source Code Versioning - Git using Github
- Container Orchestration - Kubernetes & Helm Introduction
- Database - Mysql
- Database - postgresql
- Data Analystics Engine - Apache Spark
- Reporting - Grafana
- ETL Tools - Apache Kafka
- Bigdata - Apache Hadoop
- DataOps Integration - Jenkins
- Big Data Tools for Visualization - Microsoft PowerBI
- Big Data Tools for Visualization - Tableau