Course Outline
Fundamentals of Big Data
- The role of Big Data in the corporate landscape
- Phases involved in developing a corporate Big Data strategy
- The rationale behind a holistic approach to Big Data
- Essential components of a Big Data platform
- Big Data storage solutions
- Limits of traditional technologies
- Overview of database types
- The four dimensions of Big Data
Business Impact of Big Data
- The business importance of Big Data
- Challenges associated with extracting valuable data
- Integrating Big Data with traditional data systems
Big Data Storage Technologies
- Overview of Big Data technologies
- Data storage models
- Hadoop
- Hive
- Cassandra
- MongoDB
- Selecting the appropriate Big Data technology
Processing Big Data
- Connecting to and extracting data from databases
- Transforming and preparing data for processing
- Utilizing Hadoop MapReduce for distributed data processing
- Monitoring and executing Hadoop MapReduce jobs
- Hadoop Distributed File System building blocks
- MapReduce and YARN
- Handling streaming data with Spark
Big Data Analysis Tools and Technologies
- Programming Hadoop with Pig Latin
- Querying Big Data using Hive
- Data mining with Mahout
- Visualization and reporting tools
Big Data in Business Context
- Managing and establishing Big Data requirements
- The business importance of Big Data
- Selecting the right Big Data tools for specific problems
Data Warehousing Concepts
- Definition of a Data Warehouse
- Differences between OLTP and Data Warehousing
- Data Acquisition
- Data Extraction
- Data Transformation
- Data Loading
- Data Marts
- Dependent vs. Independent Data Marts
- Database Design
ETL Testing Concepts
- Introduction
- Software Development Life Cycle
- Testing methodologies
- ETL Testing Workflow Process
- ETL Testing Responsibilities in DataStage
NoSQL Databases
Hadoop
MapReduce
Apache Spark
Requirements
Participants should have a general understanding of storage tools, some practical experience with them, and familiarity with handling large datasets.
14 Hours