Course Outline
Advanced Transformation Building Blocks
- Handling complex data types.
- Managing fields, metadata, and dynamic structures.
- Leveraging reusable transformation patterns.
Parameters, Variables, and Job-Oriented Design
- Understanding runtime variables and scoping.
- Implementing parameterized transformations.
- Designing parent-child job structures.
Database Integration and Lookup Strategies
- Utilizing advanced lookup steps.
- Employing effective caching strategies.
- Designing efficient joins.
Working with Files, APIs, and External Systems
- Processing JSON and XML data.
- Interacting with REST and SOAP services.
- Executing streaming and batch loads.
Error Handling and Data Quality Techniques
- Capturing and routing errors effectively.
- Applying data validation patterns.
- Conducting auditing and logging.
Performance Tuning Essentials
- Optimizing step design.
- Addressing memory and threading considerations.
- Identifying and resolving bottlenecks.
Introduction to Repository-Based Development
- Utilizing the Pentaho repository.
- Managing version control.
- Adopting team collaboration practices.
Deployment and Migration Practices
- Promoting jobs across different environments.
- Implementing configuration management.
- Following operational best practices.
Summary and Next Steps
Requirements
- A solid grasp of ETL fundamentals.
- Prior experience using Pentaho Data Integration.
- Fundamental knowledge of data warehousing concepts.
Target Audience
- ETL developers.
- Data engineers.
- Technical professionals seeking to expand their PDI capabilities.
Testimonials (3)
That it was very practical.
Alfonso Ramos - Banco de Mexico
Course - Fundamentos de Integración de Datos Pentaho
Machine Translated
Very useful in because it helps me understand what we can do with the data in our context. It will also help me
Nicolas NEMORIN - Adecco Groupe France
Course - KNIME Analytics Platform for BI
It's a hands-on session.