Get in Touch

Course Outline

Introduction to Programming Big Data with R (bpdR)

  • Configuring your environment for pbdR
  • Overview of pbdR's capabilities and available tools
  • Commonly used packages for Big Data alongside pbdR

Message Passing Interface (MPI)

  • Working with pbdR MPI 5
  • Parallel processing techniques
  • Point-to-point communication
  • Sending matrices
  • Summing matrices
  • Collective communication methods
  • Summing matrices using Reduce
  • Scatter and Gather operations
  • Additional MPI communication patterns

Distributed Matrices

  • Constructing a distributed diagonal matrix
  • Performing Singular Value Decomposition (SVD) on a distributed matrix
  • Building a distributed matrix in parallel

Statistics Applications

  • Monte Carlo Integration
  • Loading datasets
  • Reading data across all processes
  • Broadcasting from a single process
  • Accessing partitioned data
  • Distributed Regression analysis
  • Distributed Bootstrap methods
 21 Hours

Number of participants


Price per participant

Testimonials (2)

Upcoming Courses

Related Categories