コース概要

Introduction to Programming Big Data with R (bpdR)

  • Setting up your environment to use pbdR
  • Scope and tools available in pbdR
  • Packages commonly used with Big Data alongside pbdR

Message Passing Interface (MPI)

  • Using pbdR MPI 5
  • Parallel processing
  • Point-to-point communication
  • Send Matrices
  • Summing Matrices
  • Collective communication
  • Summing Matrices with Reduce
  • Scatter / Gather
  • Other MPI communications

Distributed Matrices

  • Creating a distributed diagonal matrix
  • SVD of a distributed matrix
  • Building a distributed matrix in parallel

Statistics Applications

  • Monte Carlo Integration
  • Reading Datasets
  • Reading on all processes
  • Broadcasting from one process
  • Reading partitioned data
  • Distributed Regression
  • Distributed Bootstrap
 21 時間

参加者の人数



Price per participant

お客様の声 (2)

関連コース

Introduction to Data Visualization with Tidyverse and R

7 時間

Data Vault: Building a Scalable Data Warehouse

28 時間

Spark Streaming with Python and Kafka

7 時間

Confluent KSQL

7 時間

Apache Ignite for Developers

14 時間

Unified Batch and Stream Processing with Apache Beam

14 時間

Apache Apex: Processing Big Data-in-Motion

21 時間

Apache Storm

28 時間

Apache NiFi for Administrators

21 時間

Apache NiFi for Developers

7 時間

Apache Flink Fundamentals

28 時間

Python and Spark for Big Data (PySpark)

21 時間

Introduction to Graph Computing

28 時間

Artificial Intelligence - the most applied stuff - Data Analysis + Distributed AI + NLP

21 時間

Apache Spark MLlib

35 時間

関連カテゴリー