EMR - Elastic MapReduce


EMR (Elastic MapReduce)

  • What it is: Managed Hadoop and Spark. This is exactly what we discussed with Dataproc earlier. It’s for running open-source big data frameworks.

  • Data Engineer Note: EMR is the industry standard for migrating legacy on-prem Hadoop clusters to the cloud.

  • Equivalents:

    • GCP: Dataproc

    • Azure: Azure HDInsight or Azure Databricks


Last updated