DEV Community

Cover image for EMR vs Serverless EMR
Chetan Hirapara
Chetan Hirapara

Posted on

1

EMR vs Serverless EMR

Recently aws has announced release of AWS Serverless in re:invent 2022 event. If you'are a Data Engineer or Bigdata developer with AWS data services then One obvious question will raise in everyone mind is why Serverless EMR when AWS Glue is already there in service list which is almost doing the same job.

Introduction of EMR

Before we move to understand EMR serverless, it is more helpful to get brief about EMR first.

EMR is fully managed Hadoop cluster in AWS to store, process and analyze big data systems​. It is a combination of Map reduce process that typically data enginners were doing in past on local machines or cluster.

In EMR to store intermidiate results we have HDFS/EMRFS/Local File system(Instance store/EBS)​. This is same as HDFS - hadoop distributed file system provided by spark or hadoop.

EMR support nearly 50+ softwares to use on your EMR cluster that you spin up to perform your daily jobs/tasks. i.e. Spark, Hive, HBase, Hue, Pig, JupyterLab, etc.

EMR vs EMR serverless

Hostinger image

Get n8n VPS hosting 3x cheaper than a cloud solution

Get fast, easy, secure n8n VPS hosting from $4.99/mo at Hostinger. Automate any workflow using a pre-installed n8n application and no-code customization.

Start now

Top comments (0)

AWS Security LIVE!

Join us for AWS Security LIVE!

Discover the future of cloud security. Tune in live for trends, tips, and solutions from AWS and AWS Partners.

Learn More