Rajnish
Posted on • Originally published at rajnishspandey.hashnode.dev

Databricks introduction

Databricks

Databricks is a unified, open analytics platform for building, deploying, sharing, and maintaining data, analytics, and AI solutions at scale.

Databricks Architecture and Services

Clusters

  • A cluster is a collection of VM (virtual machine) instances.

  • Computational workloads are distributed across the cluster's workers.
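
To make the idea of distributing work across workers concrete, here is a toy sketch in plain Python. It is not how Databricks or Spark actually schedules tasks; the round-robin rule, the `distribute` function, and the worker and partition names are all assumptions made up for illustration.

```python
from collections import defaultdict


def distribute(tasks, workers):
    """Assign tasks to workers round-robin (toy model, not Spark's real scheduler)."""
    if not workers:
        raise ValueError("need at least one worker to run tasks")
    assignment = defaultdict(list)
    for i, task in enumerate(tasks):
        # Task i goes to worker (i mod number_of_workers).
        assignment[workers[i % len(workers)]].append(task)
    return dict(assignment)


# Hypothetical workload: six data partitions spread over three worker VMs.
partitions = [f"partition-{n}" for n in range(6)]
workers = ["worker-0", "worker-1", "worker-2"]

plan = distribute(partitions, workers)
for worker, assigned in plan.items():
    print(worker, assigned)
```

Each worker ends up with two of the six partitions, which is the basic reason adding workers can shorten the run time of a job that splits cleanly.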

Comparison

There are two types of clusters: all-purpose clusters and job clusters.

| All-Purpose Clusters | Job Clusters |
| --- | --- |
| Analyse data collaboratively using interactive notebooks | Run automated jobs |
| Created from the workspace or the API | Created by the Databricks job scheduler when a job runs |
| Configuration information is retained for up to 70 clusters for up to 30 days | Configuration information is retained for up to 30 most recently terminated clusters |
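
The two creation paths in the comparison can be sketched as REST requests. This is a minimal sketch, assuming the `/api/2.0/clusters/create` and `/api/2.1/jobs/create` endpoints of the Databricks REST API. The host, token, cluster name, notebook path, runtime version, and node type below are placeholders, and `cluster_spec` and `api_request` are helper names invented for this example. The code only builds the requests and does not send them.

```python
import json
import urllib.request

# Placeholders (assumptions): replace with your own workspace URL and token.
HOST = "https://example.cloud.databricks.com"
TOKEN = "dapi-EXAMPLE"


def cluster_spec(num_workers):
    """Minimal cluster settings; version and node type are example values only."""
    return {
        "spark_version": "13.3.x-scala2.12",  # placeholder runtime version
        "node_type_id": "i3.xlarge",          # placeholder VM type
        "num_workers": num_workers,
    }


def api_request(host, token, path, payload):
    """Build (but do not send) an authenticated JSON POST request."""
    return urllib.request.Request(
        url=f"{host.rstrip('/')}{path}",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# All-purpose cluster: created explicitly, then shared by interactive notebooks.
all_purpose = dict(
    cluster_spec(2),
    cluster_name="shared-analysis",   # hypothetical name
    autotermination_minutes=60,
)
create_cluster = api_request(HOST, TOKEN, "/api/2.0/clusters/create", all_purpose)

# Job cluster: described inline as "new_cluster"; the job scheduler creates it
# when the job runs.
job = {
    "name": "nightly-report",         # hypothetical job name
    "tasks": [
        {
            "task_key": "run_report",
            "notebook_task": {"notebook_path": "/Shared/nightly_report"},
            "new_cluster": cluster_spec(2),
        }
    ],
}
create_job = api_request(HOST, TOKEN, "/api/2.1/jobs/create", job)

print(create_cluster.get_method(), create_cluster.full_url)
print(create_job.get_method(), create_job.full_url)
```

Passing either request to `urllib.request.urlopen` would send it to a real workspace. The point of the contrast is that an all-purpose cluster is its own resource, while a job cluster exists only as part of a job definition.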
