DEV Community

# bigdata

Posts

Database normalization may be harmful to efficiency on large scale analytics projects.

12
Comments 2
2 min read
AWS Certified Big Data: Specialty study blueprint

16
Comments
18 min read
My Databricks article compilation of 2019

6
Comments
2 min read
Converting CSV to ORC/Parquet fast without a cluster!

7
Comments
6 min read
Cloud Data Fusion, a game-changer for GCP

12
Comments 7
4 min read
Multi-Class Image Classification With Transfer Learning In PySpark

11
Comments
9 min read
Working with BigQuery Analytic Functions

6
Comments
5 min read
Building a Successful Modern Data Analytics Platform in the Cloud

8
Comments
11 min read
AWS: Redshift – quick start and SQL-workbench connection configuration

13
Comments
4 min read
Data Lake vs Data Warehouse

10
Comments
2 min read
Life Beyond Kafka with Apache Pulsar

19
Comments
4 min read
Explain MapReduce Like I'm Five

8
Comments
5 min read
Toward GCP Data Engineer certification

9
Comments
1 min read
Azure Blob Storage with Pyspark

12
Comments 1
2 min read
Building simple data pipelines in Azure using Cosmos DB, Databricks and Blob Storage

5
Comments
15 min read