DEV Community

# bigdata

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Vitess: Easy database deployment, clustering, and scaling!

Vitess: Easy database deployment, clustering, and scaling!

5
Comments
5 min read
Zero to Deployment and Evolution Data Catalog!

Zero to Deployment and Evolution Data Catalog!

4
Comments
6 min read
Build an analytics app with React and Cube.js

Build an analytics app with React and Cube.js

8
Comments
9 min read
Cardinality Counting in Redis

Cardinality Counting in Redis

2
Comments
4 min read
Cube Cloud Deep Dive: Mastering Pre-Aggregations

Cube Cloud Deep Dive: Mastering Pre-Aggregations

6
Comments
11 min read
BigQuery SQL Tip: QUALIFY clause

BigQuery SQL Tip: QUALIFY clause

5
Comments
1 min read
What Is Trino And Why Is It Great At Processing Big Data

What Is Trino And Why Is It Great At Processing Big Data

15
Comments
7 min read
Using PySpark and AWS Glue to analyze multi-line log files

Using PySpark and AWS Glue to analyze multi-line log files

12
Comments 1
5 min read
ETLs vs ELTs: Why are ELTs Disrupting the Data Market?

ETLs vs ELTs: Why are ELTs Disrupting the Data Market?

15
Comments
8 min read
How IoT integration with ERP system can bring business benefits

How IoT integration with ERP system can bring business benefits

2
Comments
4 min read
Bigdata: A problem and a solution

Bigdata: A problem and a solution

1
Comments
4 min read
Cube Cloud Deep Dive: Starting a New Cube App

Cube Cloud Deep Dive: Starting a New Cube App

16
Comments
9 min read
How Zero-Code Data Preparations Tools Enable Better, Faster IT Performance in the Age of Big Data

How Zero-Code Data Preparations Tools Enable Better, Faster IT Performance in the Age of Big Data

2
Comments
6 min read
Build your own data quality rules with AWS Glue DataBrew

Build your own data quality rules with AWS Glue DataBrew

10
Comments
6 min read
Identifying and handling personally identifiable information (PII) ด้วย AWS Glue DataBrew

Identifying and handling personally identifiable information (PII) ด้วย AWS Glue DataBrew

6
Comments
4 min read
Understanding Apache Hive LLAP

Understanding Apache Hive LLAP

3
Comments
7 min read
What Is Crypto and How Does It Work ?

What Is Crypto and How Does It Work ?

8
Comments 7
3 min read
Data lakes: building a serverless data pipeline

Data lakes: building a serverless data pipeline

3
Comments
6 min read
Installing Hadoop on the new M1 Pro and M1 Max MacBook Pro

Installing Hadoop on the new M1 Pro and M1 Max MacBook Pro

8
Comments
8 min read
Meet the Innovators with Krzysztof Nowocin 12:14

Meet the Innovators with Krzysztof Nowocin

2
Comments
6 min read
The Important SQL Queries for Beginners

The Important SQL Queries for Beginners

12
Comments
8 min read
A first update on our AI/ML/Big Data salary survey

A first update on our AI/ML/Big Data salary survey

2
Comments
2 min read
Data Engineering Introduction

Data Engineering Introduction

6
Comments
2 min read
Performance capabilities of data warehouses and how Cube can help

Performance capabilities of data warehouses and how Cube can help

18
Comments
18 min read
Scramjet Transform Hub — Quick Start introduction

Scramjet Transform Hub — Quick Start introduction

12
Comments
7 min read
Introduction to Scramjet Data Processing Platform

Introduction to Scramjet Data Processing Platform

9
Comments
3 min read
Find The Best Way To Load Data In A Data Warehouse

Find The Best Way To Load Data In A Data Warehouse

2
Comments
4 min read
Snowflake Vs BigQuery — Two Cloud Data Warehouses Of Many

Snowflake Vs BigQuery — Two Cloud Data Warehouses Of Many

12
Comments 2
6 min read
Getting Started With Apache Airflow

Getting Started With Apache Airflow

6
Comments
11 min read
Big Data & Analytics : Driving Value To Advanced Business Growth Initiatives

Big Data & Analytics : Driving Value To Advanced Business Growth Initiatives

6
Comments 1
6 min read
ทดสอบทำ Machine Learning predict customer churn โดยใช้งาน Amazon SageMaker กับ Snowflake!

ทดสอบทำ Machine Learning predict customer churn โดยใช้งาน Amazon SageMaker กับ Snowflake!

8
Comments 1
6 min read
Computing the Pearson correlation matrix on huge datasets in Python

Computing the Pearson correlation matrix on huge datasets in Python

6
Comments 2
5 min read
How to scrape twitter data with Headless Chrome and Puppeteer

How to scrape twitter data with Headless Chrome and Puppeteer

6
Comments 2
5 min read
Reliable ingestion from AWS S3 using Hudi

Reliable ingestion from AWS S3 using Hudi

3
Comments
6 min read
ทดสอบการทำ Anonymize data in your data lake with Amazon Athena

ทดสอบการทำ Anonymize data in your data lake with Amazon Athena

9
Comments 1
2 min read
เริ่มใช้งาน SQL-based INSERTS, DELETES and UPSERTS in S3 โดยใช้ AWS Glue 3.0 และ Delta Lake

เริ่มใช้งาน SQL-based INSERTS, DELETES and UPSERTS in S3 โดยใช้ AWS Glue 3.0 และ Delta Lake

10
Comments
6 min read
How to deal with Big data challenges

How to deal with Big data challenges

6
Comments
5 min read
SQL-based INSERTS, DELETES and UPSERTS in S3 using AWS Glue 3.0 and Delta Lake

SQL-based INSERTS, DELETES and UPSERTS in S3 using AWS Glue 3.0 and Delta Lake

21
Comments 8
8 min read
To let the beginners know their career goals who have opted data science.

To let the beginners know their career goals who have opted data science.

2
Comments
2 min read
Updating data files, commits vs. pull requests

Updating data files, commits vs. pull requests

6
Comments 4
3 min read
Unboxing a Database-How Databases Work Internally

Unboxing a Database-How Databases Work Internally

18
Comments 4
11 min read
Data Optimization for Compacted Partitions

Data Optimization for Compacted Partitions

3
Comments
8 min read
Apache Hudi - The Streaming Data Lake Platform

Apache Hudi - The Streaming Data Lake Platform

2
Comments
25 min read
UPSERTS and DELETES using AWS Glue and Delta Lake

UPSERTS and DELETES using AWS Glue and Delta Lake

25
Comments 4
10 min read
Exploratory Data Analysis Using Python

Exploratory Data Analysis Using Python

44
Comments 1
5 min read
Getting started with Azure Data Explorer and Azure Synapse Analytics for Big Data processing

Getting started with Azure Data Explorer and Azure Synapse Analytics for Big Data processing

3
Comments
9 min read
E-commerce Security Basics: How to Start with E-commerce Security

E-commerce Security Basics: How to Start with E-commerce Security

2
Comments
6 min read
Creating a Spark Standalone Cluster with Docker and docker-compose(2021 update)

Creating a Spark Standalone Cluster with Docker and docker-compose(2021 update)

35
Comments 4
7 min read
Understanding Open Telemetry and Observability w/ Splunk's Spiros Xanthos

Understanding Open Telemetry and Observability w/ Splunk's Spiros Xanthos

12
Comments
1 min read
How to Use Consistent Hashing in a System Design Interview?

How to Use Consistent Hashing in a System Design Interview?

7
Comments 3
7 min read
The Complete Guide to Data Science, Big Data, and Data Analytics

The Complete Guide to Data Science, Big Data, and Data Analytics

11
Comments
3 min read
How to easily install kafka without zookeeper

How to easily install kafka without zookeeper

5
Comments
7 min read
AWS Data Lake with Terraform - Part 2 of 6

AWS Data Lake with Terraform - Part 2 of 6

17
Comments
2 min read
AWS Data Lake with Terraform - Part 1 of 6

AWS Data Lake with Terraform - Part 1 of 6

28
Comments
4 min read
Assess how many Kafka servers are needed to face a scenario of 1 billion requests.

Assess how many Kafka servers are needed to face a scenario of 1 billion requests.

6
Comments
6 min read
Big Data + MySQL = Mission InnoPossible?

Big Data + MySQL = Mission InnoPossible?

4
Comments
9 min read
A Visual Guide To: Azure Data Factory

A Visual Guide To: Azure Data Factory

10
Comments
4 min read
AzureFunBytes Reminder - Intro to @Azure Data Factory with @KromerBigData - 5/13/2021

AzureFunBytes Reminder - Intro to @Azure Data Factory with @KromerBigData - 5/13/2021

7
Comments
3 min read
5 Ways Big Data & Analytics Can Pay Off To Your Marketing & Sales in 2021

5 Ways Big Data & Analytics Can Pay Off To Your Marketing & Sales in 2021

5
Comments 2
6 min read
AzureFunBytes Episode 43 - Intro to @Azure Data Factory with @KromerBigData

AzureFunBytes Episode 43 - Intro to @Azure Data Factory with @KromerBigData

6
Comments
3 min read
loading...