DEV Community

# bigdata

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Here is What Happens If You Decouple Your BI Stack

Here is What Happens If You Decouple Your BI Stack

5
Comments
7 min read
The New & Improved Spark UI & Spark History Server is now Generally Available

The New & Improved Spark UI & Spark History Server is now Generally Available

2
Comments
9 min read
Efficient Iteration of Big Data in Django

Efficient Iteration of Big Data in Django

5
Comments 1
4 min read
On explaining technical stuff in a non-technical way — (Py)Spark

On explaining technical stuff in a non-technical way — (Py)Spark

4
Comments
7 min read
ALGORITHMIC TRADING

ALGORITHMIC TRADING

6
Comments
3 min read
Event Streaming and AWS Kinesis

Event Streaming and AWS Kinesis

21
Comments
4 min read
Introduction to Data Analysis and Visualization using Python

Introduction to Data Analysis and Visualization using Python

4
Comments
4 min read
Introduction to Apache Airflow: get started in 5 minutes

Introduction to Apache Airflow: get started in 5 minutes

11
Comments
8 min read
Data, data everywhere!

Data, data everywhere!

5
Comments
2 min read
Inside Presto Optimizer

Inside Presto Optimizer

7
Comments
9 min read
Best Online Courses for Data Engineers In 2021

Best Online Courses for Data Engineers In 2021

32
Comments
7 min read
3 CTOs And Founders Perspectives on the Modern Data Stack

3 CTOs And Founders Perspectives on the Modern Data Stack

5
Comments
11 min read
Best Recommended AI and Data Science Books and Podcasts in 2021

Best Recommended AI and Data Science Books and Podcasts in 2021

2
Comments
3 min read
How I Built a Data Discovery API for AWS Data Lake

How I Built a Data Discovery API for AWS Data Lake

4
Comments
7 min read
"I learned right away how important Data manipulation and cleaning was to managing business affairs", — Matthew D. Groves

"I learned right away how important Data manipulation and cleaning was to managing business affairs", — Matthew D. Groves

6
Comments
4 min read
"Working with Data helps to uncover the inner workings of multiple spheres in our daily life",— Roksolana Diachuk.

"Working with Data helps to uncover the inner workings of multiple spheres in our daily life",— Roksolana Diachuk.

3
Comments
4 min read
"Data is always something new to learn in the area, some new tool to try, some new insight to discover", — Ruben Berenguel

"Data is always something new to learn in the area, some new tool to try, some new insight to discover", — Ruben Berenguel

2
Comments
4 min read
"Data is the new center of gravity", — Jules Damji.

"Data is the new center of gravity", — Jules Damji.

2
Comments
3 min read
The Management of Data

The Management of Data

8
Comments 1
3 min read
Configuration of Hadoop Cluster Using Ansible

Configuration of Hadoop Cluster Using Ansible

2
Comments
4 min read
How we implemented Distributed Multi-document ACID Transactions in Couchbase

How we implemented Distributed Multi-document ACID Transactions in Couchbase

7
Comments
14 min read
7 Real-Time Data Streaming Tools You Should Consider On Your Next Project

7 Real-Time Data Streaming Tools You Should Consider On Your Next Project

21
Comments 1
9 min read
Business Analytics tools & use cases

Business Analytics tools & use cases

4
Comments
7 min read
Elasticsearch as a primary database?

Elasticsearch as a primary database?

19
Comments
2 min read
Why You Need a CRM Data Cleanup

Why You Need a CRM Data Cleanup

2
Comments
8 min read
Data Engineering skills

Data Engineering skills

13
Comments 1
3 min read
Big Data, What's the Big Deal

Big Data, What's the Big Deal

1
Comments
2 min read
Starting your Journey with Big Data Analytics

Starting your Journey with Big Data Analytics

37
Comments
4 min read
Impact of COVID-19 on people's habits worldwide

Impact of COVID-19 on people's habits worldwide

8
Comments 5
2 min read
Apache Kafka: What is and how it works

Apache Kafka: What is and how it works

5
Comments 1
8 min read
Getting Started With JanusGraph

Getting Started With JanusGraph

2
Comments
5 min read
What is data engineering?

What is data engineering?

4
Comments
1 min read
A Look at the Long-Lasting Java and Big Data Relationship (With a List of Resources Data Scientists Can Use for Java Learning)

A Look at the Long-Lasting Java and Big Data Relationship (With a List of Resources Data Scientists Can Use for Java Learning)

5
Comments 1
8 min read
BIG DATA COURSE

BIG DATA COURSE

3
Comments
3 min read
What In The World Is Dremio And Why Is It Valued At 1 Billion Dollars?

What In The World Is Dremio And Why Is It Valued At 1 Billion Dollars?

5
Comments
7 min read
The ugly truth of the CDP

The ugly truth of the CDP

4
Comments
1 min read
Spark MLlib for Big data and Machine learning

Spark MLlib for Big data and Machine learning

8
Comments
4 min read
The Unbiased Guide to Choosing the Right BI Tool

The Unbiased Guide to Choosing the Right BI Tool

37
Comments 1
5 min read
Optimize Data Lake layout using Clustering in Apache Hudi

Optimize Data Lake layout using Clustering in Apache Hudi

2
Comments
6 min read
Aprendiendo Spark: #1 Introducción

Aprendiendo Spark: #1 Introducción

11
Comments
3 min read
Using Your Own Apache Spark/Hudi Versions With AWS EMR

Using Your Own Apache Spark/Hudi Versions With AWS EMR

4
Comments
2 min read
What is Chaos Engineering: Theory, Principles & Benefits

What is Chaos Engineering: Theory, Principles & Benefits

3
Comments
6 min read
Kinesis Data Streams vs. Kinesis Firehose Delivery Streams

Kinesis Data Streams vs. Kinesis Firehose Delivery Streams

2
Comments
3 min read
5 Best Hadoop Tutorials to Start in 2023

5 Best Hadoop Tutorials to Start in 2023

9
Comments
7 min read
写给女朋友的 SQL 教程——数据模型

写给女朋友的 SQL 教程——数据模型

2
Comments
1 min read
Right Sizing Snowflake Warehouses / Compute

Right Sizing Snowflake Warehouses / Compute

2
Comments
3 min read
What Is Big Data?

What Is Big Data?

3
Comments
6 min read
Hadoop Installation on Windows 10 using WSL

Hadoop Installation on Windows 10 using WSL

20
Comments
7 min read
Machine Learning and Artificial Intelligence

Machine Learning and Artificial Intelligence

3
Comments
8 min read
Here is a python ORM/Driver for InfluxDB : Influxable

Here is a python ORM/Driver for InfluxDB : Influxable

7
Comments
2 min read
Spark on Kubernetes Made Easy - How Data Mechanics Improves on the Open-Source version

Spark on Kubernetes Made Easy - How Data Mechanics Improves on the Open-Source version

7
Comments
5 min read
Data Analytics on AWS — What, Why & How

Data Analytics on AWS — What, Why & How

11
Comments
13 min read
Data Analyst vs Business Analyst

Data Analyst vs Business Analyst

20
Comments 5
4 min read
Obstacles on the road to automation: Why self-driving cars still need to overcome the big data hurdle

Obstacles on the road to automation: Why self-driving cars still need to overcome the big data hurdle

2
Comments
5 min read
Event Driven Data Pipelines in AWS

Event Driven Data Pipelines in AWS

5
Comments
9 min read
5 Reasons Why Big Data Analytics is the Best Career Move

5 Reasons Why Big Data Analytics is the Best Career Move

2
Comments
4 min read
What Are ETLs And Why We Use Them

What Are ETLs And Why We Use Them

30
Comments 2
14 min read
Automation and Machine Learning: A Match Made In Heaven

Automation and Machine Learning: A Match Made In Heaven

30
Comments 3
5 min read
Trying to grow an open-source ETL project with PHP

Trying to grow an open-source ETL project with PHP

4
Comments
1 min read
3 Ways To Improve Your Data Science Teams Efficiency

3 Ways To Improve Your Data Science Teams Efficiency

17
Comments
7 min read
loading...