Bigdata

Vinicius Fagundes

Jun 30

Your warehouse isn't expensive. Your full table scans are.

#bigdata #bigquery #redshift #aws

5 min read

DataDriven

Jun 16

Does Early Dragon Control Help Scaling Compositions? A Big Data Analysis of League of Legends

#bigdata #leagueoflegende #python #karmincorp

4 min read

RONI DAS

Jul 10

How Delta Lake Brings ACID to a Data Lake

#systemdesign #dataengineering #bigdata #database

3 min read

SciForce

Jun 4

Predictive Maintenance in 2026: How AI, Edge Computing, and Agentic Systems Turn Detection Into Action

#ai #manufacturing #bigdata #datascience

14 min read

Lê Đình Phú

Jun 22

How Uber Built Its Big Data System — From a Few TBs to 350 Petabytes with Sub-Hour Latency

#bigdata #dataengineering #apachehudi #architecture

9 min read

Ara

May 17

Migrating a ScyllaDB Cluster the “Brain Transplant” Way

#scylladb #bigdata #database

6 min read

sezin öztekin

May 10

"We Have DevOps, So Why Not DataOps?"

#datascience #bigdata #dataops #devops

2 min read

Ruslan

May 8

Valentina Studio v17.3 supports new VARIANT field type of DuckDB v1.5

#duckdb #valentinastudio #bigdata #database

1 min read

Lê Đình Phú

Jun 8

Why Big Tech is Migrating from Traditional Databases to NewSQL

#bigdata #dataengineering #database #sql

1 min read

Manish Podiyal

May 4

The Data Refinery: Why Apache Spark is the Engine Behind Real-World Big Data Use Cases

#bigdata #spark #pyspark #dataengineering

2 min read

Fu'ad Husnan

Jun 7

The Future of Query Optimization: AI-Driven Insights in Big Data

#bigdata #database #ai

7 min read

Apache SeaTunnel

May 28

The Next Decade of Data Engineering: From Modern Data Stack to Data Engineering Harness

#data #dataengineering #dataengineeringharness #bigdata

8 min read

StiiWann

May 19

Fentanyl Poverty: Building a Big Data Pipeline to Map America's Overdose Epidemic

#bigdata #elasticsearch #spark #python

3 min read

Charles Wu for OceanBase User Group

Apr 30

I Built a Knowledge Base That Thinks — Inspired by Karpathy’s LLM Wiki

#ai #productivity #llm #bigdata

6 min read

DEV Community

#bigdata

Top 12 Spark Interview Problems for Data Engineers, With Answers
