DEV Community

# bigdata

Posts

Test Driving Redshift AI-Driven Scaling

1
Comments
3 min read
Building Robust Data Pipelines: A Comprehensive Guide

4
Comments
3 min read
How to store and calculate historical big data with lower usage frequency

6
Comments
4 min read
Getting Started with Flink SQL, Apache Iceberg and DynamoDB Catalog

4
Comments
4 min read
Use Selenium with Python to Target the XPath of a Particular Object

Comments
9 min read
Simplifying ETL Pipelines with SQL: Three Tips for Data Processing

19
Comments
3 min read
🏆How to master 📊 Big Data pipelines with Taipy and PySpark 🐍

219
Comments 8
9 min read
Working with Parquet files in Java using Protocol Buffers

Comments
7 min read
IoT and Data Analytics: Unleashing the Power of Big Data

Comments 1
3 min read
Understanding Concurrency Through Amdahl's Law

4
Comments
3 min read
From Hadoop to Cloud: Why and How to Decouple Storage and Compute in Big Data Platforms

Comments
13 min read
Data Engineering Terminology: Understanding Upstream and Downstream in Data Pipelines

1
Comments
1 min read
Big data models 📊 vs. Computer memory 💾

187
Comments 3
11 min read
Working with Parquet files in Java using Avro

1
Comments
10 min read
Business Intelligence Data Analyst vs. BI Developer

3
Comments
3 min read
Cloud Data Analytics: A Journey to Actionable Insights & Data-driven Success

Comments
2 min read
BigData Journey from Hadoop and MapReduce to AWS EMR

Comments
9 min read
S3 Multi-Part Upload: Part 2 Conclusion

3
Comments
11 min read
Most common errors when setting up Amazon EMR

6
Comments
2 min read
15 top AI tools for marketing, infrastructure, and LLMOps

Comments
3 min read
Bridging Data and Marketing in the AI Arena: My Journey

Comments
2 min read
HyperLogLog | Un algoritmo para contarlos (aproximadamente) a todos (in English: "HyperLogLog | An algorithm to count them all, approximately")

2
Comments
6 min read
Install Hadoop on Ubuntu

4
Comments
6 min read
Which Scenarios Does ClickHouse Applies to?

5
Comments 1
9 min read
Data-Powered Accessibility: How to Build Inclusive Product for Any User Need

48
Comments
7 min read