DEV Community

# bigdata

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Loading data to Google Big Query using Dataproc workflow templates and cloud Schedule

Loading data to Google Big Query using Dataproc workflow templates and cloud Schedule

3
Comments
12 min read
Data Visualisation Basics

Data Visualisation Basics

8
Comments
7 min read
Connecting AI with Excel - Talk to Your Spreadsheets

Connecting AI with Excel - Talk to Your Spreadsheets

2
Comments
6 min read
Demystifying Data Science: A Beginner’s Guide!

Demystifying Data Science: A Beginner’s Guide!

Comments
3 min read
How to install Apache Kafka on Ubuntu with KRaft Mode (without Zookeeper): A Step-by-Step Guide

How to install Apache Kafka on Ubuntu with KRaft Mode (without Zookeeper): A Step-by-Step Guide

3
Comments
10 min read
Data Lakes vs. Data Warehouses: Choosing the Right Big Data Architecture

Data Lakes vs. Data Warehouses: Choosing the Right Big Data Architecture

1
Comments
4 min read
How to Install Hadoop on Ubuntu: A Step-by-Step Guide

How to Install Hadoop on Ubuntu: A Step-by-Step Guide

1
Comments
10 min read
🤔 Is It Possible to Achieve 100% Test Automation?

🤔 Is It Possible to Achieve 100% Test Automation?

Comments
2 min read
Optimize ETL Processes with Apache Iceberg: A Game Changer

Optimize ETL Processes with Apache Iceberg: A Game Changer

1
Comments
4 min read
Data ingestion – definition, types and best practices

Data ingestion – definition, types and best practices

Comments
8 min read
How to Handle Databases with Billions of Records

How to Handle Databases with Billions of Records

3
Comments
1 min read
Effective Strategies for Scaling Databases: Enhancing Performance for Growing Data Needs

Effective Strategies for Scaling Databases: Enhancing Performance for Growing Data Needs

4
Comments
5 min read
Data Driven Dreams: Building My Data Science Career

Data Driven Dreams: Building My Data Science Career

Comments
4 min read
Working with Parquet files in Java using Carpet

Working with Parquet files in Java using Carpet

1
Comments
6 min read
Optimizing ETL Processes for Efficient Data Loading in EDWs

Optimizing ETL Processes for Efficient Data Loading in EDWs

Comments 1
4 min read
Patient-Centered Care and Data Integration in Population Health Management

Patient-Centered Care and Data Integration in Population Health Management

Comments
4 min read
The Basics of Big Data: What You Need to Know

The Basics of Big Data: What You Need to Know

Comments
3 min read
Why Apache Doris is the Best Open Source Alternative to Rockset

Why Apache Doris is the Best Open Source Alternative to Rockset

3
Comments
3 min read
Introduction to Apache Hadoop & MapReduce

Introduction to Apache Hadoop & MapReduce

5
Comments
3 min read
Blazingly-Fast Serialization: Apache Fury 0.5.1 released

Blazingly-Fast Serialization: Apache Fury 0.5.1 released

Comments
3 min read
Databricks - Variant Type Analysis

Databricks - Variant Type Analysis

3
Comments
7 min read
Metadata for win — Apache Parquet

Metadata for win — Apache Parquet

Comments
5 min read
Comprehensive Guide to Schema Inference with MongoDB Spark Connector in PySpark

Comprehensive Guide to Schema Inference with MongoDB Spark Connector in PySpark

Comments
3 min read
Advanced Insights into Automated Data Processing Tools

Advanced Insights into Automated Data Processing Tools

1
Comments
4 min read
Documenting Rate Limits and Throttling in REST APIs

Documenting Rate Limits and Throttling in REST APIs

Comments
5 min read
loading...