DEV Community

# bigdata

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
🤔 Is It Possible to Achieve 100% Test Automation?

🤔 Is It Possible to Achieve 100% Test Automation?

Comments
2 min read
Data ingestion – definition, types and best practices

Data ingestion – definition, types and best practices

Comments
8 min read
How to Handle Databases with Billions of Records

How to Handle Databases with Billions of Records

3
Comments
1 min read
Effective Strategies for Scaling Databases: Enhancing Performance for Growing Data Needs

Effective Strategies for Scaling Databases: Enhancing Performance for Growing Data Needs

4
Comments
5 min read
Data Driven Dreams: Building My Data Science Career

Data Driven Dreams: Building My Data Science Career

Comments
4 min read
Working with Parquet files in Java using Carpet

Working with Parquet files in Java using Carpet

1
Comments
6 min read
Optimizing ETL Processes for Efficient Data Loading in EDWs

Optimizing ETL Processes for Efficient Data Loading in EDWs

Comments
4 min read
Patient-Centered Care and Data Integration in Population Health Management

Patient-Centered Care and Data Integration in Population Health Management

Comments
4 min read
The Basics of Big Data: What You Need to Know

The Basics of Big Data: What You Need to Know

Comments
3 min read
Why Apache Doris is the Best Open Source Alternative to Rockset

Why Apache Doris is the Best Open Source Alternative to Rockset

3
Comments
3 min read
Introduction to Apache Hadoop & MapReduce

Introduction to Apache Hadoop & MapReduce

5
Comments
3 min read
Blazingly-Fast Serialization: Apache Fury 0.5.1 released

Blazingly-Fast Serialization: Apache Fury 0.5.1 released

Comments
3 min read
Databricks - Variant Type Analysis

Databricks - Variant Type Analysis

1
Comments
7 min read
Metadata for win — Apache Parquet

Metadata for win — Apache Parquet

Comments
5 min read
Comprehensive Guide to Schema Inference with MongoDB Spark Connector in PySpark

Comprehensive Guide to Schema Inference with MongoDB Spark Connector in PySpark

Comments
3 min read
Advanced Insights into Automated Data Processing Tools

Advanced Insights into Automated Data Processing Tools

1
Comments
4 min read
Documenting Rate Limits and Throttling in REST APIs

Documenting Rate Limits and Throttling in REST APIs

Comments
5 min read
How to Build an API with Strong Security Measures

How to Build an API with Strong Security Measures

Comments
4 min read
GraphQL API Design Best Practices for Efficient Data Management

GraphQL API Design Best Practices for Efficient Data Management

Comments
5 min read
The current Lakehouse is like a false proposition

The current Lakehouse is like a false proposition

6
Comments 1
10 min read
Is distributed technology the panacea for big data processing?

Is distributed technology the panacea for big data processing?

7
Comments 1
10 min read
Big Data: a ferramenta que precisamos.

Big Data: a ferramenta que precisamos.

Comments
2 min read
PySpark: missing value

PySpark: missing value

Comments
2 min read
Cross-cluster replication for read-write separation

Cross-cluster replication for read-write separation

2
Comments
4 min read
Stream Data at scale from millions of sources with Amazon Kinesis (Serverless)

Stream Data at scale from millions of sources with Amazon Kinesis (Serverless)

13
Comments
7 min read
loading...