DEV Community

# bigdata

Posts

Bulk load to Elastic Search with PySpark

7
Comments
2 min read
Routable computing engine implements front-end database

Comments
5 min read
How does the in-memory database bring memory’s advantage into play?

Comments
12 min read
How to clone tables in BigQuery

2
Comments
1 min read
Why ETL Becomes ELT or Even LET?

Comments
8 min read
Processing EventHub Captured Messages in Avro Files Using Databricks

2
Comments
2 min read
HTAP: Learning from Xiaohongshu

1
Comments
5 min read
HTAP database cannot handle HTAP requirements

Comments
13 min read
Integrating Apache Age with Other Big Data Tools and Frameworks

2
Comments 1
2 min read
The current Lakehouse is like a false proposition

Comments
11 min read
How to make the columnar storage data warehouse more efficient

Comments
11 min read
Simplest pyspark tutorial

2
Comments
7 min read
Making Debezium 2.x Support Confluent Schema Registry

3
Comments 3
3 min read
Performance Enhancement: Conversion Funnel Analysis

Comments
9 min read
Boost Your Testing Strategy: The Coolest Methods to Prioritize A/B Tests Like a Pro! 🎲📊😎

3
Comments
4 min read
A Comprehensive Comparison of JuiceFS and HDFS for Cloud-Based Big Data Storage

2
Comments
11 min read
The Secret to Rapid Scaling: How Scraping Helped These Startups Go From Zero to $1.2+ Trillion

7
Comments 1
6 min read
Mastering Large-Scale Data Processing: Building a Data Pipeline with ApacheAGE for Efficient Ingestion, Processing, and Analysis

2
Comments
2 min read
How we mastered dbt: A true story

7
Comments
14 min read
Exploration of Spark Executor Memory

2
Comments
9 min read
GETTING STARTED WITH SENTIMENT ANALYSIS.

2
Comments
4 min read
Lightweight HTTP API for Big Data on S3

3
Comments
3 min read
How to cope with high-concurrency account query?

Comments
6 min read
Don't Break the Bank on SQL Queries: BigQuery On-Demand vs Flat-Rate prices. Which Saves You More? 💰😎

5
Comments 3
5 min read
Read before-The Ultimate Guide to AWS IoT Core: What it is, How it helps, and Real-World use Cases. Mini-Project-Intro

10
Comments
3 min read