DEV Community

# bigdata

Posts

Working with Parquet files in Java using Avro

1
Comments
10 min read
Business Intelligence Data Analyst vs. BI Developer

3
Comments
3 min read
Cloud Data Analytics: A Journey to Actionable Insights & Data-driven Success

Comments
2 min read
BigData Journey from Hadoop and MapReduce to AWS EMR

Comments
9 min read
S3 Multi-Part Upload: Part 2 Conclusion

3
Comments
11 min read
Most common errors when setting up Amazon EMR

6
Comments
2 min read
15 top AI tools for marketing, infrastructure, and LLMOps

Comments
3 min read
Bridging Data and Marketing in the AI Arena: My Journey

Comments
2 min read
HyperLogLog | An algorithm to count them all (approximately)

2
Comments
6 min read
Install Hadoop on Ubuntu

4
Comments
6 min read
Which Scenarios Does ClickHouse Apply to?

5
Comments 1
9 min read
Data-Powered Accessibility: How to Build Inclusive Product for Any User Need

48
Comments
7 min read
SPL computing performance test series: in-group accumulation

5
Comments
12 min read
Log Analysis: Elasticsearch VS Apache Doris

2
Comments
11 min read
SPL computing performance test series: funnel analysis

5
Comments
16 min read
SPL computing performance test series: position association

1
Comments
12 min read
SPL computing performance test series: multi-index aggregating

1
Comments
6 min read
Connecting Multiple Kafka Clusters in ClickHouse Using Named Collections

8
Comments
3 min read
SPL computing performance test series: associate tables and wide table

Comments
6 min read
Leveraging AI in Education: Exploring Big Data and Related Applications

Comments
11 min read
GlusterFS vs. JuiceFS

Comments
7 min read
50%+ Cut in Both Storage & Compute Costs: Designing NetEase Games' Cloud Big Data Platform

Comments
9 min read
What is '_spark_metadata' Directory in Spark Structured Streaming?

2
Comments
1 min read
SQL is consuming the lives of data scientists

6
Comments 3
20 min read
⛏ Get Mining into Data with These Top 5 Resources

5
Comments 2
6 min read