DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

Which speeds up development more: AI Coding Agents or Pair Programming?

1
Comments
5 min read
What is Conditional Probability?

1
Comments
3 min read
🔍 Understanding Supervised Learning: A Guide for Beginners

2
Comments
3 min read
A Primer to Framing Business Problems for Machine Learning

Comments
6 min read
[Personal Project #16] UEFA Women’s EURO 2025 Semifinals: What the Numbers Say About the Final Four

Comments
3 min read
🖼️ PixelSink: Hunt Hidden Data Inside Images

1
Comments
1 min read
Towards Sub-100ms Latency Stream Processing with an S3-Based Architecture

1
Comments
7 min read
Eigenvalues and Eigenvectors: Unveiling the Secrets of Data Transformation in Machine Learning

Comments
3 min read
Apache Iceberg Table Optimization #8: Hidden Pitfalls — Compaction and Partition Evolution in Apache Iceberg

Comments
3 min read
Apache Iceberg Table Optimization #9: Managing Large-Scale Optimizations — Parallelism, Checkpointing, and Fail Recovery

Comments
3 min read
Apache Iceberg Table Optimization #10: The Endgame — Building an Autonomous Optimization Pipeline for Apache Iceberg

Comments
3 min read
Apache Iceberg Table Optimization #2: The Basics of Compaction — Bin Packing Your Data for Efficiency

Comments
3 min read
Apache Iceberg Table Optimization #5: Avoiding Metadata Bloat with Snapshot Expiration and Rewriting Manifests

Comments
3 min read
Apache Iceberg Table Optimization #3: Optimizing Compaction for Streaming Workloads in Apache Iceberg

Comments
3 min read
Apache Iceberg Table Optimization #1: The Cost of Neglect — How Apache Iceberg Tables Degrade Without Optimization

Comments
3 min read