DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

Constructing a Station-Level Statistical Manifold with Dual Flat Structure from Pedestrian Trajectories

10 min read
Deduplicating 401,000 Equipment Auction Records with LLM Calibration

6 min read
Predicting 10 Minutes in 1 Square Meter: The Ultimate AI Boundary?

2 min read
AI-Powered Deduplication: How LLMs Supercharge the Golden Suite

8 min read
No It Wasn't A Waste Entirely

5 min read
GoldenMatch vs. BPID: Testing Against an EMNLP Benchmark

7 min read
10 Data Problems Every Pipeline Hits (and the One-Liner Fixes)

4 min read
📘 The Science of Un-Mixing Data (PCA & ICA)

4 min read
Data Science vs Data Analysis vs Machine Learning.

5 min read
April 9 - Visual AI Agents Workshop

1 min read
Entity Resolution on 208,000 Real Records with the Golden Suite

7 min read
How to Publish a Power BI Report and Embed It into a Website: A Complete Step-by-Step Guide

6 min read
Why I Built a 3,200-Line Python Pipeline to Generate Synthetic Financial Data From Math -- Not AI

5 min read
Confusion Matrix, Precision, Recall, and F1: A Practical Medical Screening Guide

3 min read
Understanding the Data Science Lifecycle From messy data to real-world impact – a step-by-step journey

5 min read