DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Reducing the Filtering Effect in Public School Admissions: A Bias-aware Analysis for Targeted Interventions

Reducing the Filtering Effect in Public School Admissions: A Bias-aware Analysis for Targeted Interventions

Comments
4 min read
Large Models of What? Mistaking Engineering Achievements for Human Linguistic Agency

Large Models of What? Mistaking Engineering Achievements for Human Linguistic Agency

Comments
4 min read
Learning to Break Deep Perceptual Hashing: The Use Case NeuralHash

Learning to Break Deep Perceptual Hashing: The Use Case NeuralHash

Comments
4 min read
Accuracy is Not All You Need

Accuracy is Not All You Need

Comments
4 min read
Backpropagation through space, time, and the brain

Backpropagation through space, time, and the brain

Comments
3 min read
Is GPT-4 conscious?

Is GPT-4 conscious?

Comments
4 min read
Parametric Matrix Models

Parametric Matrix Models

Comments
4 min read
Qwen2 Technical Report

Qwen2 Technical Report

Comments
4 min read
On scalable oversight with weak LLMs judging strong LLMs

On scalable oversight with weak LLMs judging strong LLMs

Comments
3 min read
Vision language models are blind

Vision language models are blind

Comments
4 min read
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Comments
4 min read
xLSTMTime : Long-term Time Series Forecasting With xLSTM

xLSTMTime : Long-term Time Series Forecasting With xLSTM

Comments
4 min read
AutoBencher: Creating Salient, Novel, Difficult Datasets for Language Models

AutoBencher: Creating Salient, Novel, Difficult Datasets for Language Models

Comments
3 min read
Handling Outliers|| Feature Engineering || Machine Learning

Handling Outliers|| Feature Engineering || Machine Learning

6
Comments
5 min read
R Programming: Zero to Hero Series 🚀

R Programming: Zero to Hero Series 🚀

1
Comments
1 min read
How Data Science is Transforming the Healthcare Industry

How Data Science is Transforming the Healthcare Industry

Comments
6 min read
Tracking Health with Data Engineering - Chapter 1: Meal Optimization

Tracking Health with Data Engineering - Chapter 1: Meal Optimization

Comments
6 min read
Can ChatGPT Pass a Theory of Computing Course?

Can ChatGPT Pass a Theory of Computing Course?

Comments
4 min read
Human-in-the-Loop Visual Re-ID for Population Size Estimation

Human-in-the-Loop Visual Re-ID for Population Size Estimation

Comments
4 min read
WildGaussians: 3D Gaussian Splatting in the Wild

WildGaussians: 3D Gaussian Splatting in the Wild

Comments
3 min read
UNSAT Solver Synthesis via Monte Carlo Forest Search

UNSAT Solver Synthesis via Monte Carlo Forest Search

Comments
4 min read
Transformer Layers as Painters

Transformer Layers as Painters

Comments
3 min read
TurboTLS: TLS connection establishment with 1 less round trip

TurboTLS: TLS connection establishment with 1 less round trip

Comments
5 min read
Beyond Euclid: An Illustrated Guide to Modern Machine Learning with Geometric, Topological, and Algebraic Structures

Beyond Euclid: An Illustrated Guide to Modern Machine Learning with Geometric, Topological, and Algebraic Structures

Comments
5 min read
Towards Enhancing Coherence in Extractive Summarization: Dataset and Experiments with LLMs

Towards Enhancing Coherence in Extractive Summarization: Dataset and Experiments with LLMs

Comments
3 min read
Agent Attention: On the Integration of Softmax and Linear Attention

Agent Attention: On the Integration of Softmax and Linear Attention

Comments
4 min read
Adapting Large Language Models via Reading Comprehension

Adapting Large Language Models via Reading Comprehension

Comments
4 min read
CodeUpdateArena: Benchmarking Knowledge Editing on API Updates

CodeUpdateArena: Benchmarking Knowledge Editing on API Updates

Comments
4 min read
SparQ Attention: Bandwidth-Efficient LLM Inference

SparQ Attention: Bandwidth-Efficient LLM Inference

Comments
4 min read
Different Encoding Methods for your Dataset.

Different Encoding Methods for your Dataset.

1
Comments
7 min read
Solving the SQL Murder Mystery: A Step-by-Step Guide

Solving the SQL Murder Mystery: A Step-by-Step Guide

Comments
4 min read
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

1
Comments
4 min read
K Means Clustering, Clustering: Unsupervised Machine Learning

K Means Clustering, Clustering: Unsupervised Machine Learning

Comments
4 min read
K Nearest Neighbors Classification, Classification: Supervised Machine Learning

K Nearest Neighbors Classification, Classification: Supervised Machine Learning

Comments
4 min read
Taming the Wild West: Data Preprocessing for Machine Learning Success

Taming the Wild West: Data Preprocessing for Machine Learning Success

Comments
5 min read
Differences between Persistent Connection and Non-Persistent Connection

Differences between Persistent Connection and Non-Persistent Connection

1
Comments
3 min read
AdaBoost - Ensemble Method, Classification: Supervised Machine Learning

AdaBoost - Ensemble Method, Classification: Supervised Machine Learning

Comments
7 min read
Decision Tree, Classification: Supervised Machine Learning

Decision Tree, Classification: Supervised Machine Learning

Comments
7 min read
Understanding Lookup Tables in Excel and SQL

Understanding Lookup Tables in Excel and SQL

Comments
12 min read
Logistic Regression, Classification: Supervised Machine Learning

Logistic Regression, Classification: Supervised Machine Learning

Comments
10 min read
Day 1 of 30 of Python

Day 1 of 30 of Python

Comments
2 min read
How to Enhance Model Performance with Effective Feature Engineering

How to Enhance Model Performance with Effective Feature Engineering

Comments
4 min read
Simulacra as Conscious Exotica

Simulacra as Conscious Exotica

Comments
4 min read
Machine Psychology: Investigating Emergent Capabilities and Behavior in Large Language Models Using Psychological Methods

Machine Psychology: Investigating Emergent Capabilities and Behavior in Large Language Models Using Psychological Methods

Comments
4 min read
Delving into ChatGPT usage in academic writing through excess vocabulary

Delving into ChatGPT usage in academic writing through excess vocabulary

Comments
3 min read
Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows

Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows

Comments
4 min read
AI Agents That Matter

AI Agents That Matter

Comments
3 min read
Which algorithm to select in sports timetabling?

Which algorithm to select in sports timetabling?

Comments
4 min read
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

Comments
3 min read
Distilling System 2 into System 1

Distilling System 2 into System 1

Comments
4 min read
LLMs can learn self-restraint through iterative self-reflection

LLMs can learn self-restraint through iterative self-reflection

Comments
5 min read
Vulnerability Detection with Code Language Models: How Far Are We?

Vulnerability Detection with Code Language Models: How Far Are We?

Comments
5 min read
Shadows of quantum machine learning

Shadows of quantum machine learning

Comments
4 min read
SmartChoices: Augmenting Software with Learned Implementations

SmartChoices: Augmenting Software with Learned Implementations

Comments
4 min read
Personalized Language Modeling from Personalized Human Feedback

Personalized Language Modeling from Personalized Human Feedback

Comments
4 min read
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Comments
4 min read
Achieving Energetic Superiority Through System-Level Quantum Circuit Simulation

Achieving Energetic Superiority Through System-Level Quantum Circuit Simulation

Comments
4 min read
When LLMs Play the Telephone Game: Cumulative Changes and Attractors in Iterated Cultural Transmissions

When LLMs Play the Telephone Game: Cumulative Changes and Attractors in Iterated Cultural Transmissions

Comments
4 min read
Memory, Consciousness and Large Language Model

Memory, Consciousness and Large Language Model

Comments
4 min read
LoRA+: Efficient Low Rank Adaptation of Large Models

LoRA+: Efficient Low Rank Adaptation of Large Models

Comments
3 min read
loading...