DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Databases Deconstructed: The Value of Data Lakehouses and Table Formats

Databases Deconstructed: The Value of Data Lakehouses and Table Formats

4
Comments
8 min read
Machine Psychology: Investigating Emergent Capabilities and Behavior in Large Language Models Using Psychological Methods

Machine Psychology: Investigating Emergent Capabilities and Behavior in Large Language Models Using Psychological Methods

1
Comments
4 min read
Simulacra as Conscious Exotica

Simulacra as Conscious Exotica

Comments
4 min read
Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows

Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows

Comments
4 min read
Which algorithm to select in sports timetabling?

Which algorithm to select in sports timetabling?

Comments
4 min read
AI Agents That Matter

AI Agents That Matter

Comments
3 min read
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

Comments
3 min read
Mixture of A Million Experts

Mixture of A Million Experts

1
Comments
3 min read
Distilling System 2 into System 1

Distilling System 2 into System 1

Comments
4 min read
The Reason behind Good or Bad: Towards a Better Mathematical Verifier with Natural Language Feedback

The Reason behind Good or Bad: Towards a Better Mathematical Verifier with Natural Language Feedback

Comments
4 min read
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models

Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models

Comments
4 min read
LLMs can learn self-restraint through iterative self-reflection

LLMs can learn self-restraint through iterative self-reflection

1
Comments
5 min read
Shadows of quantum machine learning

Shadows of quantum machine learning

Comments
4 min read
Vulnerability Detection with Code Language Models: How Far Are We?

Vulnerability Detection with Code Language Models: How Far Are We?

Comments
5 min read
Personalized Language Modeling from Personalized Human Feedback

Personalized Language Modeling from Personalized Human Feedback

Comments
4 min read
SmartChoices: Augmenting Software with Learned Implementations

SmartChoices: Augmenting Software with Learned Implementations

Comments
4 min read
When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

Comments
4 min read
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Comments
4 min read
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

Comments
4 min read
Achieving Energetic Superiority Through System-Level Quantum Circuit Simulation

Achieving Energetic Superiority Through System-Level Quantum Circuit Simulation

Comments
4 min read
Mooncake: Kimi's KVCache-centric Architecture for LLM Serving

Mooncake: Kimi's KVCache-centric Architecture for LLM Serving

1
Comments
4 min read
Reasoning in Large Language Models: A Geometric Perspective

Reasoning in Large Language Models: A Geometric Perspective

Comments
4 min read
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

1
Comments
3 min read
X-ray Made Simple: Radiology Report Generation and Evaluation with Layman's Terms

X-ray Made Simple: Radiology Report Generation and Evaluation with Layman's Terms

Comments
4 min read
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

Comments
4 min read
LoRA+: Efficient Low Rank Adaptation of Large Models

LoRA+: Efficient Low Rank Adaptation of Large Models

Comments
3 min read
Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

2
Comments
3 min read
A Multivariate Unimodality Test Harnessing the Dip Statistic of Mahalanobis Distances Over Random Projections

A Multivariate Unimodality Test Harnessing the Dip Statistic of Mahalanobis Distances Over Random Projections

Comments
3 min read
A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task

A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task

Comments
4 min read
ColPali: Efficient Document Retrieval with Vision Language Models

ColPali: Efficient Document Retrieval with Vision Language Models

3
Comments
4 min read
Volumetric Rendering with Baked Quadrature Fields

Volumetric Rendering with Baked Quadrature Fields

Comments
3 min read
When LLMs Play the Telephone Game: Cumulative Changes and Attractors in Iterated Cultural Transmissions

When LLMs Play the Telephone Game: Cumulative Changes and Attractors in Iterated Cultural Transmissions

Comments
4 min read
PaliGemma: A versatile 3B VLM for transfer

PaliGemma: A versatile 3B VLM for transfer

Comments
4 min read
Memory, Consciousness and Large Language Model

Memory, Consciousness and Large Language Model

Comments
4 min read
What's the Magic Word? A Control Theory of LLM Prompting

What's the Magic Word? A Control Theory of LLM Prompting

Comments
4 min read
Exploring the Latest LLMs for Leaderboard Extraction

Exploring the Latest LLMs for Leaderboard Extraction

Comments
4 min read
FACTS About Building Retrieval Augmented Generation-based Chatbots

FACTS About Building Retrieval Augmented Generation-based Chatbots

Comments
4 min read
Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization

Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization

Comments
4 min read
Testing AI on language comprehension tasks reveals insensitivity to underlying meaning

Testing AI on language comprehension tasks reveals insensitivity to underlying meaning

Comments
4 min read
Toto: Time Series Optimized Transformer for Observability

Toto: Time Series Optimized Transformer for Observability

Comments
5 min read
SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks

SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks

Comments
4 min read
There Has To Be a Lot That We're Missing: Moderating AI-Generated Content on Reddit

There Has To Be a Lot That We're Missing: Moderating AI-Generated Content on Reddit

Comments
4 min read
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Comments
4 min read
What is GitHub Copilot: detailed overview

What is GitHub Copilot: detailed overview

13
Comments
4 min read
Voxel51 Filtered Views Newsletter - July 12, 2024

Voxel51 Filtered Views Newsletter - July 12, 2024

1
Comments
11 min read
Optimizing ETL Processes for Efficient Data Loading in EDWs

Optimizing ETL Processes for Efficient Data Loading in EDWs

Comments
4 min read
Best Practices for Migrating Your Data to the Cloud

Best Practices for Migrating Your Data to the Cloud

Comments
5 min read
Patient-Centered Care and Data Integration in Population Health Management

Patient-Centered Care and Data Integration in Population Health Management

Comments
4 min read
¿Fortran es el nuevo Python?

¿Fortran es el nuevo Python?

1
Comments
2 min read
Useful datasets for AI/ML

Useful datasets for AI/ML

Comments
1 min read
Want to get started as a Data Engineer

Want to get started as a Data Engineer

Comments
1 min read
So I Built This: Broadening the Impact of What You’ve Built in the Lab

So I Built This: Broadening the Impact of What You’ve Built in the Lab

Comments
7 min read
Mastering the Art of Prompting: A Developer's Guide to Effective Prompt Design

Mastering the Art of Prompting: A Developer's Guide to Effective Prompt Design

5
Comments 5
4 min read
The Data Understanding Phase: The Key to a Successful Machine Learning Project

The Data Understanding Phase: The Key to a Successful Machine Learning Project

1
Comments
5 min read
How to Build a Data Entry System (Quick & Easy Guide)

How to Build a Data Entry System (Quick & Easy Guide)

2
Comments 1
15 min read
Self-Training LLMs for Text Classification using DQC Toolkit

Self-Training LLMs for Text Classification using DQC Toolkit

Comments
13 min read
My Journey As A Young Programmer.

My Journey As A Young Programmer.

4
Comments
2 min read
A Beginner's Guide to Generative AI: Understanding, Learning, and Implementing with Python and Hugging Face🐍🤗🤖

A Beginner's Guide to Generative AI: Understanding, Learning, and Implementing with Python and Hugging Face🐍🤗🤖

8
Comments
7 min read
What is Machine Learning?

What is Machine Learning?

Comments
2 min read
How to Capitalize String Python Dataframe Pandas

How to Capitalize String Python Dataframe Pandas

Comments
1 min read
loading...