DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models

Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models

Comments
4 min read
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference

Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference

Comments
4 min read
Neural Thermodynamic Integration: Free Energies from Energy-based Diffusion Models

Neural Thermodynamic Integration: Free Energies from Energy-based Diffusion Models

Comments
4 min read
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images

StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images

Comments
3 min read
State-Compute Replication: Parallelizing High-Speed Stateful Packet Processing

State-Compute Replication: Parallelizing High-Speed Stateful Packet Processing

Comments
4 min read
Social Norms in Cinema: A Cross-Cultural Analysis of Shame, Pride and Prejudice

Social Norms in Cinema: A Cross-Cultural Analysis of Shame, Pride and Prejudice

Comments
4 min read
Refusal in Language Models Is Mediated by a Single Direction

Refusal in Language Models Is Mediated by a Single Direction

1
Comments
3 min read
Large Language Models Are Zero-Shot Time Series Forecasters

Large Language Models Are Zero-Shot Time Series Forecasters

2
Comments
4 min read
MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

Comments
4 min read
Jellyfish: A Large Language Model for Data Preprocessing

Jellyfish: A Large Language Model for Data Preprocessing

2
Comments
4 min read
Chain-of-Thought Unfaithfulness as Disguised Accuracy

Chain-of-Thought Unfaithfulness as Disguised Accuracy

1
Comments
4 min read
Large language models surpass human experts in predicting neuroscience results

Large language models surpass human experts in predicting neuroscience results

1
Comments
4 min read
garak: A Framework for Security Probing Large Language Models

garak: A Framework for Security Probing Large Language Models

1
Comments
4 min read
LLAMAFUZZ: Large Language Model Enhanced Greybox Fuzzing

LLAMAFUZZ: Large Language Model Enhanced Greybox Fuzzing

1
Comments
4 min read
Should AI Optimize Your Code? A Comparative Study of Current Large Language Models Versus Classical Optimizing Compilers

Should AI Optimize Your Code? A Comparative Study of Current Large Language Models Versus Classical Optimizing Compilers

Comments
4 min read
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Comments
3 min read
Where there's a will there's a way: ChatGPT is used more for science in countries where it is prohibited

Where there's a will there's a way: ChatGPT is used more for science in countries where it is prohibited

Comments
3 min read
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Comments
4 min read
A Survey on In-context Learning

A Survey on In-context Learning

1
Comments
4 min read
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

Comments
4 min read
Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation

Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation

Comments
4 min read
A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges

A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges

Comments
6 min read
Depth Anything V2

Depth Anything V2

1
Comments
4 min read
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning

Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning

Comments
4 min read
DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer

DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer

Comments
4 min read
Scrolly2Reel: Retargeting Graphics for Social Media Using Narrative Beats

Scrolly2Reel: Retargeting Graphics for Social Media Using Narrative Beats

Comments
3 min read
Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

Comments
5 min read
A Survey on Large Language Models for Recommendation

A Survey on Large Language Models for Recommendation

Comments
4 min read
Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

Comments
4 min read
DataComp-LM: In search of the next generation of training sets for language models

DataComp-LM: In search of the next generation of training sets for language models

Comments
3 min read
Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews

Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews

Comments
4 min read
How Do Humans Write Code? Large Models Do It the Same Way Too

How Do Humans Write Code? Large Models Do It the Same Way Too

Comments
4 min read
CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

Comments
3 min read
Directly Fine-Tuning Diffusion Models on Differentiable Rewards

Directly Fine-Tuning Diffusion Models on Differentiable Rewards

Comments
4 min read
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Comments
4 min read
Introduction TO Word Embeddings

Introduction TO Word Embeddings

Comments
3 min read
Learning Resource Hub

Learning Resource Hub

2
Comments
2 min read
AnĂĄlise dos reservatĂłrios federais - parte 2

AnĂĄlise dos reservatĂłrios federais - parte 2

3
Comments
3 min read
Understanding OOPs in Python 🐍🌠

Understanding OOPs in Python 🐍🌠

4
Comments
5 min read
Introduction to Pandas

Introduction to Pandas

9
Comments 4
37 min read
A Comprehensive Guide to the Data Science Life Cycle with Python Libraries 🐍🤖

A Comprehensive Guide to the Data Science Life Cycle with Python Libraries 🐍🤖

7
Comments 2
3 min read
Guia BĂĄsico para tratar dados com Pandas em Python

Guia BĂĄsico para tratar dados com Pandas em Python

2
Comments
4 min read
Decoding Databases: The Backbone of Data Science

Decoding Databases: The Backbone of Data Science

1
Comments
2 min read
Clustering vs Partitioning your Apache Iceberg Tables

Clustering vs Partitioning your Apache Iceberg Tables

4
Comments
10 min read
Unleashing GPU Power: Supercharge Your Data Processing with cuDF

Unleashing GPU Power: Supercharge Your Data Processing with cuDF

Comments
4 min read
Tech to Non-tech in Search of better Tech Roles again!

Tech to Non-tech in Search of better Tech Roles again!

Comments
2 min read
10 In-Demand Highest-Paying Python Jobs in 2024

10 In-Demand Highest-Paying Python Jobs in 2024

4
Comments
2 min read
The Data Professions

The Data Professions

1
Comments
3 min read
T-Test and Chi-Square Test in Data Analysis 🐍🤖🧠

T-Test and Chi-Square Test in Data Analysis 🐍🤖🧠

5
Comments
4 min read
ANOVA : Building and Understanding ANOVA in Python 🐍📶

ANOVA : Building and Understanding ANOVA in Python 🐍📶

10
Comments 2
4 min read
CVPR Survival Guide: Discovering Research That's Interesting to YOU!

CVPR Survival Guide: Discovering Research That's Interesting to YOU!

2
Comments
8 min read
Understanding the P-Test: A Beginner's Guide to Hypothesis Testing 🐍🅿️

Understanding the P-Test: A Beginner's Guide to Hypothesis Testing 🐍🅿️

4
Comments
3 min read
Embarking on My Tech Learning Journey

Embarking on My Tech Learning Journey

1
Comments
3 min read
5 Reasons to Make Power BI Your First Choice as a Data Science Student

5 Reasons to Make Power BI Your First Choice as a Data Science Student

Comments
3 min read
Idempotent in Computing: A Comprehensive Guide

Idempotent in Computing: A Comprehensive Guide

3
Comments
4 min read
Day 8 of Machine Learning ||Linear Regression Part 2

Day 8 of Machine Learning ||Linear Regression Part 2

7
Comments 2
2 min read
Data Science and the Cloud

Data Science and the Cloud

6
Comments
6 min read
Machine Learning Roadmap for Beginners ( If you have a Non-CS background like me😉)

Machine Learning Roadmap for Beginners ( If you have a Non-CS background like me😉)

1
Comments
3 min read
Understanding Hallucinations in Diffusion Models through Mode Interpolation

Understanding Hallucinations in Diffusion Models through Mode Interpolation

1
Comments
4 min read
An Empirical Study of Mamba-based Language Models

An Empirical Study of Mamba-based Language Models

6
Comments
4 min read
loading...