DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
IT Healthcare: Its Importance, Challenges And How To Find Good Healthcare Data

IT Healthcare: Its Importance, Challenges And How To Find Good Healthcare Data

Comments
7 min read
Create GPS Test Data In Go

Create GPS Test Data In Go

Comments
3 min read
Handling Noisy Labels in Text Classification

Handling Noisy Labels in Text Classification

Comments
16 min read
Constructors and Generators in Python

Constructors and Generators in Python

Comments
1 min read
Pixel is a Barrier: Diffusion Models Are More Adversarially Robust Than We Think

Pixel is a Barrier: Diffusion Models Are More Adversarially Robust Than We Think

2
Comments
4 min read
Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models

Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models

1
Comments
4 min read
RETVec: Resilient and Efficient Text Vectorizer

RETVec: Resilient and Efficient Text Vectorizer

Comments
3 min read
From LLM to NMT: Advancing Low-Resource Machine Translation with Claude

From LLM to NMT: Advancing Low-Resource Machine Translation with Claude

Comments
4 min read
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

Comments
3 min read
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Comments
4 min read
A Survey on Self-Evolution of Large Language Models

A Survey on Self-Evolution of Large Language Models

Comments
3 min read
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

Comments
4 min read
Landscape of Data Management Tools

Landscape of Data Management Tools

Comments
3 min read
Unveiling Insights into Project Management Software and its Demographics

Unveiling Insights into Project Management Software and its Demographics

Comments 1
5 min read
Empower your Projects with Face Recognition SDK: 9 Must-Have Features for Developers

Empower your Projects with Face Recognition SDK: 9 Must-Have Features for Developers

Comments
1 min read
CVPR 2024 Datasets and Benchmarks - Part 1: Datasets

CVPR 2024 Datasets and Benchmarks - Part 1: Datasets

Comments
14 min read
Functionally-Complete Boolean Logic in Real DRAM Chips: Experimental Characterization and Analysis

Functionally-Complete Boolean Logic in Real DRAM Chips: Experimental Characterization and Analysis

Comments
4 min read
Does GPT-4 pass the Turing test?

Does GPT-4 pass the Turing test?

1
Comments
4 min read
Think before you speak: Training Language Models With Pause Tokens

Think before you speak: Training Language Models With Pause Tokens

Comments
4 min read
Reading Between the Lines: Modeling User Behavior and Costs in AI-Assisted Programming

Reading Between the Lines: Modeling User Behavior and Costs in AI-Assisted Programming

Comments
4 min read
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Comments
4 min read
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Comments
4 min read
Matching Patients to Clinical Trials with Large Language Models

Matching Patients to Clinical Trials with Large Language Models

Comments
4 min read
Bot or Human? Detecting ChatGPT Imposters with A Single Question

Bot or Human? Detecting ChatGPT Imposters with A Single Question

2
Comments
4 min read
Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences

Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences

Comments
4 min read
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

Comments
5 min read
HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances

HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances

Comments
4 min read
Microsoft’s Phi-3 model is cool tech, but local LLMs are useless

Microsoft’s Phi-3 model is cool tech, but local LLMs are useless

Comments
6 min read
ReportLite Practice: Multi-layered cross-reports with complex formats

ReportLite Practice: Multi-layered cross-reports with complex formats

Comments
3 min read
INTRO : Apache Cassandra

INTRO : Apache Cassandra

Comments
2 min read
Top 5 Data Integration Tools for Modern Data Pipelines

Top 5 Data Integration Tools for Modern Data Pipelines

1
Comments
3 min read
TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

Comments
4 min read
Interpretable Graph Neural Networks for Tabular Data

Interpretable Graph Neural Networks for Tabular Data

Comments
4 min read
AI Consciousness is Inevitable: A Theoretical Computer Science Perspective

AI Consciousness is Inevitable: A Theoretical Computer Science Perspective

Comments
4 min read
Ten Hard Problems in Artificial Intelligence We Must Get Right

Ten Hard Problems in Artificial Intelligence We Must Get Right

2
Comments 3
4 min read
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Comments
4 min read
Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

2
Comments
4 min read
Deep Neural Networks via Complex Network Theory: a Perspective

Deep Neural Networks via Complex Network Theory: a Perspective

Comments
4 min read
5 Ways to Celebrate Earth Day as a Developer 🌎🌏🌍

5 Ways to Celebrate Earth Day as a Developer 🌎🌏🌍

13
Comments 3
4 min read
How to Detect Small Objects

How to Detect Small Objects

1
Comments
12 min read
Unleashing the Value of Data: A Journey into Data Monetization

Unleashing the Value of Data: A Journey into Data Monetization

1
Comments
2 min read
Distributed Databases

Distributed Databases

Comments
1 min read
NoSQL Deployment

NoSQL Deployment

Comments
1 min read
What is SQL in picture

What is SQL in picture

2
Comments 2
1 min read
Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding

Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding

2
Comments
4 min read
From $r$ to $Q^*$: Your Language Model is Secretly a Q-Function

From $r$ to $Q^*$: Your Language Model is Secretly a Q-Function

2
Comments
4 min read
Domino tiling library

Domino tiling library

Comments
1 min read
Language Imbalance Can Boost Cross-lingual Generalisation

Language Imbalance Can Boost Cross-lingual Generalisation

1
Comments
3 min read
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

2
Comments
4 min read
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

1
Comments
4 min read
Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations

Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations

1
Comments
3 min read
Online Advertisements with LLMs: Opportunities and Challenges

Online Advertisements with LLMs: Opportunities and Challenges

1
Comments
4 min read
mABC: multi-Agent Blockchain-Inspired Collaboration for root cause analysis in micro-services architecture

mABC: multi-Agent Blockchain-Inspired Collaboration for root cause analysis in micro-services architecture

1
Comments
4 min read
Information Retrieval with Entity Linking

Information Retrieval with Entity Linking

2
Comments
4 min read
Advanced SQL Techniques: Taking Your Data Skills to the Next Level

Advanced SQL Techniques: Taking Your Data Skills to the Next Level

1
Comments
2 min read
Twenty Constructionist Things to Do with Artificial Intelligence and Machine Learning

Twenty Constructionist Things to Do with Artificial Intelligence and Machine Learning

Comments
4 min read
A decoder-only foundation model for time-series forecasting

A decoder-only foundation model for time-series forecasting

Comments
4 min read
LLM Agents can Autonomously Exploit One-day Vulnerabilities

LLM Agents can Autonomously Exploit One-day Vulnerabilities

Comments
4 min read
Efficient Sentiment Analysis: A Resource-Aware Evaluation of Feature Extraction Techniques, Ensembling, and Deep Learning Models

Efficient Sentiment Analysis: A Resource-Aware Evaluation of Feature Extraction Techniques, Ensembling, and Deep Learning Models

1
Comments
4 min read
A Closer Look at AUROC and AUPRC under Class Imbalance

A Closer Look at AUROC and AUPRC under Class Imbalance

Comments
4 min read
loading...