DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
5 Ways to Celebrate Earth Day as a Developer 🌎🌏🌍

5 Ways to Celebrate Earth Day as a Developer 🌎🌏🌍

13
Comments 3
4 min read
Pixel is a Barrier: Diffusion Models Are More Adversarially Robust Than We Think

Pixel is a Barrier: Diffusion Models Are More Adversarially Robust Than We Think

3
Comments
4 min read
Bot or Human? Detecting ChatGPT Imposters with A Single Question

Bot or Human? Detecting ChatGPT Imposters with A Single Question

3
Comments
4 min read
👭 Women suffrage dates (suffragettes) celebration w/ data 🗳️

👭 Women suffrage dates (suffragettes) celebration w/ data 🗳️

3
Comments 4
1 min read
Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

2
Comments
4 min read
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

2
Comments
4 min read
Ten Hard Problems in Artificial Intelligence We Must Get Right

Ten Hard Problems in Artificial Intelligence We Must Get Right

2
Comments 3
4 min read
Information Retrieval with Entity Linking

Information Retrieval with Entity Linking

2
Comments
4 min read
From $r$ to $Q^*$: Your Language Model is Secretly a Q-Function

From $r$ to $Q^*$: Your Language Model is Secretly a Q-Function

2
Comments
4 min read
What is SQL in picture

What is SQL in picture

2
Comments 2
1 min read
Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding

Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding

2
Comments
4 min read
Language Imbalance Can Boost Cross-lingual Generalisation

Language Imbalance Can Boost Cross-lingual Generalisation

1
Comments
3 min read
Top 5 Data Integration Tools for Modern Data Pipelines

Top 5 Data Integration Tools for Modern Data Pipelines

1
Comments
3 min read
Efficient Sentiment Analysis: A Resource-Aware Evaluation of Feature Extraction Techniques, Ensembling, and Deep Learning Models

Efficient Sentiment Analysis: A Resource-Aware Evaluation of Feature Extraction Techniques, Ensembling, and Deep Learning Models

1
Comments
4 min read
Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models

Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models

1
Comments
4 min read
Advanced SQL Techniques: Taking Your Data Skills to the Next Level

Advanced SQL Techniques: Taking Your Data Skills to the Next Level

1
Comments
2 min read
Unleashing the Value of Data: A Journey into Data Monetization

Unleashing the Value of Data: A Journey into Data Monetization

1
Comments
2 min read
How to Detect Small Objects

How to Detect Small Objects

1
Comments
12 min read
mABC: multi-Agent Blockchain-Inspired Collaboration for root cause analysis in micro-services architecture

mABC: multi-Agent Blockchain-Inspired Collaboration for root cause analysis in micro-services architecture

1
Comments
4 min read
Does GPT-4 pass the Turing test?

Does GPT-4 pass the Turing test?

1
Comments
4 min read
Online Advertisements with LLMs: Opportunities and Challenges

Online Advertisements with LLMs: Opportunities and Challenges

1
Comments
4 min read
Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations

Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations

1
Comments
3 min read
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

1
Comments
4 min read
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

Comments
5 min read
Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences

Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences

Comments
4 min read
Matching Patients to Clinical Trials with Large Language Models

Matching Patients to Clinical Trials with Large Language Models

Comments
4 min read
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Comments
4 min read
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Comments
4 min read
Reading Between the Lines: Modeling User Behavior and Costs in AI-Assisted Programming

Reading Between the Lines: Modeling User Behavior and Costs in AI-Assisted Programming

Comments
4 min read
Think before you speak: Training Language Models With Pause Tokens

Think before you speak: Training Language Models With Pause Tokens

Comments
4 min read
Functionally-Complete Boolean Logic in Real DRAM Chips: Experimental Characterization and Analysis

Functionally-Complete Boolean Logic in Real DRAM Chips: Experimental Characterization and Analysis

Comments
4 min read
CVPR 2024 Datasets and Benchmarks - Part 1: Datasets

CVPR 2024 Datasets and Benchmarks - Part 1: Datasets

Comments
14 min read
Empower your Projects with Face Recognition SDK: 9 Must-Have Features for Developers

Empower your Projects with Face Recognition SDK: 9 Must-Have Features for Developers

Comments
1 min read
Unveiling Insights into Project Management Software and its Demographics

Unveiling Insights into Project Management Software and its Demographics

Comments 1
5 min read
Landscape of Data Management Tools

Landscape of Data Management Tools

Comments
3 min read
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

Comments
4 min read
A Survey on Self-Evolution of Large Language Models

A Survey on Self-Evolution of Large Language Models

Comments
3 min read
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Comments
4 min read
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

Comments
3 min read
From LLM to NMT: Advancing Low-Resource Machine Translation with Claude

From LLM to NMT: Advancing Low-Resource Machine Translation with Claude

Comments
4 min read
RETVec: Resilient and Efficient Text Vectorizer

RETVec: Resilient and Efficient Text Vectorizer

Comments
3 min read
Constructors and Generators in Python

Constructors and Generators in Python

Comments
1 min read
Handling Noisy Labels in Text Classification

Handling Noisy Labels in Text Classification

Comments
16 min read
Create GPS Test Data In Go

Create GPS Test Data In Go

Comments
4 min read
IT Healthcare: Its Importance, Challenges And How To Find Good Healthcare Data

IT Healthcare: Its Importance, Challenges And How To Find Good Healthcare Data

Comments
7 min read
Accelerating AI: The Role of Kubernetes in Data Science Workflows

Accelerating AI: The Role of Kubernetes in Data Science Workflows

Comments
2 min read
What Essential Skills Should Every Aspiring Data Scientist Develop?

What Essential Skills Should Every Aspiring Data Scientist Develop?

Comments
8 min read
JavaScript Histogram of Gaussian Distribution

JavaScript Histogram of Gaussian Distribution

Comments
7 min read
SpaceByte: Towards Deleting Tokenization from Large Language Modeling

SpaceByte: Towards Deleting Tokenization from Large Language Modeling

Comments
5 min read
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Comments
4 min read
An Analysis of the Math Requirements of 199 CS BS/BA Degrees at 158 U.S. Universities

An Analysis of the Math Requirements of 199 CS BS/BA Degrees at 158 U.S. Universities

Comments
3 min read
Removing Reflections from RAW Photos

Removing Reflections from RAW Photos

Comments
3 min read
Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting

Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting

Comments
4 min read
Do not think pink elephant!

Do not think pink elephant!

Comments
5 min read
web-scraping-wikipedia-tables

web-scraping-wikipedia-tables

Comments
6 min read
How to Estimate Depth from a Single Image

How to Estimate Depth from a Single Image

Comments
10 min read
Using AI on top of your DB

Using AI on top of your DB

Comments 1
1 min read
Boost Your Code's Efficiency: Introducing Semantic Cache with Qdrant

Boost Your Code's Efficiency: Introducing Semantic Cache with Qdrant

Comments
3 min read
Recapping the AI, Machine Learning and Data Science Meetup — April 18, 2024

Recapping the AI, Machine Learning and Data Science Meetup — April 18, 2024

Comments
6 min read
Computer Vision Meetup: Towards Resource Efficient Robust Text-to-Image Generative Models 21:34

Computer Vision Meetup: Towards Resource Efficient Robust Text-to-Image Generative Models

Comments
1 min read
loading...