DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

LDB: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step

Comments
4 min read
DoRA: Weight-Decomposed Low-Rank Adaptation

Comments
4 min read
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Comments
4 min read
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models

Comments
4 min read
Paint by Inpaint: Learning to Add Image Objects by Removing Them First

Comments
4 min read
DressCode: Autoregressively Sewing and Generating Garments from Text Guidance

Comments
3 min read
Benchmarking Mobile Device Control Agents across Diverse Configurations

1
Comments
4 min read
Learning Performance-Improving Code Edits

Comments
4 min read
DATA PROFILING: Uncovering Insights in Your Data with YData's Expertise

1
Comments
5 min read
Safeguarding Data Quality By Addressing Data Privacy and Security Concerns

1
Comments 1
4 min read
Best Practices for Designing an Efficient ETL Pipeline

4
Comments
4 min read
Active Liveness Detection vs Passive Liveness Detection

24
Comments
1 min read
Using AI on top of your DB

Comments 1
1 min read
How to Visualise MediaPipe’s Face and Face Landmark Detection in 2D and 3D with Rerun

8
Comments
3 min read
What is SQL in pictures. Diving deeper (Part 2)

1
Comments
2 min read
JOINS IN SQL

2
Comments
2 min read
Optimizing SQL Performance: Best Practices for Efficient Database Operations

Comments
3 min read
5 Ways to Celebrate Earth Day as a Developer 🌎🌏🌍

16
Comments 4
4 min read
Pandas reset_index(): How To Reset Indexes in Pandas

Comments
3 min read
Spark SQL: Toolkit for Smart Data Manipulation

5
Comments
2 min read
Annotation is dead

Comments 2
11 min read
How to pick the best-performing time-series AI model for your specific data

2
Comments
13 min read
Voxel51 Is Hiring AI Researchers and Scientists — What the New Open Science Positions Mean

Comments 2
6 min read
Observations on MLOps–A Fragmented Mosaic of Mismatched Expectations

Comments
9 min read
ETL VS ELT (Data Pipeline)

Comments 1
1 min read
Large Language Models can Learn Rules

2
Comments
4 min read
Voxel51 Filtered Views Newsletter – April 26, 2024

Comments
9 min read
Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion

1
Comments
4 min read
Brainformers: Trading Simplicity for Efficiency

Comments
4 min read
FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning

Comments
4 min read
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data

Comments
3 min read
NExT: Teaching Large Language Models to Reason about Code Execution

Comments
4 min read
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Comments
4 min read
Sort one column by another column in powerBI

Comments
2 min read
Boost Your Code's Efficiency: Introducing Semantic Cache with Qdrant

9
Comments
3 min read
How to Estimate Depth from a Single Image

1
Comments
10 min read
Web Scraping Wikipedia tables

Comments
6 min read
An Analysis of the Math Requirements of 199 CS BS/BA Degrees at 158 U.S. Universities

Comments
3 min read
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Comments
4 min read
SpaceByte: Towards Deleting Tokenization from Large Language Modeling

Comments
5 min read
Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting

Comments
4 min read
Do not think pink elephant!

Comments
5 min read
Removing Reflections from RAW Photos

Comments
3 min read
"Day 48 of My Learning Journey: Setting Sail into Data Excellence! ⛵️ Today's Focus: Maths for Data Analysis ( Per & Com - 3)

1
Comments
2 min read
Pixel is a Barrier: Diffusion Models Are More Adversarially Robust Than We Think

3
Comments
4 min read
Accelerating AI: The Role of Kubernetes in Data Science Workflows

Comments
2 min read
Create GPS Test Data In Go

3
Comments
4 min read
Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models

1
Comments
4 min read
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

Comments
3 min read
From LLM to NMT: Advancing Low-Resource Machine Translation with Claude

Comments
4 min read
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

Comments
4 min read
RETVec: Resilient and Efficient Text Vectorizer

Comments
3 min read
A Survey on Self-Evolution of Large Language Models

Comments
3 min read
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Comments
4 min read
Landscape of Data Management Tools

1
Comments
3 min read
Unveiling Insights into Project Management Software and its Demographics

Comments 1
5 min read
"Day 49 of My Learning Journey: Setting Sail into Data Excellence! ⛵️ Today's Focus: Maths for Data Analysis ( Per & Comb - 4)

1
Comments
1 min read
Empower your Projects with Face Recognition SDK: 9 Must-Have Features for Developers

21
Comments
1 min read
Bot or Human? Detecting ChatGPT Imposters with A Single Question

3
Comments
4 min read
Does GPT-4 pass the Turing test?

1
Comments
4 min read