DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

Cross-validation in Machine Learning

4 min read
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs

3 min read
Iterative Reasoning Preference Optimization

4 min read
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models

4 min read
Neural-Symbolic Recursive Machine for Systematic Generalization

4 min read
Relational Graph Convolutional Networks for Sentiment Analysis

5 min read
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step

4 min read
Benchmarking Benchmark Leakage in Large Language Models

4 min read
RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation

4 min read
Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos

3 min read
EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning

5 min read
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

4 min read
Make Your LLM Fully Utilize the Context

4 min read
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

4 min read
Relay Mining: Incentivizing Full Non-Validating Nodes Servicing All RPC Types

4 min read
Kosmos-G: Generating Images in Context with Multimodal Large Language Models

5 min read
DoRA: Weight-Decomposed Low-Rank Adaptation

4 min read
InstructEdit: Instruction-based Knowledge Editing for Large Language Models

4 min read
Extending Llama-3's Context Ten-Fold Overnight

4 min read
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models

4 min read
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

4 min read
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping

4 min read
Step Differences in Instructional Video

4 min read
Layered and Staged Monte Carlo Tree Search for SMT Strategy Synthesis

4 min read
Building a Large Japanese Web Corpus for Large Language Models

3 min read
Paint by Inpaint: Learning to Add Image Objects by Removing Them First

4 min read
Benchmarking Mobile Device Control Agents across Diverse Configurations

4 min read
DATA PROFILING: Uncovering Insights in Your Data with YData's Expertise

5 min read
Learning Performance-Improving Code Edits

4 min read
DressCode: Autoregressively Sewing and Generating Garments from Text Guidance

3 min read
Safeguarding Data Quality By Addressing Data Privacy and Security Concerns

4 min read
Best Practices for Designing an Efficient ETL Pipeline

4 min read
Active Liveness Detection vs Passive Liveness Detection

1 min read
Using AI on top of your DB

1 min read
How to Visualise MediaPipe's Face and Face Landmark Detection in 2D and 3D with Rerun

3 min read
What is SQL in pictures. Diving deeper (Part 2)

2 min read
JOINS IN SQL

2 min read
Optimizing SQL Performance: Best Practices for Efficient Database Operations

3 min read
5 Ways to Celebrate Earth Day as a Developer 🌎🌏🌍

4 min read
Pandas reset_index(): How To Reset Indexes in Pandas

3 min read
Spark SQL: Toolkit for Smart Data Manipulation

2 min read
Annotation is dead

11 min read
How to pick the best-performing time-series AI model for your specific data

13 min read
Voxel51 Is Hiring AI Researchers and Scientists - What the New Open Science Positions Mean

6 min read
Observations on MLOps–A Fragmented Mosaic of Mismatched Expectations

9 min read
Large Language Models can Learn Rules

4 min read
Voxel51 Filtered Views Newsletter – April 26, 2024

9 min read
Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion

4 min read
Brainformers: Trading Simplicity for Efficiency

4 min read
FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning

4 min read
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data

3 min read
NExT: Teaching Large Language Models to Reason about Code Execution

4 min read
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

4 min read
Sort one column by another column in powerBI

2 min read
Boost Your Code's Efficiency: Introducing Semantic Cache with Qdrant

3 min read
How to Estimate Depth from a Single Image

10 min read
Web Scraping Wikipedia tables

6 min read
Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting

4 min read
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

4 min read
SpaceByte: Towards Deleting Tokenization from Large Language Modeling

5 min read