DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Enhancing ID Verification Systems: Unleashing the Power of On Premise Face Recognition SDKs

Enhancing ID Verification Systems: Unleashing the Power of On Premise Face Recognition SDKs

18
Comments 1
2 min read
Unlocking age verification—The fusion of ID document recognition and face attribute analysis

Unlocking age verification—The fusion of ID document recognition and face attribute analysis

21
Comments
2 min read
Apache Doris for log and time series data analysis in NetEase, why not Elasticsearch and InfluxDB?

Apache Doris for log and time series data analysis in NetEase, why not Elasticsearch and InfluxDB?

1
Comments
9 min read
LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation

LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation

Comments
4 min read
MapReduce Vs Tez

MapReduce Vs Tez

6
Comments
2 min read
Using Apache Superset, a Powerful and Free Data Analysis Tool

Using Apache Superset, a Powerful and Free Data Analysis Tool

6
Comments
3 min read
Difoosion, a Simple Web-Interface for Stable Diffusion Models

Difoosion, a Simple Web-Interface for Stable Diffusion Models

10
Comments
3 min read
Creating Custom Dashboards with Vizro: A Comprehensive Guide

Creating Custom Dashboards with Vizro: A Comprehensive Guide

Comments
8 min read
Beginner's Guide to NLP and NLTK 🐍📑

Beginner's Guide to NLP and NLTK 🐍📑

19
Comments
6 min read
Recapping the AI, Machine Learning and Computer Meetup — July 3, 2024

Recapping the AI, Machine Learning and Computer Meetup — July 3, 2024

Comments
5 min read
Computer Vision Meetup: Performance Optimisation for Multimodal LLMs 34:59

Computer Vision Meetup: Performance Optimisation for Multimodal LLMs

2
Comments
1 min read
July 10: Developing Data-Centric AI Workshop

July 10: Developing Data-Centric AI Workshop

Comments
1 min read
The Power of Data Science: Revolutionizing Industries.

The Power of Data Science: Revolutionizing Industries.

Comments
1 min read
Uncertainty-Aware AI from Multimodal Data: A PyTorch Tutorial with LUMA Dataset

Uncertainty-Aware AI from Multimodal Data: A PyTorch Tutorial with LUMA Dataset

1
Comments
14 min read
How Data Integration Is Evolving Beyond ETL

How Data Integration Is Evolving Beyond ETL

Comments
16 min read
Data protection in AI ?

Data protection in AI ?

Comments
1 min read
Optimizing RAG Through an Evaluation-Based Methodology

Optimizing RAG Through an Evaluation-Based Methodology

13
Comments
14 min read
Handling Categorical Values|| Machine Learning

Handling Categorical Values|| Machine Learning

7
Comments
4 min read
CRISP-DM: The Essential Methodology for Structuring Your Data Science Projects

CRISP-DM: The Essential Methodology for Structuring Your Data Science Projects

2
Comments
4 min read
Key Data Science Innovations to Embrace in 2024

Key Data Science Innovations to Embrace in 2024

1
Comments 1
3 min read
Data Science Essentials: Building a Strong Foundation

Data Science Essentials: Building a Strong Foundation

2
Comments
3 min read
If in a Crowdsourced Data Annotation Pipeline, a GPT-4

If in a Crowdsourced Data Annotation Pipeline, a GPT-4

1
Comments
3 min read
Bayesian Regression Markets

Bayesian Regression Markets

1
Comments
3 min read
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

1
Comments
4 min read
A Comprehensive Guide to NumPy with Python 🐍🎲

A Comprehensive Guide to NumPy with Python 🐍🎲

5
Comments
3 min read
{sigma}-GPTs: A New Approach to Autoregressive Models

{sigma}-GPTs: A New Approach to Autoregressive Models

1
Comments
3 min read
Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion

Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion

2
Comments
4 min read
My Experience with Python for Data Analysis

My Experience with Python for Data Analysis

3
Comments 2
4 min read
Repairing Catastrophic-Neglect in Text-to-Image Diffusion Models via Attention-Guided Feature Enhancement

Repairing Catastrophic-Neglect in Text-to-Image Diffusion Models via Attention-Guided Feature Enhancement

1
Comments
4 min read
Bytes Are All You Need: Transformers Operating Directly On File Bytes

Bytes Are All You Need: Transformers Operating Directly On File Bytes

1
Comments
4 min read
Cutting through buggy adversarial example defenses: fixing 1 line of code breaks Sabre

Cutting through buggy adversarial example defenses: fixing 1 line of code breaks Sabre

Comments
4 min read
Day 9 of Machine Learning|| Linear Regression implementation

Day 9 of Machine Learning|| Linear Regression implementation

6
Comments
4 min read
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Comments
5 min read
ReFT: Reasoning with Reinforced Fine-Tuning

ReFT: Reasoning with Reinforced Fine-Tuning

Comments
4 min read
Evaluating the Social Impact of Generative AI Systems in Systems and Society

Evaluating the Social Impact of Generative AI Systems in Systems and Society

Comments
5 min read
Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics

Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics

Comments
3 min read
From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data

From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data

Comments
5 min read
The Remarkable Robustness of LLMs: Stages of Inference?

The Remarkable Robustness of LLMs: Stages of Inference?

Comments
4 min read
From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models

From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models

Comments
4 min read
Thermometer: Towards Universal Calibration for Large Language Models

Thermometer: Towards Universal Calibration for Large Language Models

Comments
4 min read
Assessing the nature of large language models: A caution against anthropocentrism

Assessing the nature of large language models: A caution against anthropocentrism

Comments
4 min read
MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution

MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution

2
Comments
4 min read
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models

SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models

Comments
5 min read
The Ultimate Guide to Choosing Data Engineering Services for Your Enterprise

The Ultimate Guide to Choosing Data Engineering Services for Your Enterprise

Comments
3 min read
Technical Report: Titanic Passenger List

Technical Report: Titanic Passenger List

1
Comments
2 min read
Unlocking the Power of R: Essential Libraries for Data Science in 2024

Unlocking the Power of R: Essential Libraries for Data Science in 2024

1
Comments
3 min read
Unsupervised Learning: Unveiling the Hidden Secrets in Your Data

Unsupervised Learning: Unveiling the Hidden Secrets in Your Data

2
Comments
4 min read
Introduction to Apache Hadoop & MapReduce

Introduction to Apache Hadoop & MapReduce

5
Comments
3 min read
Using JSONB in PostgreSQL

Using JSONB in PostgreSQL

10
Comments 2
3 min read
A First Glance Review on Retail Sales Data

A First Glance Review on Retail Sales Data

1
Comments
2 min read
Getting Started with Python: A Beginner's Guide

Getting Started with Python: A Beginner's Guide

Comments 1
3 min read
First glance - Sales Dataset

First glance - Sales Dataset

Comments 1
2 min read
Basic Data Analysis on the Iris Flower Dataset (HNG 11)

Basic Data Analysis on the Iris Flower Dataset (HNG 11)

Comments 1
2 min read
Technical Report: Initial Data Analysis of Titanic Datasets

Technical Report: Initial Data Analysis of Titanic Datasets

Comments
2 min read
A Beginner's Guide to Mastering Data Science: Key Tips and Strategies 🤖

A Beginner's Guide to Mastering Data Science: Key Tips and Strategies 🤖

6
Comments 2
5 min read
SALES DATA ANALYSIS

SALES DATA ANALYSIS

2
Comments
3 min read
Introducing dataDisk: Simplify Your Data Processing Pipelines

Introducing dataDisk: Simplify Your Data Processing Pipelines

1
Comments
2 min read
Understanding Dijkstra's Algorithm: A Step-by-Step Guide 🚀

Understanding Dijkstra's Algorithm: A Step-by-Step Guide 🚀

12
Comments 2
6 min read
Tour the WayveScenes101 Autonomous Driving Dataset 03:18

Tour the WayveScenes101 Autonomous Driving Dataset

1
Comments 1
1 min read
Titanic Dataset Analysis

Titanic Dataset Analysis

Comments
2 min read
loading...