DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

As an AI Language Model, "Yes I Would Recommend Calling the Police": Norm Inconsistency in LLM Decision-Making

Comments
4 min read
Neuromorphic dreaming: A pathway to efficient learning in artificial agents

Comments
3 min read
Fractal Patterns May Illuminate the Success of Next-Token Prediction

Comments
5 min read
ML Intro : IRIS DATASET

4
Comments
4 min read
SQL Convertor for Easy Migration from Presto, Trino, ClickHouse, and Hive to Apache Doris

Comments
4 min read
Unlocking the Power of YOLOv10: Step-by-Step Guide with Real-World Examples

20
Comments
4 min read
MT-Bench: Comparing different LLM Judges

13
Comments 1
4 min read
How To Build a Data Analytics Dashboard

Comments 1
22 min read
Big Data: the tool we need.

Comments
2 min read
How I Aced the DP-100 Exam and Became an Azure Data Scientist Associate

5
Comments 3
12 min read
NoSQL Deployment

Comments
1 min read
Unleashing the Value of Data: A Journey into Data Monetization

1
Comments
2 min read
Introducing Tapyr: Create and Deploy Enterprise-Ready PyShiny Dashboards with Ease

Comments
5 min read
Domino tiling library

Comments
1 min read
Voxel51 Filtered Views Newsletter - May 24, 2024

Comments
11 min read
word clouds with python ☁️🐍

3
Comments
3 min read
Architecture of Neural Networks

Comments
6 min read
Embracing Open Source: A Catalyst for Scientific Progress

Comments
2 min read
Descriptive Statistics || Part 1(Central Tendency and Dispersion)

2
Comments
4 min read
📸 Disaster reporting w/Open Source LLaVA/AI for photo analysis

2
Comments 6
2 min read
INTRO : Apache Cassandra

Comments
2 min read
Computer Vision Meetup: GraphRAG with a Knowledge Graph

Comments
1 min read
Your Guide to Data Science Interview Preparation: Tips for Success

6
Comments
16 min read
Distributed Databases

Comments
1 min read
How Far Are We From AGI

1
Comments
4 min read
HMT: Hierarchical Memory Transformer for Long Context Language Processing

Comments
4 min read
GDPR: Is it worth it? Perceptions of workers who have experienced its implementation

Comments
4 min read
MarkLLM: An Open-Source Toolkit for LLM Watermarking

2
Comments
5 min read
Unveiling the Potential: Harnessing Deep Metric Learning to Circumvent Video Streaming Encryption

Comments
4 min read
Special Characters Attack: Toward Scalable Training Data Extraction From Large Language Models

Comments
4 min read
NASU -- Novel Actuating Screw Unit: Origami-inspired Screw-based Propulsion on Mobile Ground Robots

Comments
4 min read
MOMENT: A Family of Open Time-series Foundation Models

Comments
4 min read
Chameleon: Mixed-Modal Early-Fusion Foundation Models

Comments
4 min read
SqueezeSAM: User friendly mobile interactive segmentation

Comments
3 min read
SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

Comments
4 min read
VILA: On Pre-training for Visual Language Models

Comments
4 min read
Sporthesia: Augmenting Sports Videos Using Natural Language

Comments
4 min read
A Spectral Condition for Feature Learning

Comments
4 min read
Distinguishing Tor From Other Encrypted Network Traffic Through Character Analysis

Comments
3 min read
An Analysis of Quantile Temporal-Difference Learning

1
Comments
4 min read
Thinking Tokens for Language Modeling

Comments
3 min read
Observational Scaling Laws and the Predictability of Language Model Performance

1
Comments
4 min read
Multimodal Chain-of-Thought Reasoning in Language Models

Comments
4 min read
On the Security Vulnerabilities of Text-to-SQL Models

Comments
3 min read
GPT-4 passes most of the 297 written Polish Board Certification Examinations

Comments
3 min read
Hydragen: High-Throughput LLM Inference with Shared Prefixes

Comments
4 min read
Training-Free Consistent Text-to-Image Generation

Comments
4 min read
Zero-Shot Tokenizer Transfer

Comments
4 min read
Player-Driven Emergence in LLM-Driven Game Narrative

Comments
4 min read
Do Anything Now: Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models

Comments
4 min read
Simultaneous Many-Row Activation in Off-the-Shelf DRAM Chips: Experimental Characterization and Analysis

Comments
4 min read
People cannot distinguish GPT-4 from a human in a Turing test

Comments
5 min read
Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models

Comments
5 min read
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

1
Comments
4 min read
Biomedical knowledge graph-optimized prompt generation for large language models

Comments
3 min read
Identifying the Risks of LM Agents with an LM-Emulated Sandbox

Comments
3 min read
The Platonic Representation Hypothesis

Comments
3 min read
FOLIO: Natural Language Reasoning with First-Order Logic

Comments
4 min read
Black-Box Access is Insufficient for Rigorous AI Audits

Comments
4 min read
Layer-Condensed KV Cache for Efficient Inference of Large Language Models

Comments
5 min read