DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
The Illusion of State in State-Space Models

The Illusion of State in State-Space Models

Comments
4 min read
The Best R Packages Every Data Scientist Should Use

The Best R Packages Every Data Scientist Should Use

Comments
2 min read
"Day 44 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -22)

"Day 44 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -22)

1
Comments
2 min read
"Day 45 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -24)

"Day 45 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -24)

1
Comments
1 min read
Tools Every Data Scientist Should Know

Tools Every Data Scientist Should Know

Comments
2 min read
Large Language Models as Optimizers

Large Language Models as Optimizers

1
Comments
4 min read
Recommender Systems in the Era of Large Language Models (LLMs)

Recommender Systems in the Era of Large Language Models (LLMs)

1
Comments
4 min read
Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

1
Comments
4 min read
H2O-Danube-1.8B Technical Report

H2O-Danube-1.8B Technical Report

Comments
4 min read
Dataset Reset Policy Optimization for RLHF

Dataset Reset Policy Optimization for RLHF

Comments
4 min read
Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

Comments
4 min read
Manipulating Large Language Models to Increase Product Visibility

Manipulating Large Language Models to Increase Product Visibility

Comments
3 min read
CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs

CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs

Comments
3 min read
BooookScore: A systematic exploration of book-length summarization in the era of LLMs

BooookScore: A systematic exploration of book-length summarization in the era of LLMs

Comments
4 min read
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Comments
4 min read
The Curse of Recursion: Training on Generated Data Makes Models Forget

The Curse of Recursion: Training on Generated Data Makes Models Forget

Comments
4 min read
TransformerFAM: Feedback attention is working memory

TransformerFAM: Feedback attention is working memory

Comments
4 min read
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

Comments
4 min read
Beginner's Guide to Math's for Machine Learning

Beginner's Guide to Math's for Machine Learning

2
Comments 1
2 min read
SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

Comments
4 min read
Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying

Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying

Comments
4 min read
Assumption of Homoscedasticity : A Guide to verifying the Assumption of Constant Variance of Residuals

Assumption of Homoscedasticity : A Guide to verifying the Assumption of Constant Variance of Residuals

1
Comments
3 min read
Zero-Shot Prediction Plugin for FiftyOne

Zero-Shot Prediction Plugin for FiftyOne

Comments
8 min read
The Importance of Data in Decision Making

The Importance of Data in Decision Making

4
Comments
2 min read
Day 1 of 30 : Machine Learning

Day 1 of 30 : Machine Learning

11
Comments 6
2 min read
Bulletproof Your Analysis: Data Quality Checklists for Reliable Insights

Bulletproof Your Analysis: Data Quality Checklists for Reliable Insights

1
Comments 1
4 min read
AI and Data Sets – Maximizing the Power of Data

AI and Data Sets – Maximizing the Power of Data

1
Comments
3 min read
The Expressive Power of Transformers with Chain of Thought

The Expressive Power of Transformers with Chain of Thought

5
Comments
4 min read
The Impact of Depth on Compositional Generalization in Transformer Language Models

The Impact of Depth on Compositional Generalization in Transformer Language Models

5
Comments
4 min read
Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

5
Comments
3 min read
GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications

GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications

6
Comments
4 min read
Algorithmic Collective Action in Recommender Systems: Promoting Songs by Reordering Playlists

Algorithmic Collective Action in Recommender Systems: Promoting Songs by Reordering Playlists

6
Comments
4 min read
Show Your Work with Confidence: Confidence Bands for Tuning Curves

Show Your Work with Confidence: Confidence Bands for Tuning Curves

6
Comments
4 min read
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

5
Comments
4 min read
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

5
Comments
3 min read
Vision Transformers Need Registers

Vision Transformers Need Registers

5
Comments
4 min read
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

5
Comments
3 min read
Chapter: Vulnerability of Quantum Information Systems to Collective Manipulation

Chapter: Vulnerability of Quantum Information Systems to Collective Manipulation

5
Comments
4 min read
ChatGPT Can Predict the Future when it Tells Stories Set in the Future About the Past

ChatGPT Can Predict the Future when it Tells Stories Set in the Future About the Past

5
Comments
4 min read
Rho-1: Not All Tokens Are What You Need

Rho-1: Not All Tokens Are What You Need

5
Comments
4 min read
Generalization in diffusion models arises from geometry-adaptive harmonic representations

Generalization in diffusion models arises from geometry-adaptive harmonic representations

5
Comments
4 min read
Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

5
Comments
4 min read
CodecLM: Aligning Language Models with Tailored Synthetic Data

CodecLM: Aligning Language Models with Tailored Synthetic Data

6
Comments
4 min read
JetMoE: Reaching Llama2 Performance with 0.1M Dollars

JetMoE: Reaching Llama2 Performance with 0.1M Dollars

4
Comments
4 min read
SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes

SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes

5
Comments
4 min read
Announcing FiftyOne 0.23.7 and FiftyOne Teams 1.5.8

Announcing FiftyOne 0.23.7 and FiftyOne Teams 1.5.8

Comments
6 min read
"Day 43 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -22)

"Day 43 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -22)

1
Comments
2 min read
"Day 50 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Maths for Data Analysis ( Per & Comb- 5)

"Day 50 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Maths for Data Analysis ( Per & Comb- 5)

2
Comments
2 min read
The death of creativity

The death of creativity

47
Comments 15
6 min read
Validating Linear Regression Assumptions: A Comprehensive Approach to Multivariate Normality

Validating Linear Regression Assumptions: A Comprehensive Approach to Multivariate Normality

1
Comments
4 min read
CVPR 2024 Survival Guide: Five Vision-Language Papers You Don’t Want to Miss

CVPR 2024 Survival Guide: Five Vision-Language Papers You Don’t Want to Miss

1
Comments
9 min read
Exploring Multicollinearity: Strategies for Detecting and Managing Correlated Predictors in Regression Analysis

Exploring Multicollinearity: Strategies for Detecting and Managing Correlated Predictors in Regression Analysis

3
Comments
3 min read
Unravelling Linearity: My Journey in Regression Modeling

Unravelling Linearity: My Journey in Regression Modeling

1
Comments
3 min read
The Importance of Data Science in Today’s World

The Importance of Data Science in Today’s World

Comments
1 min read
Independence of Errors: A Guide to Validating Linear Regression Assumptions

Independence of Errors: A Guide to Validating Linear Regression Assumptions

5
Comments
3 min read
LLMs are secretly good at regression calculations

LLMs are secretly good at regression calculations

4
Comments
9 min read
94% on CIFAR-10 in 3.29 Seconds on a Single GPU

94% on CIFAR-10 in 3.29 Seconds on a Single GPU

Comments
3 min read
From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications

From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications

Comments
4 min read
The Optimal Choice of Hypothesis Is the Weakest, Not the Shortest

The Optimal Choice of Hypothesis Is the Weakest, Not the Shortest

Comments
4 min read
The Topos of Transformer Networks

The Topos of Transformer Networks

Comments
4 min read
loading...