DEV Community

Julia Silge profile picture

Julia Silge

I’m an international keynote speaker and real-world practitioner focused on data analysis and machine learning. I love making beautiful charts, the statistical programming language R, and Jane Austen.

Location Salt Lake City, UT Joined Joined on  Personal website https://juliasilge.com/ github website twitter website

Work

Data scientist & software engineer at RStudio PBC

Seven Year Club
Writing Debut
Six Year Club
Five Year Club
Four Year Club
Three Year Club
4 Week Writing Streak
Two Year Club
One Year Club
Topic modeling for Spice Girls lyrics 🇬🇧👯‍♀️🎤

Topic modeling for Spice Girls lyrics 🇬🇧👯‍♀️🎤

6
Comments
7 min read

Want to connect with Julia Silge?

Create an account to connect with Julia Silge. You can also sign in below to proceed if you already have an account.

Already have an account? Sign in
Predicting viewership for Doctor Who episodes

Predicting viewership for Doctor Who episodes

Comments
4 min read
Predict giant pumpkin weights 🎃 with tidymodels

Predict giant pumpkin weights 🎃 with tidymodels

6
Comments
6 min read
Spatial resampling for the #30DayMapChallenge 🗺

Spatial resampling for the #30DayMapChallenge 🗺

1
Comments
4 min read
Multiclass predictive modeling for economics research papers 📑

Multiclass predictive modeling for economics research papers 📑

6
Comments
6 min read
Dimensionality reduction for Billboard Top 100 songs 🎶

Dimensionality reduction for Billboard Top 100 songs 🎶

4
Comments
7 min read
Fit and predict with tidymodels for bird baths in Australia 🇦🇺

Fit and predict with tidymodels for bird baths in Australia 🇦🇺

Comments
6 min read
Modeling human/computer interactions on Star Trek 🖖

Modeling human/computer interactions on Star Trek 🖖

Comments
6 min read
Predict housing prices 🏠 in Austin TX with xgboost

Predict housing prices 🏠 in Austin TX with xgboost

1
Comments
10 min read
Use racing methods to tune xgboost models and predict home runs ⚾️

Use racing methods to tune xgboost models and predict home runs ⚾️

3
Comments 1
5 min read
Tune xgboost models with early stopping to predict shelter animal status 🐱🐶

Tune xgboost models with early stopping to predict shelter animal status 🐱🐶

1
Comments
5 min read
Predict which Scooby Doo monsters 👻 are REAL with a tuned decision tree model

Predict which Scooby Doo monsters 👻 are REAL with a tuned decision tree model

3
Comments
5 min read
Create a custom metric for your machine learning model to predict NYC Airbnb prices

Create a custom metric for your machine learning model to predict NYC Airbnb prices

Comments
5 min read
Class imbalance and classification metrics with aircraft wildlife strikes ✈️

Class imbalance and classification metrics with aircraft wildlife strikes ✈️

Comments
7 min read
Partial dependence plots for Mario Kart 🍄 world records

Partial dependence plots for Mario Kart 🍄 world records

1
Comments
5 min read
Predict water availability 🚰 in Sierra Leone with random forests

Predict water availability 🚰 in Sierra Leone with random forests

6
Comments
5 min read
Estimate change in CEO departures with bootstrap resampling

Estimate change in CEO departures with bootstrap resampling

5
Comments
4 min read
Which Netflix titles are movies and which are TV shows? 📺

Which Netflix titles are movies and which are TV shows? 📺

7
Comments 1
6 min read
Use subword features to find which post offices are in Hawaii 🌺

Use subword features to find which post offices are in Hawaii 🌺

1
Comments
8 min read
Dimensionality reduction of United Nations voting patterns 🌍

Dimensionality reduction of United Nations voting patterns 🌍

Comments
4 min read
Bootstrap confidence intervals for Super Bowl commercials 🏈

Bootstrap confidence intervals for Super Bowl commercials 🏈

1
Comments
4 min read
Understand inequality in student debt 🎓 with linear modeling

Understand inequality in student debt 🎓 with linear modeling

1
Comments
3 min read
Learn tidytext with my new learnr course

Learn tidytext with my new learnr course

5
Comments
3 min read
Explore art over time in the Tate collection 🖼

Explore art over time in the Tate collection 🖼

Comments
10 min read
Code generation for tuning random forests using IKEA furniture prices 🛋

Code generation for tuning random forests using IKEA furniture prices 🛋

7
Comments
5 min read
Tune and interpret decision trees for wind turbine capacity 🌬

Tune and interpret decision trees for wind turbine capacity 🌬

3
Comments
7 min read
Predicting class membership for the Datasaurus Dozen 🦖

Predicting class membership for the Datasaurus Dozen 🦖

4
Comments
7 min read
Modeling NCAA women's 🏀 tournament seeds

Modeling NCAA women's 🏀 tournament seeds

7
Comments
8 min read
Introducing our new book, Tidy Modeling with R 📖

Introducing our new book, Tidy Modeling with R 📖

7
Comments 1
1 min read
Handle class imbalance in modeling Himalayan climbing expeditions ⛰

Handle class imbalance in modeling Himalayan climbing expeditions ⛰

Comments
11 min read
Modeling crop yields 🌽🍚🌾 with tidy data principles

Modeling crop yields 🌽🍚🌾 with tidy data principles

5
Comments
4 min read
Build a predictive text model for The Last Airbender

Build a predictive text model for The Last Airbender

28
Comments
7 min read
Get started with tidymodels and the Palmer penguins 🐧

Get started with tidymodels and the Palmer penguins 🐧

9
Comments
6 min read
Announcing our new book 📖! Supervised Machine Learning for Text Analysis in R

Announcing our new book 📖! Supervised Machine Learning for Text Analysis in R

6
Comments
2 min read
Predicting astronaut mission duration 👩‍🚀🚀 with bootstrap aggregation

Predicting astronaut mission duration 👩‍🚀🚀 with bootstrap aggregation

6
Comments
7 min read
The Bechdel test and the X-Mansion with bootstrap resampling 🦸‍♀️🦸‍♂️

The Bechdel test and the X-Mansion with bootstrap resampling 🦸‍♀️🦸‍♂️

17
Comments
6 min read
Impute missing data for historical trans-Atlantic slave voyages

Impute missing data for historical trans-Atlantic slave voyages

8
Comments 1
8 min read
PCA and UMAP with cocktail recipes 🥃🍸🍹

PCA and UMAP with cocktail recipes 🥃🍸🍹

5
Comments
6 min read
Learn about log odds and empirical Bayes with cocktail 🍸 recipes

Learn about log odds and empirical Bayes with cocktail 🍸 recipes

14
Comments 2
6 min read
Tune XGBoost with beach volleyball data 🏐

Tune XGBoost with beach volleyball data 🏐

9
Comments
9 min read
Learn supervised machine learning with my free interactive course

Learn supervised machine learning with my free interactive course

2
Comments
2 min read
Multinomial classification for volcano eruptions 🌋 with tidymodels

Multinomial classification for volcano eruptions 🌋 with tidymodels

Comments
7 min read
Building a sentiment analysis model with Animal Crossing user reviews

Building a sentiment analysis model with Animal Crossing user reviews

10
Comments 1
8 min read
Predicting fines for GDPR violations with tidymodels

Predicting fines for GDPR violations with tidymodels

6
Comments
8 min read
Principal component analysis and the best hip hop songs ever

Principal component analysis and the best hip hop songs ever

8
Comments
10 min read
Bootstrap resampling with #TidyTuesday beer production data

Bootstrap resampling with #TidyTuesday beer production data

5
Comments
5 min read
Tuning random forest hyperparameters in R with #TidyTuesday trees data

Tuning random forest hyperparameters in R with #TidyTuesday trees data

1
Comments
7 min read
Lasso regression for IMDB ratings of The Office

Lasso regression for IMDB ratings of The Office

7
Comments
9 min read
Practice handling dates in R with lubridate... THEATRICALLY

Practice handling dates in R with lubridate... THEATRICALLY

6
Comments
8 min read
loading...