DEV Community

saud khan
saud khan

Posted on

Need help in Machine Learning

Hello everyone,

I am a beginner in machine learning, and I am currently working with the Heart Disease UCI dataset downloaded from Kaggle. Upon exploring the data, I noticed that several columns have missing values, and I believe all these columns are important for the analysis. Here is a summary of the missing values in my dataset:

id: 0 missing values
age: 0 missing values
sex: 0 missing values
dataset: 0 missing values
cp: 0 missing values
trestbps: 59 missing values
chol: 30 missing values
fbs: 90 missing values
restecg: 2 missing values
thalch: 55 missing values
exang: 55 missing values
oldpeak: 62 missing values
slope: 309 missing values
ca: 611 missing values
thal: 486 missing values
num: 0 missing values
Could anyone please guide me on how to handle these missing values effectively, considering all columns are significant? Should I use imputation techniques, or are there better methods for this scenario? Any advice, especially with examples, would be greatly appreciated!

Thank you!

Do your career a big favor. Join DEV. (The website you're on right now)

It takes one minute, it's free, and is worth it for your career.

Get started

Community matters

Top comments (0)

Heroku

Build apps, not infrastructure.

Dealing with servers, hardware, and infrastructure can take up your valuable time. Discover the benefits of Heroku, the PaaS of choice for developers since 2007.

Visit Site

👋 Kindness is contagious

Discover a treasure trove of wisdom within this insightful piece, highly respected in the nurturing DEV Community enviroment. Developers, whether novice or expert, are encouraged to participate and add to our shared knowledge basin.

A simple "thank you" can illuminate someone's day. Express your appreciation in the comments section!

On DEV, sharing ideas smoothens our journey and strengthens our community ties. Learn something useful? Offering a quick thanks to the author is deeply appreciated.

Okay