DEV Community

Discussion on: Welcome Thread - v49

Collapse
 
celinemol profile image
Celine

Hello! I'm Celine. I am visiting from beautiful, sunny Santa Barbara, California. I do software engineering for an awesome company called Apeel Sciences. We develop plant-derived technologies that help extend the shelf life of fresh produce.

I just started in the software space, been at it for a couple of years now. I come from a background in data science. I hope to give back to the data science community by recommending good coding practices to make your data science models more reproducible, scalable, and robust.

Thanks for having me! Excited to be here (:

Collapse
 
samuelbalogh profile image
Samu B

Hi Celine,

the company sounds really interesting!
Best of luck with your work and thanks for helping the data science community!

Best,
Sam B

Collapse
 
celinemol profile image
Celine

Thank you! Best of luck to you as well (:

Collapse
 
anatfradin profile image
AnatFradin

Hello

Collapse
 
angustay174 profile image
Angus

Hey!

Collapse
 
pawarpiyusha profile image
Piyusha

Hi

Collapse
 
oanouman profile image
Martial Anouman

Welcome Celine !!

Collapse
 
celinemol profile image
Celine

Thank you! (: you as well!

Collapse
 
anatfradin profile image
AnatFradin

Hello

Collapse
 
dizzlebot profile image
DizzleBot

Hello. So nice to meet you. I lived in socal for about a 9 years. Hope to pick your brain about some languages hehe.

Collapse
 
celinemol profile image
Celine

Haha you're welcome to, although I am also new to React and Javascript so not sure that I would be the right person to ask (:

Thread Thread
 
dizzlebot profile image
DizzleBot

hahaha well well. It is really fun though. Are you currently working in the field yet. We have about ten days left in our bootcamp and hopefully, i'm able to land a job in it.

Thread Thread
 
celinemol profile image
Celine

Yay good luck!! I’m sure you will find one. React is the hot new language (;

I started about a year ago, so I am still fairly new 😎 but loving it so far!

Collapse
 
jonlim profile image
Jon Lim

Hey Celine!

Would be curious to know if there are any best practices for testing in data science models? What would that look like?

Collapse
 
celinemol profile image
Celine

I think writing unit tests for statement coverage and integration tests is the most important thing. You want to make really easy for yourself by writing a testing script that you can just run every time you have a new iteration to your model or the functions you use for preprocessing so that you know you haven’t broken anything and you can trust that your code works.

I’ll start writing some documentation to provide examples and make this concept easier to digest but for now a quick google search might help you (: Hope this helps!

Thread Thread
 
jonlim profile image
Jon Lim

Right on - in your experience, is it something that happens with a lot of data science teams? (The writing of tests, I mean.)

I'm a weird convert towards testing, if it isn't immediately obvious hehe, but I'm always surprised at just how few tests can be found out there sometimes.

Thread Thread
 
celinemol profile image
Celine

No, I don’t see a lot of testing in the data science world (: I agree, I think there could definitely be a lot more of it. It would make writing data models a lot easier to scale, instead of building code and fix models. But I feel like a lot of data scientists aren’t taught how to write good tests and that’s why they’ve been able to survive without it. Are you a data scientist? Where did you learn how to test?

Collapse
 
danimalss profile image
Dani

hi

Collapse
 
davic64 profile image
David Victoria

That is incredible :3 You are cool

I've been developing for almost a year and I hope to learn a lot with all of you.

Collapse
 
anatfradin profile image
AnatFradin

Hello

Collapse
 
aquaman_ profile image
Andrew Bain

Welcome!

Collapse
 
padu143 profile image
padu143 • Edited

Hello,I am param from India, I want to become a data scientist.
Can you suggest me,which programming languages I need to study.

Collapse
 
celinemol profile image
Celine

Python is the best! You can learn the tensorflow library created by Google. And you can easily follow good coding practices like domain driven design and writing unit tests and acceptance tests.
To manage databases I recommend SQL. PostgreSQL is a great way to learn because it is open source.

Collapse
 
daxsoft profile image
Michael Willian Santos

Be welcome :)

I'm always excited when people show up with ideas that at some sort will help the planet. I'll be following you to watch the news about this tech!

Collapse
 
celinemol profile image
Celine

I’ll keep you posted! (:

Collapse
 
emalwardak profile image
Emal Wardak

Oh, that is great.