DEV Community

chanduthedev
chanduthedev

Posted on

1

Best way to validate content of the entire input data file is consistent!!

In machine learning, preparing data is one of the key step. Below is the simple and best way to check input data file is formatted properly. Below command should always return one unique value other wise file is not formatted properly.

cat file_name | awk -F',' '{print NF}' | sort -u

Refer original post for more details.

Top comments (0)

Sentry image

See why 4M developers consider Sentry, “not bad.”

Fixing code doesn’t have to be the worst part of your day. Learn how Sentry can help.

Learn more

👋 Kindness is contagious

Please leave a ❤️ or a friendly comment on this post if you found it helpful!

Okay