DEV Community

chanduthedev

A quick way to validate that an entire input data file is consistently formatted

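In machine learning, preparing data is one of the key steps. Below is a simple way to check that a comma-separated input file is formatted properly. The command prints the number of fields on every line and then collapses those counts to their distinct values. It should always return exactly one value; otherwise, some lines have a different number of fields and the file is not formatted properly.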

awk -F',' '{print NF}' file_name | sort -u
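
When the command above returns more than one value, the next question is usually which lines are off. The following is a minimal sketch of one way to find them, not part of the original post: the function name check_field_counts is hypothetical, and it assumes the first line has the correct number of fields and that no field contains a quoted comma (awk splits on every comma, so quoted CSV fields would be miscounted).

```shell
# Hypothetical helper: list lines whose field count differs from line 1.
# Assumption: plain comma-delimited data, no quoted fields containing commas.
check_field_counts() {
    awk -F',' '
        BEGIN   { bad = 0 }
        NR == 1 { expected = NF; next }   # the first line sets the expected count
        NF != expected {                  # report every line that deviates
            printf "line %d: %d fields (expected %d)\n", NR, NF, expected
            bad = 1
        }
        END     { exit bad }              # exit status 1 if any line deviated
    ' "$1"
}
```

Calling check_field_counts file_name prints nothing and exits with status 0 when every line matches the first one, so it can be used as a guard in a data preparation script.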

Refer to the original post for more details.
