DEV Community

Discussion on: What Did You Learn This Week --April 17?

Collapse
 
waylonwalker profile image
Waylon Walker

I learned a ton about pytest while building find-kedro

Most notably the built in fixture for tempdir, simply adding tempdir as an argument to your test function gives you a tempdir to create files and run tests against them.

GitHub logo WaylonWalker / find-kedro

Find Kedro is a small library to enhance your kedro experience by automatically assembling pipelines

Find Kedro Title

find-kedro is a small library to enhance your kedro experience. It looks through your modules to find kedro pipelines, nodes, and iterables (lists, sets, tuples) of nodes. It then assembles them into a dictionary of pipelines, each module will create a separate pipeline, and __default__ being a combination of all pipelines. This format is compatible with the kedro _create_pipelines format.

Motivation

kedro is a fantastic project that allows for super-fast prototyping of data pipelines, while yielding production-ready pipelines. find-kedro enhances this experience by adding a pytest like node/pipeline discovery eliminating the need to bubble up pipelines through modules.

When working on larger pipeline projects, it is advisable to break your project down into different sub-modules which requires knowledge of building python libraries, and knowing how to import each module correctly. While this is not too difficult, in some cases, it can trip up even the most senior engineers, losing precious…