DEV Community

PythicCoder for Microsoft Azure

Posted on • Originally published at Medium on

Installing Azure ML Estimator Dependencies from Remote GitHub Branches

TLDR; The Azure ML (AML) Service Estimators make it easy to train distributed machine learning models at scale. This post outlines how configure external AML estimator dependencies from GitHub.

What is Azure ML and Why Should I Care?

The Azure Machine Learning Service provides several ways to train machine learning models, from code first solutions using the SDK to low code solutions such as AutoMLand the Visual Designer.

To learn more check out the documentation.

Azure Machine Learning documentation

If you don’t have an Azure Subscription you can get a free one using the link below.

Create your Azure free account today | Microsoft Azure

With the subscription you can get started learning the ins and outs of the Azure Machine Learning service with the following Microsoft Learn Path.

Build AI solutions with Azure Machine Learning service - Learn

What is an Azure ML Estimator?

Azure ML Estimators make it easy to train models based on popular machine learning frameworks. There are estimator classes for Scikit-learn , PyTorch , TensorFlow , and more. There is also a generic estimator that can be used with frameworks that do not already have a dedicated estimator class. You don’t have to worry about defining a run configuration when using estimators.

To get started with the default Azure ML Estimators check out the link below.

Train ML models with estimators - Azure Machine Learning

How to Configure Estimator Dependencies from GitHub?

Estimators provide the ability to add package dependencies from PyPI and Conda however not all the software dependencies are available in these repositories and sometimes ML engineers might want to install from source.

The following code shows how to install from a remote github package using the nlp-architectand the absa branch as an example.

Now you can get started creating custom Azure ML estimators for your open source GitHub projects without the need to write a custom run configuration or create a docker environment.

Next Steps

Now that you know how to configure dependencies from GitHubcheck out my previous post on 9 Advanced Tips for Production machine learning and how to get started with Azure Notebook VMs.

If AI/ML interests you check out these amazing open source best practice repos as well.

About the Author

Aaron (Ari) Bornstein is an avid AI enthusiast with a passion for history, engaging with new technologies and computational medicine. As an Open Source Engineer at Microsoft’s Cloud Developer Advocacy team, he collaborates with Israeli Hi-Tech Community, to solve real world problems with game changing technologies that are then documented, open sourced, and shared with the rest of the world.

Top comments (0)