This project comprises of operationalizing Machine Leanning in Microsoft Azure. A classification model is trained and deployed. The deployed model endpoint URI can later be used to make predictions A pipline is also created, published and consumed. The published pipeline endpoint URI can later be used to initiate a new run.
- Consider implementing one or more stand out suggestions
- Research and use balanced data to improve the outcome of the predictions
- Version the pipelines. This would allow development of new models while customers continue to use the existing version
This step allows a model to be trained across different algorithms/parameters and highlights the best model
-
An Azure ML Dataset is created and registered if it does not already exist from the URL for Marketing Bank data.
-
A Compute cluster is created if it does not already exist
-
An AtoML confiuration is specified with key information such as the best netric to use, the column label, etc.
-
A run is then submitted to train the model. Once the experiment/run is completed, the best model is identified.
In this step, we use the Azure ML Studio U/I t odeploy the best model. We use ACI (Azure Container Instance) with authentication enabled.
Logging helps troublshoot and understanding the workflow. It also helps with quantifying performance at various stages of the execution. The WebServcie is used to enable/disable Application Insights.
Sample output of the logs that are available once application insight is enabled.
Once a model is published, AzureML exposes a swagger.json file. This can be consumed by Swagger and helps with the documentation of the methods that are exposed together with the JSON payloads for input/output. This makes it much easier to start consuming the endpoint.
The model endpoint is consumed by making a REST API call to the scoring URI. If authentication is enabled, the key must also be provide. The output of such a model invocation is displayed below.
A pipeline is created when it is "run in the context of an experiment. THe pipeline can also be visualized in the Pipelines section of AzureML Studio
Once a pipeline is published, a pipeline endpoint is generated and may be accessed from Azure ML Studio under the 'Pipeline Endpoint' tab
Review of the pipeline from Azure ML studio showing the bank marketing dataset and the AutoML module
Once published, the Pipeline Details tab will show the published pipeline status (Active in this case) and also the pieline REST endpoint that can be called to "run" the pipeline.
THe RunDetails widget asychronously displays the run detais in the Notebook as the pipeline run progresses.
A pipeline run stats (Scheduled, Completed, etc.) can be reviewed in AzureML Studio in the Pipelines section
https://drive.google.com/file/d/1ATN5RPttjm1xlc9htaBOFCGPFaAHfoE8/view?usp=sharing