Azure Machine Learning monitoring integration

New Relic's integrations include an integration for reporting your Azure Machine Learning data to New Relic. This document explains how to activate this integration and describes the data that can be reported.

Activate integration

To enable the integration follow standard procedures to activate your Azure service in New Relic.

Configuration and polling

You can change the polling frequency and filter data using configuration options.

Default polling information for the Azure Machine Learning integration:

  • New Relic polling interval: 5 minutes

Find and use data

To find your integration data, go to one.newrelic.com > Infrastructure > Azure and select an integration.

You can query and explore your data using the following event type:

Entity Event Type Provider
Workspace AzureMachineLearningWorkspaceSample AzureMachineLearningWorkspace

For more on how to use your data, see Understand and use integration data.

Metric data

This integration collects Azure Machine Learning data for Workspace.

Machine Learning Workspace data

Metric Unit Description

completedRuns

Count Number of runs completed successfully for this workspace

startedRuns

Count Number of runs started for this workspace

failedRuns

Count Number of runs failed for this workspace

modelRegisterSucceeded

Count Number of model registrations that succeeded in this workspace

modelRegisterFailed

Count Number of model registrations that failed in this workspace

modelDeployStarted

Count Number of model deployments started in this workspace

modelDeploySucceeded

Count Number of model deployments that succeeded in this workspace

modelDeployFailed

Count Number of model deployments that failed in this workspace

totalNodes

Count Number of total nodes. This total includes some of Active Nodes, Idle Nodes, Unusable Nodes, Preempted Nodes, Leaving Nodes.

activeNodes

Count Number of Active nodes. These are the nodes which are actively running a job.

idleNodes

Count Number of idle nodes. Idle nodes are the nodes which are not running any jobs but can accept new job if available.

unusableNodes

Count Number of unusable nodes. Unusable nodes are not functional due to some unresolvable issue. Azure will recycle these nodes.

preemptedNodes

Count Number of preempted nodes. These nodes are the low priority nodes which are taken away from the available node pool.

leavingNodes

Count Number of leaving nodes. Leaving nodes are the nodes which just finished processing a job and will go to Idle state.

totalCores

Count Number of total cores

activeCores

Count Number of active cores

idleCores

Count Number of idle cores

unusableCores

Count Number of unusable cores

preemptedCores

Count Number of preempted cores

leavingCores

Count Number of leaving cores

quotaUtilizationPercent

Percent Percent of quota utilized

cpuUtilizationPercent

Percent CPU utilization

gpuUtilizationPercent

Percent GPU utilization

For more help

If you need more help, check out these support and learning resources: