Section 5 Reference Guide - Maintenance and Management

Section purpose

Automation that runs silently and fails silently is worse than no automation, because administrators assume it is working when it is not. This section covers the operational discipline required to keep automation workloads maintainable and observable over time: source control, CI/CD deployment, safe use of AI-assisted development, monitoring and alerting, and service principal lifecycle hygiene. These practices apply across all the platforms covered in Section 4.

Learning Objectives

How to configure source control integration and CI/CD deployment for Azure Automation Accounts
How to identify and remediate common security issues in AI-generated automation code
How to set up Log Analytics queries and Azure Monitor alerts for automation failure detection
How to query Graph API for service principal credential expiry and orphan detection
What a service principal naming convention should encode and why it matters

Learning journey - the four operational excellence pillars covered in this section

Source control and version management

Opening scenario - 18 days of silent failure on a CA exclusion runbook

Discussion prompt - how would you detect a silent automation failure today?

Source control as a security requirement - the with/without comparison

CI/CD pipeline - the five stages from code edit to deploy

Why source control is required

Without source control, automation code exists only in the Azure portal or in a local file on someone's workstation. There is no change history, no peer review, no rollback capability, and no audit trail. Portal-edited runbooks are the automation equivalent of ungoverned infrastructure: they change without anyone knowing, they break without anyone understanding why, and they cannot be recovered to a known-good state.

Rule: treat the Azure portal as read-only for production automation code. All changes must go through a pull request in a Git repository.

What belongs in source control

PowerShell runbooks and Python scripts
Logic App workflow definitions (exported JSON for Consumption; /src folder for Standard)
Function App code
Pipeline YAML files (GitHub Actions workflows, Azure DevOps pipeline definitions)
Bicep and ARM templates for automation infrastructure
PSScriptAnalyzer configurations and Pester test files

Repository layout - what belongs (and what does not) in source control

Automation Account source control sync

Automation Accounts support native source control integration with GitHub and Azure DevOps. Configure sync to pull from a specific branch. The sync copies runbook files from the repository to the Automation Account automatically when changes are merged.

Important limitation: native source control sync has incomplete support for PowerShell 7.x runbooks. Teams using PS7 must deploy via a CI/CD pipeline instead.

Automation Account source control sync - native sync vs CI/CD pipeline

CI/CD for runbook deployment

A CI/CD pipeline provides what native sync cannot: pre-deployment validation.

Basic GitHub Actions pipeline for runbook deployment:

name: Deploy Runbook
on:
  push:
    branches: [main]
    paths: ['runbooks/**']

permissions:
  id-token: write
  contents: read

jobs:
  validate-and-deploy:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4

      - name: Run PSScriptAnalyzer
        shell: pwsh
        run: |
          Install-Module PSScriptAnalyzer -Force -Scope CurrentUser
          $results = Invoke-ScriptAnalyzer -Path ./runbooks -Recurse -Severity Error,Warning
          if ($results) { $results; exit 1 }

      - uses: azure/login@v2
        with:
          client-id: ${{ vars.AZURE_CLIENT_ID }}
          tenant-id: ${{ vars.AZURE_TENANT_ID }}
          subscription-id: ${{ vars.AZURE_SUBSCRIPTION_ID }}

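      - name: Deploy runbook
        run: |
          az extension add --name automation --upgrade
          # replace-content updates the draft version of the runbook
          az automation runbook replace-content \
            --resource-group ${{ vars.RG_NAME }} \
            --automation-account-name ${{ vars.AA_NAME }} \
            --name MyRunbook \
            --content @./runbooks/MyRunbook.ps1
          # publish promotes the draft so scheduled jobs run the new code
          az automation runbook publish \
            --resource-group ${{ vars.RG_NAME }} \
            --automation-account-name ${{ vars.AA_NAME }} \
            --name MyRunbook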

This pipeline runs PSScriptAnalyzer before deploying. Any severity Error or Warning result fails the pipeline and blocks deployment.

CI/CD demo flow - violation blocks pipeline, fix unblocks deployment

Infrastructure as Code for automation resources

Define Automation Accounts, Logic Apps, Function Apps, RBAC role assignments, and managed identities in Bicep. Store IaC in the same repository as the code. This enables consistent, repeatable deployments and prevents configuration drift between environments.

AI-assisted development

The four common weaknesses in AI-assisted automation code

Safe sandboxes

When using GitHub Copilot, Claude, or other AI coding assistants to write automation:

Use a separate development subscription or resource group. Do not test AI-generated code directly in production.
Do not paste production secrets, tokens, or sensitive data into AI prompts.
Use a test Entra tenant (Microsoft 365 Developer Program provides free test tenants) for validating code that touches identity or security configurations.

Safe sandboxes for AI-assisted coding - review, sandbox, never paste production data

What AI-generated code commonly gets wrong

AI models tend to generate code with predictable security weaknesses. Review every AI-generated script for:

Hardcoded credentials: AI often suggests inline $clientSecret = "..." patterns.
Overly permissive scopes: AI defaults to .ReadWrite.All because it avoids permission errors. Use the minimum scope required.
Missing error handling for auth failures: AI-generated token request code often omits retry logic and error handling for 401 and 403 responses.
Insecure token storage: storing tokens in environment variables or writing them to log output.
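
The first two weaknesses in the list above are pattern-shaped enough to pre-screen before a human review. The following is a minimal sketch in Python, assuming two hypothetical regular expressions and a hypothetical function name (scan_script). It is a triage aid under those assumptions, not a replacement for PSScriptAnalyzer, Bandit, or a reviewer.

```python
import re

# Hypothetical patterns for illustration; tune them for your own codebase.
PATTERNS = {
    # A variable whose name contains "secret" or "password", assigned a string literal
    "hardcoded-credential": re.compile(
        r"\$?\w*(secret|password)\w*\s*=\s*[\"'][^\"']+[\"']", re.IGNORECASE
    ),
    # Graph permission names ending in .ReadWrite.All
    "broad-scope": re.compile(r"\b[\w.]+\.ReadWrite\.All\b"),
}


def scan_script(text):
    """Return (line_number, finding_name) pairs for lines matching a pattern."""
    findings = []
    for number, line in enumerate(text.splitlines(), start=1):
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                findings.append((number, name))
    return findings


sample = "\n".join([
    '$clientSecret = "abc123"',
    'Connect-MgGraph -Scopes "Directory.ReadWrite.All"',
    '$secret = Get-AutomationVariable -Name "ClientSecret"',
])
print(scan_script(sample))
```

The third sample line is not flagged because the value comes from a lookup rather than a string literal, which is the pattern a reviewer would want to see instead of an inline secret.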

PSScriptAnalyzer for PowerShell code review

PSScriptAnalyzer is a static analysis tool for PowerShell. Run it on every runbook before deployment:

Install-Module PSScriptAnalyzer -Scope CurrentUser

# Analyze a single runbook
Invoke-ScriptAnalyzer -Path .\MyRunbook.ps1 -Severity Error, Warning

# Analyze a directory recursively
Invoke-ScriptAnalyzer -Path .\runbooks -Recurse -Severity Error, Warning

# Include specific security rules
Invoke-ScriptAnalyzer -Path .\MyRunbook.ps1 `
  -IncludeRule PSAvoidUsingPlainTextForPassword, PSAvoidUsingConvertToSecureStringWithPlainText

PSScriptAnalyzer catches common issues before review: plain-text passwords, use of deprecated cmdlets such as the WMI cmdlets, empty catch blocks that swallow errors, and code style violations that obscure intent.

PSScriptAnalyzer pipeline - block on violation, deploy on clean run

Bandit for Python

Bandit is the Python equivalent of PSScriptAnalyzer. Run it on Python runbooks and Function App scripts:

pip install bandit
bandit -r ./scripts/ -ll   # report medium and high severity
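
Bandit's own exit code already fails a job when it reports issues, but a team may want to apply its own threshold or summarize findings for a pull request comment. The following is a minimal sketch, assuming the report was produced with bandit -r ./scripts/ -f json and that each entry under results carries filename, line_number, test_id, and issue_severity. The function name blocking_findings and the sample report are hypothetical.

```python
import json

BLOCKING = {"MEDIUM", "HIGH"}


def blocking_findings(report_json):
    """Return (filename, line, test_id) for findings of medium or high severity."""
    report = json.loads(report_json)
    return [
        (r["filename"], r["line_number"], r["test_id"])
        for r in report.get("results", [])
        if r["issue_severity"].upper() in BLOCKING
    ]


# Hand-written sample in the shape of a Bandit JSON report (not real output)
sample_report = json.dumps({
    "results": [
        {"filename": "scripts/rotate.py", "line_number": 12,
         "test_id": "B105", "issue_severity": "LOW"},
        {"filename": "scripts/rotate.py", "line_number": 40,
         "test_id": "B602", "issue_severity": "HIGH"},
    ]
})

blocking = blocking_findings(sample_report)
print(blocking)
# A CI step would exit non-zero when the list is not empty.
```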

Bandit for Python - same CI gate pattern for Python runbooks

Peer review as a security control

Static analysis tools catch syntax and pattern issues. Peer review catches logic errors, incorrect permission scope choices, and missing security considerations that tools cannot detect. Require at least one reviewer for all automation code changes, even in small teams.

Monitoring automation health

Monitoring Stack

The silent failure problem

Automation that runs silently and fails silently is operationally dangerous. An emergency access CA exclusion runbook that has been failing for 18 days looks fine from the outside - until a major incident occurs and the break-glass account does not have the expected policy exclusions. The failure was invisible because there were no alerts.

Monitoring must be configured on every automation workload before it is deployed to production.
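
A failed-job alert only helps when a job actually ran and reported a failure. One complementary check is to ask how long it has been since the last successful run. The following is a minimal sketch of that staleness test, assuming you can obtain the last success timestamp from job history or Log Analytics. The function name is_stale and the one-hour grace period are illustrative assumptions.

```python
from datetime import datetime, timedelta, timezone


def is_stale(last_success, now, expected_interval, grace=timedelta(hours=1)):
    """True when the last successful run is older than one interval plus grace."""
    if last_success is None:
        return True  # never succeeded: treat as stale
    return now - last_success > expected_interval + grace


now = datetime(2025, 1, 20, 9, 0, tzinfo=timezone.utc)
daily = timedelta(days=1)

# Ran 23.5 hours ago: still inside the daily interval plus grace
recent = is_stale(datetime(2025, 1, 19, 9, 30, tzinfo=timezone.utc), now, daily)
# Last success 18 days ago, as in the opening scenario
old = is_stale(datetime(2025, 1, 2, 9, 0, tzinfo=timezone.utc), now, daily)
print(recent, old)
```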

Automation Account monitoring

Job history: Automation Account job history shows run status, start time, duration, and output. View it in the portal under Process Automation > Jobs.
Log Analytics integration: configure Automation Account diagnostic settings to stream job logs to a Log Analytics workspace. This enables querying job history, correlating failures with other events, and building dashboards.

// Find failed runbook jobs in the last 24 hours
AzureDiagnostics
| where ResourceProvider == "MICROSOFT.AUTOMATION"
| where Category == "JobLogs"
| where ResultType == "Failed"
| where TimeGenerated > ago(24h)
| project TimeGenerated, RunbookName_s, ResultDescription
| order by TimeGenerated desc

Automation Account monitoring - diagnostic settings → Log Analytics → alert

Log Analytics KQL anatomy - each clause has a teaching purpose

Log Analytics portal view - failed job results with the error message field highlighted

Logic App run history monitoring

Logic App run history is visible in the portal under the workflow view. Enable diagnostic settings to send run history to Log Analytics:

// Find failed Logic App runs
AzureDiagnostics
| where ResourceProvider == "MICROSOFT.LOGIC"
| where Category == "WorkflowRuntime"
| where status_s == "Failed"
| where TimeGenerated > ago(24h)
| project TimeGenerated, resource_runId_s, code_s, error_message_s
| order by TimeGenerated desc

Function App monitoring with Application Insights

Function App execution telemetry is captured in Application Insights. Query for failures:

// Application Insights - failed function executions
requests
| where success == false
| where timestamp > ago(24h)
| project timestamp, name, resultCode, duration, operation_Id
| order by timestamp desc

Enable Application Insights on every security automation Function App. Without it, failures are not queryable.

Application Insights for Function Apps - four telemetry pillars

Error alerting and runbook failure notifications

Configure Azure Monitor alerts so that automation failures create visible signals rather than silent voids.

Creating an alert for failed runbook jobs

In the Azure portal, go to Monitor > Alerts > Create alert rule.
Select the Automation Account as the scope.
In Condition, select the Total Jobs metric, split by the Status dimension with value Failed, and set the threshold to greater than 0.
In Actions, configure an action group that sends an email or Teams notification.
In Details, name the alert and set severity.

Alternatively, create a Log Analytics alert rule that queries job logs:

AzureDiagnostics
| where ResourceProvider == "MICROSOFT.AUTOMATION"
| where Category == "JobLogs"
| where ResultType == "Failed"

Set this as a scheduled query alert that evaluates every 5 minutes and fires when the query returns more than 0 results.
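
The evaluation logic of a scheduled query alert can be reasoned about offline. The following is a minimal sketch, assuming rows shaped like the query results (TimeGenerated and ResultType) and a hypothetical function name alert_fires. It mirrors the idea of counting matching rows inside a window and comparing against a threshold, not the Azure Monitor implementation itself.

```python
from datetime import datetime, timedelta, timezone


def alert_fires(rows, now, window=timedelta(minutes=5), threshold=0):
    """Fire when the count of failed rows inside the window exceeds the threshold."""
    start = now - window
    failed = [
        r for r in rows
        if r["ResultType"] == "Failed" and start < r["TimeGenerated"] <= now
    ]
    return len(failed) > threshold


now = datetime(2025, 1, 20, 9, 0, tzinfo=timezone.utc)
rows = [
    {"TimeGenerated": now - timedelta(minutes=2), "ResultType": "Failed"},
    {"TimeGenerated": now - timedelta(minutes=3), "ResultType": "Completed"},
    {"TimeGenerated": now - timedelta(minutes=30), "ResultType": "Failed"},
]
print(alert_fires(rows, now))      # one failure inside the window
print(alert_fires(rows[1:], now))  # only an old failure remains
```

A failure that falls outside every evaluation window is never counted, which is why the window size should be at least as long as the evaluation frequency.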

Sentinel playbooks - the testing gap that makes them silently broken

Azure Monitor alert rule anatomy - scope, condition, action group, severity

Connecting failures to incident management

Wire failure events to your incident management workflow:

Automation Account job failed → Azure Monitor alert → Logic App → Teams notification or ServiceNow ticket.
For Sentinel playbooks: test every playbook using synthetic test incidents. Playbooks that were never validated often fail silently in production when the real event occurs.
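
The notification step in the chain above usually needs a short, readable message. The following is a minimal sketch of building one, assuming the action group is configured to use the Azure Monitor common alert schema, where rule name, severity, and state sit under data.essentials. The function name summarize_alert and the sample payload are hypothetical and hand-written.

```python
def summarize_alert(payload):
    """Build a one-line message from the essentials block of an alert payload."""
    essentials = payload["data"]["essentials"]
    return "[{severity}] {rule} {condition} at {fired}".format(
        severity=essentials["severity"],
        rule=essentials["alertRule"],
        condition=essentials["monitorCondition"],
        fired=essentials["firedDateTime"],
    )


# Hand-written sample in the shape of the common alert schema (not real output)
sample_alert = {
    "schemaId": "azureMonitorCommonAlertSchema",
    "data": {
        "essentials": {
            "alertRule": "Lab5-RunbookAlert",
            "severity": "Sev3",
            "monitorCondition": "Fired",
            "firedDateTime": "2025-01-20T09:00:00Z",
        },
        "alertContext": {},
    },
}

message = summarize_alert(sample_alert)
print(message)
```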

⚠️Risk - Untested Sentinel Playbooks

A Sentinel playbook that has never been tested with a synthetic incident may have been silently broken for months. Running a playbook only on real incidents means the first time you discover it is broken is during an actual security event.

Service principal naming conventions and ownership

SP naming convention - one display name answers owner, purpose, environment

Owner field plus Notes field - making every SP self-documenting

Why naming conventions matter

Without a naming standard, SP inventories become unmanageable at scale. When a service principal named App1 or test_new shows up in sign-in logs, nothing in the name indicates ownership, purpose, or environment. Orphan detection and lifecycle management depend on being able to tell what an SP is for from its display name.

Recommended naming convention

Use the pattern [team]-[purpose]-[env]:

secops-signinmonitor-prod
devteam-deployagent-staging
itops-caexclusion-prod

This encodes team ownership, automation purpose, and target environment into every display name. Combined with the Notes field and owner assignment, it makes inventory and lifecycle management tractable at scale.
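
A convention is only useful if it is checked. The following is a minimal sketch of a validator for the [team]-[purpose]-[env] pattern, assuming lowercase alphanumeric segments and a hypothetical set of allowed environment names. Both the function name parse_sp_name and the environment list are assumptions to adapt.

```python
import re

# Assumed environment names; replace with the ones your organization uses.
ALLOWED_ENVS = {"prod", "staging", "dev", "test"}
NAME_PATTERN = re.compile(
    r"(?P<team>[a-z0-9]+)-(?P<purpose>[a-z0-9]+)-(?P<env>[a-z0-9]+)"
)


def parse_sp_name(display_name):
    """Return the team, purpose, and env segments, or None if the name does not conform."""
    match = NAME_PATTERN.fullmatch(display_name)
    if not match or match.group("env") not in ALLOWED_ENVS:
        return None
    return match.groupdict()


for name in ["secops-signinmonitor-prod", "App1", "test_new"]:
    print(name, "->", parse_sp_name(name))
```

Run against an inventory export, names that return None form the first clean-up list.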

Notes field and owner requirement

The app registration Notes field is queryable via Graph API and can be included in automated inventory reports. Use it to record:

Owner email address and team
Ticket or change request reference
Purpose description
Last reviewed date

Every app registration must have at least one owner assigned in the Entra portal. Ownerless registrations should be flagged by automated hygiene checks. An SP whose creator left the organization becomes an orphan with no one responsible for it.
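
An automated hygiene check can also verify that the Notes field is complete. The Notes field is free text, so the following sketch assumes a hypothetical "key: value" per line layout with the keys owner, ticket, purpose, and reviewed. The layout, the key names, and the function name missing_note_fields are all assumptions, not a Graph API format.

```python
# Hypothetical keys matching the four items listed above
REQUIRED_KEYS = ("owner", "ticket", "purpose", "reviewed")


def missing_note_fields(notes):
    """Return the required keys that are absent or empty in the notes text."""
    present = set()
    for line in (notes or "").splitlines():
        key, sep, value = line.partition(":")
        if sep and value.strip():
            present.add(key.strip().lower())
    return [key for key in REQUIRED_KEYS if key not in present]


# Illustrative values only
sample_notes = "\n".join([
    "owner: secops@contoso.example",
    "ticket: CHG-0000",
    "purpose: Sign-in monitoring",
])
print(missing_note_fields(sample_notes))
print(missing_note_fields(None))
```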

Service principal lifecycle hygiene

SP Hygiene

Credential expiry alerts

Credential expiry query - sample output highlighting bad names and 7-day-out secrets

Set expiry on all secrets and certificates. Build automation to alert on credentials expiring within 30, 60, and 90 days:

# Requires the Microsoft.Graph.Applications module and Application.Read.All
Connect-MgGraph -Scopes "Application.Read.All"

# Find client secrets expiring in the next 30 days
$cutoff = (Get-Date).AddDays(30)

Get-MgApplication -All | ForEach-Object {
    $app = $_
    $app.PasswordCredentials | Where-Object {
        $_.EndDateTime -lt $cutoff -and $_.EndDateTime -gt (Get-Date)
    } | ForEach-Object {
        [PSCustomObject]@{
            DisplayName = $app.DisplayName
            AppId       = $app.AppId
            SecretHint  = $_.Hint
            ExpiresOn   = $_.EndDateTime
        }
    }
} | Sort-Object ExpiresOn
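
The query above covers a single 30-day window. To route the 30, 60, and 90-day tiers to different audiences, each credential needs a bucket. The following is a minimal sketch of that bucketing, assuming timezone-aware timestamps. The function name expiry_bucket and the bucket labels are hypothetical.

```python
from datetime import datetime, timedelta, timezone

WINDOWS = (30, 60, 90)  # alert tiers in days, smallest first


def expiry_bucket(expires_on, now):
    """Return 'expired', the smallest matching tier such as '30d', or None."""
    if expires_on <= now:
        return "expired"
    remaining = expires_on - now
    for days in WINDOWS:
        if remaining <= timedelta(days=days):
            return f"{days}d"
    return None  # more than 90 days left


now = datetime(2025, 1, 1, tzinfo=timezone.utc)
for offset in (-1, 7, 45, 90, 200):
    print(offset, expiry_bucket(now + timedelta(days=offset), now))
```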

Key Vault emits Event Grid events when certificates approach expiry. Use these events to trigger Logic Apps or Automation Account runbooks that alert or initiate rotation.

90-day activity review

Enterprise apps with no recent sign-in activity are candidates for review and possible deletion. Use sign-in logs as a first-pass heuristic by checking whether the app ID has appeared in the last 90 days:

Import-Module Microsoft.Graph.Applications
Import-Module Microsoft.Graph.Reports
Connect-MgGraph -Scopes "Application.Read.All", "AuditLog.Read.All"

# Enterprise apps with no sign-in activity in the past 90 days
$cutoff = (Get-Date).AddDays(-90)
$cutoffString = $cutoff.ToUniversalTime().ToString("yyyy-MM-ddTHH:mm:ssZ")

Get-MgServicePrincipal -All | ForEach-Object {
    $sp = $_
    $signIns = Get-MgAuditLogSignIn `
        -Filter "appId eq '$($sp.AppId)' and createdDateTime ge $cutoffString" `
        -Top 1

    if (-not $signIns) {
        [PSCustomObject]@{
            DisplayName = $sp.DisplayName
            AppId       = $sp.AppId
            ObjectId    = $sp.Id
        }
    }
} | Format-Table -AutoSize

📚Prerequisite Note

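This query requires both Application.Read.All and AuditLog.Read.All, plus the Microsoft.Graph.Applications and Microsoft.Graph.Reports modules if you are using Graph PowerShell. Treat the result as a review list, not an automatic delete list: appId-based sign-in activity is a heuristic rather than a complete ownership signal, and the v1.0 sign-in log endpoint mainly surfaces interactive user sign-ins, so a service principal that authenticates only with its own credentials can appear inactive here. Confirm against the service principal sign-in logs in the Entra portal before deleting anything.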

Credential expiry policies

Use Entra application authentication method policies to enforce maximum credential lifetimes. Restrict secrets to a maximum of 180 days (or less, depending on your policy). This prevents indefinitely-lived secrets from accumulating.
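
Before enforcing a limit, it helps to know which existing secrets would already exceed it. The following is a minimal sketch of that audit check, assuming you have each credential's start and end timestamps from an inventory export. The function name exceeds_max_lifetime is hypothetical, and the 180-day default mirrors the example limit above.

```python
from datetime import datetime, timedelta, timezone

MAX_LIFETIME = timedelta(days=180)


def exceeds_max_lifetime(start, end, max_lifetime=MAX_LIFETIME):
    """True when the credential's validity period is longer than the allowed maximum."""
    return end - start > max_lifetime


issued = datetime(2025, 1, 1, tzinfo=timezone.utc)
print(exceeds_max_lifetime(issued, issued + timedelta(days=90)))   # within policy
print(exceeds_max_lifetime(issued, issued + timedelta(days=730)))  # two-year secret
```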

Orphan detection

An orphaned SP is one with no current owner assigned and no recent sign-in activity. Build a query that surfaces ownerless registrations:

# Find app registrations with no owner
Get-MgApplication -All | Where-Object {
    -not (Get-MgApplicationOwner -ApplicationId $_.Id)
} | Select-Object DisplayName, AppId, CreatedDateTime

Join this list with the 90-day activity heuristic to produce a single orphan report: no owner and no recent sign-in activity.
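
Joining the two signals gives four possible outcomes per service principal. The following is a minimal sketch of that classification, assuming the owner and activity checks have already been run and reduced to booleans. The function name classify_sp, the outcome labels, and the sample inventory are hypothetical.

```python
def classify_sp(has_owner, recent_sign_in):
    """Map the owner and activity signals to one of four review outcomes."""
    if has_owner and recent_sign_in:
        return "healthy"
    if has_owner:
        return "review-with-owner"  # owned but idle
    if recent_sign_in:
        return "assign-owner"       # active but ownerless
    return "orphan-candidate"       # no owner and no recent activity


# Illustrative inventory rows; display names reuse examples from this section
inventory = [
    {"name": "secops-signinmonitor-prod", "has_owner": True, "recent_sign_in": True},
    {"name": "itops-caexclusion-prod", "has_owner": False, "recent_sign_in": True},
    {"name": "App1", "has_owner": False, "recent_sign_in": False},
]

report = {sp["name"]: classify_sp(sp["has_owner"], sp["recent_sign_in"]) for sp in inventory}
print(report)
```

Only the orphan-candidate rows belong in the orphan report, and even those remain a review list rather than a delete list.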

Graph API orphan detection - three decisions, four outcomes

Lab readiness notes

🧪Lab Readiness

Have sample runbook code, analyzer output, KQL examples, and Graph credential reports ready. Log Analytics and alert rules may not show results immediately; use the saved output if the live signal is delayed. Clean up alert rules, test failures, and sample service principals created during the lab.

Guided lab

Lab goal

Configure source control sync for an Automation Account, lint a runbook with PSScriptAnalyzer, inventory expiring Entra application credentials, send Automation diagnostics to Log Analytics, and create a Monitor alert rule from a Log Analytics query.

Prerequisites

Automation Account in the lab resource group with at least one runbook
Log Analytics workspace in the lab resource group
Graph API access (Microsoft Graph PowerShell or Azure CLI)
Contributor on the lab resource group

Task 1: Source control sync for Automation Account

Azure Portal:

Create the source control connection

Open the Automation Account.
Go to Source control and add a new connection.
Use GitHub, the main branch, and the /runbooks folder.
After the first sync job, open the imported runbook and confirm the portal warns that the file is source-controlled.

Az CLI:

Create the source control connection and trigger a sync job

$resourceGroupName = "<resource-group>"
$automationAccountName = "<automation-account>"
$repoUrl = "https://github.com/<github-owner>/<repo-name>.git"
$githubToken = gh auth token

# Source control sync requires the Automation Account managed identity and a Contributor assignment on the Automation Account itself.
$automationId = az automation account show --automation-account-name $automationAccountName --resource-group $resourceGroupName --query id -o tsv
$automationPrincipalId = az automation account show --automation-account-name $automationAccountName --resource-group $resourceGroupName --query identity.principalId -o tsv

az role assignment create `
  --assignee-object-id $automationPrincipalId `
  --assignee-principal-type ServicePrincipal `
  --role Contributor `
  --scope $automationId | Out-Null

az automation source-control create `
  --resource-group $resourceGroupName `
  --automation-account-name $automationAccountName `
  --name github-runbooks `
  --repo-url $repoUrl `
  --branch main `
  --source-type GitHub `
  --folder-path /runbooks `
  --access-token $githubToken `
  --token-type PersonalAccessToken `
  --auto-sync false `
  --publish-runbook true | Out-Null

# In PowerShell, preserve the required empty string for commit-id with the stop-parsing operator.
az --% automation source-control sync-job create --resource-group <resource-group> --automation-account-name <automation-account> --source-control-name github-runbooks --job-id 11111111-1111-1111-1111-111111111111 --commit-id ""

az automation runbook list `
  --automation-account-name $automationAccountName `
  --resource-group $resourceGroupName `
  --query "[].{name:name,state:state}" `
  -o table

Expected outcomes:
- The sync job reaches Succeeded
- The source-controlled runbook appears in the Automation Account
- The imported runbook shows as Published

Task 2: PSScriptAnalyzer on a runbook

Native PowerShell:

Create a sample runbook and lint it

@'
function Invoke-LegacyLogin {
  param([string]$Password)
  Write-Host "Using password input"
}

Invoke-LegacyLogin -Password "hardcoded-password-value-here"
$password = ConvertTo-SecureString "plaintext" -AsPlainText -Force
'@ | Set-Content -Path .\Sample-Runbook.ps1 -Encoding utf8

Install-Module PSScriptAnalyzer -Scope CurrentUser -Force

Invoke-ScriptAnalyzer -Path .\Sample-Runbook.ps1 -Severity Error, Warning

Invoke-ScriptAnalyzer -Path .\Sample-Runbook.ps1 `
  -IncludeRule PSAvoidUsingPlainTextForPassword,PSAvoidUsingConvertToSecureStringWithPlainText,PSAvoidUsingWriteHost

Expected outcomes:
- PSScriptAnalyzer flags the plain-text password rules
- Write-Host is called out as a lint issue in the sample file
- Re-running after a fix removes the matching rule from the results

Task 3: Graph API credential expiry query

Native PowerShell:

Use a Graph token plus raw REST

$cutoff = (Get-Date).AddDays(30)
$token = az account get-access-token --resource-type ms-graph --query accessToken -o tsv
$headers = @{ Authorization = "Bearer $token" }
$uri = "https://graph.microsoft.com/v1.0/applications?`$select=displayName,appId,passwordCredentials&`$top=999"
$results = @()

do {
  $response = Invoke-RestMethod -Method GET -Uri $uri -Headers $headers
  $results += $response.value
  $uri = $response.'@odata.nextLink'
} while ($uri)

$results | ForEach-Object {
  $app = $_
  $app.passwordCredentials | Where-Object {
    [datetime]$_.endDateTime -lt $cutoff -and [datetime]$_.endDateTime -gt (Get-Date)
  } | ForEach-Object {
    [PSCustomObject]@{
      DisplayName = $app.displayName
      AppId       = $app.appId
      ExpiresOn   = $_.endDateTime
      DaysLeft    = [int]([datetime]$_.endDateTime - (Get-Date)).TotalDays
    }
  }
} | Sort-Object DaysLeft | Format-Table -AutoSize

Graph PowerShell:

Use Invoke-MgGraphRequest for the same inventory

Connect-MgGraph -Scopes "Application.Read.All"

$cutoff = (Get-Date).AddDays(30)
$uri = "https://graph.microsoft.com/v1.0/applications?`$select=displayName,appId,passwordCredentials&`$top=999"
$results = @()

do {
  $response = Invoke-MgGraphRequest -Method GET -Uri $uri
  $results += $response.value
  $uri = $response.'@odata.nextLink'
} while ($uri)

$results | ForEach-Object {
  $app = $_
  $app.passwordCredentials | Where-Object {
    [datetime]$_.endDateTime -lt $cutoff -and [datetime]$_.endDateTime -gt (Get-Date)
  } | ForEach-Object {
    [PSCustomObject]@{
      DisplayName = $app.displayName
      AppId       = $app.appId
      ExpiresOn   = $_.endDateTime
      DaysLeft    = [int]([datetime]$_.endDateTime - (Get-Date)).TotalDays
    }
  }
} | Sort-Object DaysLeft | Format-Table -AutoSize

Az CLI:

Stay in CLI but keep the same pagination logic

$cutoff = (Get-Date).AddDays(30)
$uri = "https://graph.microsoft.com/v1.0/applications?`$select=displayName,appId,passwordCredentials&`$top=999"
$results = @()

do {
  $page = az rest --method get --uri $uri | ConvertFrom-Json
  $results += $page.value
  $uri = $page.'@odata.nextLink'
} while ($uri)

$results | ForEach-Object {
  $app = $_
  $app.passwordCredentials | Where-Object {
    [datetime]$_.endDateTime -lt $cutoff -and [datetime]$_.endDateTime -gt (Get-Date)
  } | ForEach-Object {
    [PSCustomObject]@{
      DisplayName = $app.displayName
      AppId       = $app.appId
      ExpiresOn   = $_.endDateTime
      DaysLeft    = [int]([datetime]$_.endDateTime - (Get-Date)).TotalDays
    }
  }
} | Sort-Object DaysLeft | Format-Table -AutoSize

Expected outcomes:
- The query returns app registrations with credentials expiring inside the chosen window
- The output includes display name, app ID, expiry, and days remaining

Task 4: Log Analytics job diagnostics

Azure Portal:

Enable the Automation diagnostic categories you actually need

Open the Automation Account.
Go to Diagnostic settings and add a setting.
Enable JobLogs, JobStreams, and AuditEvent.
Send them to the Log Analytics workspace.

Az CLI:

Create the diagnostic setting

$automationId = az automation account show `
  --automation-account-name "<automation-account>" `
  --resource-group "<resource-group>" `
  --query id `
  -o tsv

$workspaceId = az monitor log-analytics workspace show `
  --resource-group "<workspace-resource-group>" `
  --workspace-name "<workspace-name>" `
  --query id `
  -o tsv

az monitor diagnostic-settings create `
  --name send-to-law `
  --resource $automationId `
  --workspace $workspaceId `
  --logs '[{"category":"JobLogs","enabled":true},{"category":"JobStreams","enabled":true},{"category":"AuditEvent","enabled":true}]' `
  --metrics '[{"category":"AllMetrics","enabled":true}]'

Az CLI:

Query the workspace

$workspaceCustomerId = az monitor log-analytics workspace show `
  --resource-group "<workspace-resource-group>" `
  --workspace-name "<workspace-name>" `
  --query customerId `
  -o tsv

az monitor log-analytics query `
  --workspace $workspaceCustomerId `
  --analytics-query 'AzureDiagnostics | where ResourceProvider == "MICROSOFT.AUTOMATION" | where Category in ("JobLogs", "JobStreams", "AuditEvent") | where TimeGenerated > ago(1h) | project TimeGenerated, RunbookName_s, Category, ResultType, ResultDescription | order by TimeGenerated desc'

Expected outcomes:
- The diagnostic setting is created successfully
- After the first post-enable runbook job, the Automation records land in Log Analytics
- You can pivot on Category, RunbookName_s, and ResultType

Task 5: Monitor alert for failed runbook

Azure Portal:

Create the alert from the Monitor blade

Go to Monitor > Alerts > Create.
Use the Log Analytics workspace or the Automation Account as the scope, depending on whether you want a query-based or metric-based alert.
Keep the alert disabled until the first successful query proves your data shape.

Az CLI:

Create a disabled scheduled-query alert rule first

$workspaceId = az monitor log-analytics workspace show `
  --resource-group "<workspace-resource-group>" `
  --workspace-name "<workspace-name>" `
  --query id `
  -o tsv

az monitor scheduled-query create `
  --resource-group "<resource-group>" `
  --name "Lab5-RunbookAlert" `
  --scopes $workspaceId `
  --condition "count 'AutomationJobs' > 0" `
  --condition-query AutomationJobs="AzureDiagnostics | where ResourceProvider == 'MICROSOFT.AUTOMATION' | where Category == 'JobLogs' | where ResultType == 'Failed' | where TimeGenerated > ago(15m)" `
  --evaluation-frequency 15m `
  --window-size 15m `
  --severity 3 `
  --disabled true

Expected outcomes:
- The rule is created successfully
- You can inspect the condition safely before enabling it
- Once diagnostics are flowing and the query returns the expected schema, you can enable the rule with az monitor scheduled-query update --disabled false

Common pitfalls

Automation source control sync fails if the Automation Account managed identity is not enabled or does not have Contributor on the Automation Account resource.
In PowerShell, az automation source-control sync-job create requires the stop-parsing operator so the empty --commit-id "" survives the shell.
PSScriptAnalyzer results vary by module version. Install the current gallery version before comparing rule output with classmates.
Automation diagnostics take time to appear after you first enable them. Trigger another runbook job after the setting is saved.
Create the alert rule disabled first if the query schema is still changing. It is easier to inspect and then enable than to debug a noisy live alert.

Admin takeaways

Section takeaways - five operational principles that compound together

Quick recap questions

Why is editing a production runbook directly in the Azure portal a governance problem?
What are two common security issues in AI-generated automation code?
Which PSScriptAnalyzer rule catches plain-text passwords?
What monitoring must be enabled on a Function App to query failures in Log Analytics?
What four fields should the app registration Notes field always contain?
What defines an orphaned service principal?

Key reminders

Treat the Azure portal as read-only for production automation. All changes via pull request.
PSScriptAnalyzer and peer review are complementary, not alternatives - use both.
Application Insights is required for Function App observability. Enable it on every security automation Function App.
Every service principal must have an owner. Ownerless SPs become orphans when creators leave.
A silent automation failure is operationally worse than no automation - alert on every failure.

Section 5 to Section 6 - practices carry forward into solution packaging
