Amplifier Foundation Agent

Smoke Test Agent

Objective validation for shadow environments.
Pass means your changes work. Fail means stop and investigate.

Active microsoft/amplifier-foundation
February 2026
The Problem

Can I trust my shadow?

🐳
Isolated Containers
Shadow environments run your local Amplifier changes in Docker containers. But are your local source snapshots actually being used?
🔀
Git URL Rewriting
Shadows redirect git fetches to local snapshots via URL rewriting. If the rewriting fails silently, you're testing the wrong code.
🏗️
Multi-Repo Complexity
Changes span amplifier-core, amplifier-foundation, and modules. Each repo needs its snapshot verified independently.

Shadow-operator creates the environment. But who verifies it actually works? You need an independent second opinion.
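One way to catch silent rewriting failures is to compare the container's git configuration against the repos you expect to be redirected. The sketch below assumes the rewriting is done with git's `url.<base>.insteadOf` setting, which the deck does not confirm; the function names and the snapshot path in the usage note are hypothetical.

```python
def parse_insteadof(config_output: str) -> dict[str, str]:
    """Map each original URL prefix to the prefix git substitutes for it."""
    # Input is assumed to be the output of:
    #   git config --get-regexp '^url\..*\.insteadof$'
    # where each line looks like: url.<replacement>.insteadof <original>
    # Assumption: neither URL contains spaces.
    rewrites = {}
    for line in config_output.splitlines():
        key, _, original = line.strip().partition(" ")
        lowered = key.lower()
        if not (lowered.startswith("url.") and lowered.endswith(".insteadof")):
            continue
        if not original:
            continue
        replacement = key[len("url."):-len(".insteadof")]
        rewrites[original] = replacement
    return rewrites


def missing_rewrites(expected_urls: list[str], rewrites: dict[str, str]) -> list[str]:
    """Return the expected URLs that no configured rewrite prefix covers."""
    return [
        url for url in expected_urls
        if not any(url.startswith(prefix) for prefix in rewrites)
    ]
```

For example, if the container reports a single rule redirecting `https://github.com/microsoft/amplifier-core` to `file:///snapshots/amplifier-core`, then `missing_rewrites` would flag `https://github.com/microsoft/amplifier-foundation` as not redirected, which is exactly the silent failure described above.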

The Solution

An Independent Validation Agent

A single 290-line agent in amplifier-foundation that runs separately from shadow-operator to provide unbiased verification.

290
Lines of Markdown
100
Point Rubric
15
Individual Checks
Architecture

The 100-Point Rubric

Five categories, each weighted by criticality. Score ≥ 75 to pass.

Category | Points | Checks | What It Validates
Source Verification | 25 | 4 | Snapshot commits exist, git URL rewriting configured, installed package uses snapshot, unregistered repos not redirected
Installation Health | 20 | 3 | Package installs clean, imports work, CLI responds
Code Execution | 30 | 3 | Changed modules load, basic functionality works, integration test passes
Isolation Integrity | 15 | 3 | Container hostname differs, host home not accessible, env vars correct
No Regressions | 10 | 2 | Standard imports work, basic operations complete

Critical failures override the score. If Source Verification, Code Execution, or Installation Health scores zero, the verdict is FAIL regardless of total points.
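The override matters because a zero in one critical category can still leave the total above the threshold. A minimal sketch of the verdict logic, assuming per-category scores are already known (how the agent awards points within a category is not described here, and the names `RUBRIC`, `CRITICAL`, and `verdict` are illustrative):

```python
# Category weights from the rubric table.
RUBRIC = {
    "Source Verification": 25,
    "Installation Health": 20,
    "Code Execution": 30,
    "Isolation Integrity": 15,
    "No Regressions": 10,
}
# Categories whose zero score forces a FAIL.
CRITICAL = ("Source Verification", "Installation Health", "Code Execution")
PASS_THRESHOLD = 75


def verdict(scores: dict[str, int]) -> tuple[str, int]:
    """Return (verdict, total) for a mapping of category name to points earned."""
    for name, points in scores.items():
        if not 0 <= points <= RUBRIC[name]:
            raise ValueError(f"{name}: {points} is outside 0..{RUBRIC[name]}")
    # A category missing from `scores` counts as zero.
    total = sum(scores.get(name, 0) for name in RUBRIC)
    if any(scores.get(name, 0) == 0 for name in CRITICAL):
        return "FAIL", total
    return ("PASS" if total >= PASS_THRESHOLD else "FAIL"), total
```

With every category at full marks except Installation Health at zero, the total is 80, which clears the threshold, yet the verdict is still FAIL. A zero in Isolation Integrity alone gives 85 and passes, because that category is not critical.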

Key Design Decision

Halt, Don't Work Around

The agent's most important design principle: if it can't do its job, it stops immediately and tells you why.

🛑
Mandatory Halt Conditions
No shadow_id provided. Shadow not found. Container not running. Shadow tool unavailable. Any of these = immediate halt with structured error.
🚫
What It Must Never Do
Never test on the host as a "workaround." Never skip shadow-dependent checks. Never give PASS if it couldn't actually run tests.
🔍
Independent Verification
Runs separately from shadow-operator. Doesn't trust claims about the environment. Verifies everything with direct evidence.
📋
Structured Verdicts
Every check produces evidence. The final verdict includes scores per category plus specific pass/fail evidence for each of 15 checks.
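The halt conditions can be read as a preflight gate that runs before any check is scored. The sketch below is illustrative only: the `Halt` fields, the reason codes, and the idea that the four conditions arrive as plain booleans are assumptions, not the agent's actual interface.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Halt:
    """Structured error returned instead of a verdict (field names are hypothetical)."""
    reason: str
    detail: str


def preflight(shadow_id: Optional[str], tool_available: bool,
              shadow_found: bool, container_running: bool) -> Optional[Halt]:
    """Return a Halt for the first unmet precondition, or None if testing may proceed."""
    if not shadow_id:
        return Halt("no_shadow_id", "No shadow_id provided.")
    if not tool_available:
        return Halt("shadow_tool_unavailable", "Shadow tool unavailable.")
    if not shadow_found:
        return Halt("shadow_not_found", f"Shadow {shadow_id!r} not found.")
    if not container_running:
        return Halt("container_not_running", f"Container for {shadow_id!r} is not running.")
    return None
```

The point of returning a `Halt` rather than a score is that a caller cannot mistake "could not run" for "ran and passed": there is no code path here that falls back to testing on the host.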
Amplifier-Specific Tests

What It Actually Validates

Tailored tests for each Amplifier repo type.

# Testing amplifier-core changes
shadow exec <id> "python -c 'from amplifier_core import Session, Coordinator; print(\"OK\")'"

# Testing amplifier-foundation changes
shadow exec <id> "python -c 'from amplifier_foundation import load_bundle; print(\"OK\")'"

# Testing CLI changes
shadow exec <id> "amplifier --version"
shadow exec <id> "amplifier provider install anthropic -q"

# Testing module changes
shadow exec <id> "python -c 'from amplifier_module_tool_filesystem import mount; print(\"OK\")'"
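Choosing the import check per repo can be sketched as a small lookup that builds the command without hand-written quoting. This is an illustration under assumptions: the repo name `amplifier-module-tool-filesystem` is inferred from the import name, and whether `shadow exec` can be invoked as an argument list like this is not confirmed by the deck.

```python
import shlex

# Import statement to try for each changed repo, taken from the commands above.
# The module repo key is an assumed name derived from its import path.
IMPORT_CHECKS = {
    "amplifier-core": "from amplifier_core import Session, Coordinator",
    "amplifier-foundation": "from amplifier_foundation import load_bundle",
    "amplifier-module-tool-filesystem": "from amplifier_module_tool_filesystem import mount",
}


def import_check_command(shadow_id: str, repo: str) -> list[str]:
    """Build the argument list for a `shadow exec` import check of one repo."""
    code = f'{IMPORT_CHECKS[repo]}; print("OK")'
    # shlex.quote keeps the Python snippet intact as one shell word
    # inside the command string passed to the container.
    return ["shadow", "exec", shadow_id, f"python -c {shlex.quote(code)}"]
```

Building the command as a list and quoting the snippet with `shlex.quote` avoids the nested-quote mistakes that are easy to make when typing these commands by hand.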
Results

What a Verdict Looks Like

+================================================================+
| AMPLIFIER SHADOW SMOKE TEST                                    |
| Shadow ID: shadow_abc123                                       |
| Local Sources: amplifier-core, amplifier-foundation            |
| Tested: 2026-02-20T11:00:00Z                                   |
+================================================================+

## Source Verification (25/25)
- [x] Snapshot commits exist
- [x] Git URL rewriting configured
- [x] Installed package uses snapshot
- [x] Unregistered repos NOT redirected

## Installation Health (20/20)
- [x] Package installs without errors
- [x] Package imports successfully
- [x] CLI tools respond

Total Score: 100/100 | Pass Threshold: 75
VERDICT: PASS
Ecosystem

Part of the Shadow Stack

Three components work together to provide safe, isolated testing of multi-repo Amplifier changes.

🏗️
shadow-operator
Creates and manages shadow environments. Snapshots local repos, builds Docker containers, sets up git URL rewriting. Lives in the shadow bundle.
🔬
amplifier-smoke-test
Independent validation agent. Runs the 100-point rubric against the shadow. Lives in amplifier-foundation. The subject of this deck.
🧪
shadow-smoke-test
Generic shadow validation (466 lines). Tests container health without Amplifier-specific knowledge. Lives in the shadow bundle.

Activated via behavior: The shadow-amplifier.yaml behavior in amplifier-foundation includes the shadow bundle and adds this Amplifier-specific smoke test agent.

Development Velocity

6
Commits
1
Repository
19
Days (Jan 16 – Feb 4)
1
Contributor

Primary contributor: Brian Krabach (6 of 6 commits, 100%)

Repository: microsoft/amplifier-foundation

Sources

Research Methodology

Data as of: February 20, 2026

Feature status: Active (in main branch of amplifier-foundation)

Research performed:

Gaps and corrections from prior deck:

Primary contributor: Brian Krabach (100% of smoke-related commits)

Get Started

Try It

Available as part of amplifier-foundation's shadow-amplifier behavior.

# Enable the shadow-amplifier behavior in your config
# This includes the shadow bundle + Amplifier smoke test agent

# Then after creating a shadow environment:
"Validate that the shadow environment is using my local amplifier-core"

# The agent runs the 100-point rubric and returns PASS/FAIL

github.com/microsoft/amplifier-foundation

Open Source Part of Foundation