Amplifier Browser Runtime

AI as a Document

What happens when you put an AI agent framework
inside a single HTML file

February 2026 · Active

Every AI product today requires...

🔑

An Account

Sign up, verify email, set up a profile

๐Ÿ—๏ธ

An API Key

Generate, store, rotate, protect

🌐

Internet

Always-on connection to a remote server

💳

A Subscription

Monthly fees, usage limits, billing anxiety

🤞

Trust

That your data stays private on someone else's server

What if none of that was necessary?

One file. Everything inside.

file:///ai-tutor.html
Explain how photosynthesis works
Photosynthesis is the process by which plants convert sunlight, water, and CO₂ into glucose and oxygen. It occurs in two stages: the light-dependent reactions in the thylakoid membranes...
Amplifier kernel via Pyodide/WebAssembly
Local LLM via WebLLM/WebGPU
Specialized bundle with domain knowledge
Complete chat interface

Open it in Chrome. No server. No API key. No internet after first model download.

Five layers, one file

Your Browser
→
Pyodide (WASM)
→
amplifier-core
→
WebLLM Provider
→
WebGPU · Your GPU

~6 lines to initialize

const amp = new AmplifierBrowser();
await amp.init({ model: "Phi-3.5-mini-instruct-q4f16_1-MLC" });
const reply = await amp.chat("Explain quantum entanglement");

What happens under the hood

The AmplifierBrowser facade boots Python inside WebAssembly via Pyodide. The same amplifier-core kernel that powers terminal sessions initializes in your browser tab.

WebLLM loads a quantized model directly onto your GPU through WebGPU. No network calls. No intermediary. Your prompt goes straight to local silicon.
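The flow just described can be sketched in a few lines. This is a hypothetical reconstruction of the facade's wiring, not the actual AmplifierBrowser source: the two loaders are injected as parameters so the control flow can run anywhere, whereas the real runtime would obtain them from the pyodide and WebLLM packages.

```javascript
// Hypothetical sketch of the boot sequence AmplifierBrowser performs.
// `loadPyodide` and `createEngine` stand in for the real Pyodide and
// WebLLM entry points; they are injected so this wiring is testable
// outside a browser.
async function bootAmplifier({ loadPyodide, createEngine }, model) {
  const pyodide = await loadPyodide();      // 1. Python kernel boots in WASM
  const engine = await createEngine(model); // 2. quantized model loads onto the GPU
  return {
    pyodide,
    // 3. prompts go straight to the local engine: no network, no intermediary
    chat: (prompt) => engine.chat(prompt),
  };
}
```

The dependency injection here is only for illustration; the point is the ordering — Python kernel first, GPU engine second, then a chat surface over both.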

Running on YOUR GPU

General Purpose

Phi-3.5-mini

Microsoft's compact powerhouse. Strong reasoning for its size, ideal for general Q&A, tutoring, and code assistance.

~4 GB VRAM · 3.8B parameters
Fast Responses

Llama-3.2-3B

Meta's speed-optimized model. Quick responses for conversational use, summaries, and lightweight tasks.

~4 GB VRAM · 3B parameters
Multilingual · Quality

Qwen2.5-7B

Alibaba's multilingual model. Higher quality output with support for many languages. Needs more VRAM but delivers more capable results.

~8 GB VRAM · 7B parameters
🔒 Your data never leaves your device. Fully offline after first model download.
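Choosing between these models comes down to available VRAM. A minimal sketch of that decision, assuming the catalog above; only the Phi-3.5 id appears in this document's init example, so treat the other id strings as illustrative stand-ins for WebLLM's naming convention.

```javascript
// Hypothetical helper: pick the most capable model that fits in the
// available VRAM. Figures mirror the catalog above; ids other than
// Phi-3.5-mini are assumed, not confirmed.
const CATALOG = [
  { id: "Qwen2.5-7B-Instruct-q4f16_1-MLC",   vramGB: 8 }, // quality, multilingual
  { id: "Phi-3.5-mini-instruct-q4f16_1-MLC", vramGB: 4 }, // general purpose
  { id: "Llama-3.2-3B-Instruct-q4f16_1-MLC", vramGB: 4 }, // fast responses
];

function pickModel(availableVramGB) {
  // Catalog is ordered largest-first, so the first fit is the most capable.
  const fits = CATALOG.filter((m) => m.vramGB <= availableVramGB);
  return fits.length ? fits[0].id : null; // null: no model fits this GPU
}
```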

When AI is just a file...

📚

Education

A teacher sends students an HTML file. Inside is a specialized tutor that knows the curriculum, answers questions, and works offline in any classroom.

๐Ÿข

Enterprise

A company distributes a portable knowledge base. No server infrastructure. No ongoing costs. Every employee gets AI on their laptop.

🔒

Hostile Environments

A journalist in a sensitive region has AI assistance without any network traffic. No server logs. No API calls to intercept. Complete operational security.

💻

Prototyping

A developer prototypes agent behavior in a browser tab. No deployment pipeline. No cloud account. Open a file, iterate, ship.

Same kernel.
Different mineral.

⬛

Terminal

macOS, Linux, Windows: the classic CLI experience

amplifier-core
📦

Docker

Containerized workloads with full isolation

amplifier-core
🔗

SSH

Remote machines and build servers

amplifier-core
🌐

Browser

WebAssembly: the new frontier

amplifier-core
~2,600
Lines of Kernel
5
Protocol Contracts
4
Runtimes

Same protocols. Same composition model. Same bundle format. The platform() function in the kernel has 'wasm' as a first-class return value; the browser was a design target from day one, not an afterthought.

Built and tested by an agent

The webruntime-developer agent doesn't just build browser AI apps; it tests them autonomously with Playwright and WebGPU flags. No untested code reaches the user.

🔨
Build HTML
🧪
Test
🔧
Fix
✓
Re-test
📦
Deliver

Headless WebGPU testing

Playwright launches Chromium with WebGPU flags enabled. The agent generates tests, runs them headless, reads console output on failure, and iterates until everything works.
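A minimal sketch of such a harness, assuming the commonly used Chromium switches for headless WebGPU; the agent's actual harness and exact flags are not shown in this document and may differ by platform. Playwright's browser type is passed in so the helper itself carries no hard dependency.

```javascript
// Hypothetical test harness: launch Chromium with WebGPU switches and
// collect console errors from the page under test. `chromium` would be
// require("playwright").chromium in a real run.
const WEBGPU_FLAGS = [
  "--enable-unsafe-webgpu",   // expose WebGPU in headless Chromium
  "--enable-features=Vulkan", // Vulkan backend, often needed on Linux CI hosts
];

async function collectPageErrors(chromium, url) {
  const browser = await chromium.launch({ headless: true, args: WEBGPU_FLAGS });
  const page = await browser.newPage();
  const errors = [];
  page.on("console", (msg) => {
    if (msg.type() === "error") errors.push(msg.text()); // agent reads these on failure
  });
  await page.goto(url);
  await browser.close();
  return errors; // empty array means the app loaded clean
}
```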

Agent-driven iteration

When a test fails, the agent reads the console logs, diagnoses the issue, fixes the code, and re-runs. This loop continues until the app works correctly; no human debugging required.
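The loop above reduces to a small piece of control flow. This is a hypothetical sketch, not the agent's actual implementation: build, test, and fix stand in for the agent's real tools and are injected so the loop can be exercised on its own.

```javascript
// Hypothetical sketch of the build → test → fix loop described above.
// A round cap keeps a stubborn failure from spinning forever.
async function iterate({ build, test, fix }, maxRounds = 5) {
  let app = await build();
  for (let round = 0; round < maxRounds; round++) {
    const failures = await test(app);      // run the headless suite
    if (failures.length === 0) return app; // everything works: deliver
    app = await fix(app, failures);        // patch using the console logs
  }
  throw new Error("still failing after " + maxRounds + " rounds");
}
```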

AI as a document
changes everything

📧

Email it

Attach an AI assistant to a message. The recipient opens a file and has a working AI.

💾

USB drive

Carry an AI in your pocket. Works anywhere with a modern browser and a GPU.

🌐

Static website

Host AI on a $0/month static site. No backend. No database. No ops team.

📖

Embed anywhere

Drop an AI into documentation, a wiki, an internal portal; it just works.

No accounts. No infrastructure. No ongoing costs.
The file IS the application.

Research Methodology

Data as of: February 26, 2026

Feature status: Active (code present in production repositories with active development)

Research performed:

  • Source code analysis of amplifier_webruntime.py: confirmed imports from amplifier_core.interfaces (Provider, ContextManager, Tool, Orchestrator)
  • Protocol analysis of amplifier_env_common/protocol.py: confirmed 'wasm' as first-class platform() return value
  • WebLLM bundle analysis: confirmed WebGPU integration, model catalog, and provider implementation
  • Agent definition review: webruntime-developer in amplifier-foundation confirmed with Playwright + WebGPU testing workflow
  • Kernel line count: ~2,600 lines with 5 protocol contracts across 4 runtimes

Repositories: amplifier-core (kernel), amplifier-foundation (webruntime-developer agent), webllm bundle (browser LLM provider)

Model VRAM requirements: From WebLLM model catalog documentation. Actual VRAM usage may vary by quantization and browser.

Primary contributors: Not determined; research did not extract individual contributor attribution from git history for this deck. The browser runtime spans multiple repos and contributors.

Gaps: No independent VRAM benchmarks were run. Model parameter counts are from published model cards. Exact initialization line count (~6 lines) is from the AmplifierBrowser facade API surface, not a formal LOC metric. Individual contributor commit shares were not analyzed.

Every other AI product requires infrastructure.
Amplifier's browser runtime requires a browser.

That's not a limitation. It's liberation.

Explore Amplifier on GitHub โ†’
More Amplifier Stories