Agent Sandboxing — htek.dev

// the big picture

The Isolation Spectrum

Before diving into the full taxonomy, here's the landscape. Every sandbox sits somewhere on this spectrum — the tradeoff is always isolation strength vs. overhead.

🏠

Protect Your Machine

Keep agents out of your files, secrets, and home directory. Stop accidental reads of ~/.ssh and .env.

Bubblewrap, Firejail

Zero overhead. Instant. Linux.

📦

Isolate Workloads

Disposable environments that spin up and throw away. Agents get a fresh sandbox each task — state doesn't bleed between runs.

Docker, E2B, Fly.io

Ephemeral. Cross-platform.

🔐

Govern Access

Control exactly which APIs, methods, and endpoints an agent can reach. Policy-as-code enforcement at the network level.

NVIDIA OpenShell

Policy-as-code. Enterprise.

// isolation taxonomy

Isolation Levels — The Full Taxonomy

Seven levels, each wrapping the ones below. The further out you go, the stronger the boundary — and the higher the operational cost.

Level	Tech	How It Works	Kernel Shared?	Escape Risk	Overhead	OS
L1: Process	seccomp-bpf, Landlock	Syscall filtering and filesystem restrictions on a single process	Yes	Medium	Near-zero	Linux
L2: Namespace	Bubblewrap, Firejail	Linux namespaces (PID, mount, net, user) wrap a process in isolation	Yes	Low-Medium	Near-zero	Linux
L3: Container	Docker, Podman, Incus	Full container with cgroups, namespaces, and layered filesystem	Yes	Low (CVEs exist)	Low	Linux, macOS, Windows
L4: Userspace Kernel	gVisor	Intercepts syscalls in userspace, emulates kernel behavior — host kernel never touched	Partial	Low	Medium	Linux
L5: MicroVM	Firecracker, Cloud Hypervisor	Dedicated kernel per workload via hardware virtualization (KVM)	No	Very Low	Medium-High	Linux (KVM)
L6: Full VM	QEMU/KVM, Hyper-V	Complete OS isolation with dedicated hardware abstraction	No	Minimal	High	Cross-platform
L7: Policy Engine	NVIDIA OpenShell	Combines multiple isolation levels + L7 HTTP inspection per binary/endpoint	Varies	Very Low	Medium	Linux (k3s)

L1: Process seccomp-bpf, Landlock

Protects

Filesystem paths
Syscall surface

When to Use

Low-trust local scripts, helper processes, tight latency requirements

Trade-off

Shared kernel — kernel exploit can escape the sandbox

L2: Namespace Bubblewrap, Firejail

Protects

Filesystem
Network
Process tree
User IDs

When to Use

Local agent execution, dev machines, Linux-native workflows

Trade-off

Linux-only; still shares kernel with host

L3: Container Docker, Podman, Incus

Protects

Filesystem
Network
Process tree
Resource limits

When to Use

Cross-platform agent workloads, ephemeral environments, most cloud deployments

Trade-off

Kernel escape CVEs (runc, containerd history); not true hardware isolation

L4: Userspace Kernel gVisor

Protects

Filesystem
Network
Syscall surface
Host kernel

When to Use

Untrusted code execution in cloud, GKE workloads, moderate performance requirement

Trade-off

Performance overhead on syscall-heavy workloads; not all syscalls supported

L5: MicroVM Firecracker, Cloud Hypervisor

Protects

Filesystem
Network
Kernel
Memory isolation

When to Use

Cloud sandboxes (E2B, Fly.io), multi-tenant agent platforms, high-security requirements

Trade-off

Requires KVM; cold start 100ms–1s; harder local dev

L6: Full VM QEMU/KVM, Hyper-V

Protects

Full OS stack
Hardware abstraction
Network
Storage

When to Use

Maximum security, legacy workloads, compliance-mandated isolation

Trade-off

High overhead; slow cold start (minutes); operational complexity

L7: Policy Engine NVIDIA OpenShell

Protects

Filesystem
Network (method+path)
Processes
API endpoints
GPU access

When to Use

Enterprise agent governance, GPU inference, multi-tenant, policy-as-code requirement

Trade-off

Per-endpoint policy maintenance; k3s cluster required; seconds cold start

// tool comparison

Every Tool Compared

19 tools across the entire spectrum — from Linux-native namespace wrappers to managed cloud MicroVM platforms.

Tool	Level	OS	GPU	Self-Hosted	Open Source	Cold Start	Pricing
Bubblewrap (bwrap)	L2	Linux	—	Yes	LGPL	Instant	Free
Firejail	L2	Linux	—	Yes	GPL	Instant	Free
Docker Sandboxes	L3	Lin/Mac/Win	Limited	Yes	Partial	Sub-second	Free
Clampdown	L3	Linux/macOS	—	Yes	Yes	Fast	Free
code-on-incus	L3	Linux	—	Yes	Yes	Fast	Free
Daytona	L3	Cross-platform	Yes	Yes	AGPL	<90ms	~$0.08/hr
BoxLite	L3+	Linux/macOS	—	Yes	Rust	Fast	Free
Sandbox0	L3	Linux	—	Yes	Go	Fast	Free
ContainAI	L3	Linux	—	Yes	MIT	Fast	Free
Cagent	L3+	Linux	—	Yes	Yes	Fast	Free
Modal	L4	Cloud	Best	No	No	Sub-second	Pay/sec
Google Agent Sandbox	L4+	GKE	Yes	No	Yes	Varies	GKE pricing
k8s-sigs/agent-sandbox	L4+	Any K8s	Yes	Yes	Yes	Varies	Free
E2B	L5	Cloud	—	Terraform	Partial	~150ms	~$0.08/hr
Fly.io Sprites	L5	Cloud	—	No	No	1–12s	Per-second
Northflank	L5	Cloud	—	BYOC	No	Fast	Per-second
Alibaba OpenSandbox	L3	Linux	—	Yes	Apache 2.0	Varies	Free
Microsandbox	L5	Linux	—	Yes	Apache 2.0	Varies	Free
NVIDIA OpenShell	L7	Linux (k3s)	Yes	Yes	Apache 2.0	Seconds	Free

// choose your sandbox

Choose by Threat Model

Don't pick a sandbox by vibes. Pick it by what you're defending against. Here's the decision map.

🏠

Local filesystem access

Agent reads ~/.ssh, .env, credentials

L2 Bubblewrap, Firejail

🌐

Network exfiltration

Agent sends data to unauthorized endpoints

L3–L7 Docker (port-level), OpenShell (method-level)

💣

Destructive commands

Agent runs rm -rf, drops tables

L2–L3 Bubblewrap (read-only mounts), Docker

🔑

Privilege escalation

Agent gains root, installs packages

L1–L3 seccomp, containers

🧬

Kernel exploit

Agent escapes via kernel vulnerability

L5–L6 Firecracker, QEMU (separate kernel)

📡

API abuse

Agent calls wrong endpoints or methods

L7 OpenShell (HTTP path + method inspection)

🔄

State persistence

Agent needs to survive restarts

L3+ BoxLite (snapshots), OpenShell, Sandbox0

⚡

Latency-sensitive workflows

Interactive agent, can't wait for boot

L1–L2 seccomp (zero), Bubblewrap (instant)

🎮

GPU inference

Agent needs local model inference

L5–L7 OpenShell (DGX/RTX), Modal, K8s

// before you decide

Considerations

Nine things worth thinking through before you pick your sandbox strategy.

⚖️

Performance vs Security

L2 is instant but shares the kernel. L5 is isolated but adds cold start overhead. Pick based on your actual threat model — most agents don't need MicroVM protection.

🖥️

OS Portability

Bubblewrap, Firejail, and Landlock are Linux-only. If your agents run on macOS or Windows dev machines, containers are your minimum viable sandbox.

📋

Policy Maintenance

L7 policy engines require per-endpoint maintenance. Is your team prepared to define and update network policies per agent binary? It's powerful but carries operational cost.

🚪

Escape Surface

Containers share the kernel — container escape CVEs (runc, containerd) have a documented history. MicroVMs isolate at hardware. Know what you're trusting.

🔍

Debugging Complexity

More isolation means harder debugging. Audit logs help but add overhead. Plan your observability strategy before locking down the sandbox.

🎮

GPU Passthrough

Most lightweight sandboxes can't pass GPUs to containerized workloads. If your agent needs local inference, plan for L5+ or a managed GPU sandbox like Modal or OpenShell.

💾

Statefulness

Ephemeral sandboxes (E2B, Firecracker) are great for short-lived tasks. Persistent agents (long-running sessions, file editing) need snapshot support or persistent volumes.

🕐

Cold Start

Instant (bwrap) → sub-second (Docker) → ~150ms (E2B) → 1–12s (Fly.io Sprites) → minutes (full VM). Match cold start budget to the UX you need.

🌐

DNS in Cluster Sandboxes

K8s-based sandboxes can have DNS issues in child namespaces. Test your agent's DNS resolution behavior early — silent DNS failures are a common gotcha.

// the bigger picture

How Sandboxing Fits the Stack

Sandboxing is Layer 0 — the execution boundary beneath everything else. It's necessary but not sufficient. Above it sit instructions, hooks, and CI/CD gates that together make agentic safety structural.

Layer	What	Where
0: Sandbox	Execution boundary	This page ← you are here
1: Instructions	Context engineering	Agent-Proof Architecture
2: Hooks	Tool-call interception	Copilot CLI Hooks
3: Gates	CI/CD validation	Agentic Workflows

💡

Sandboxing is Layer 0, not the whole solution

A sandbox defines the execution boundary — what the agent can and cannot touch. But it doesn't tell the agent what to do. Instructions (Layer 1) set context and intent. Hooks (Layer 2) enforce rules in real time. CI/CD gates (Layer 3) catch what slips through. The full Agentic DevOps stack uses all four layers together.

// deep dives

Ready to Sandbox Your Agents?

I help engineering teams design and implement agent execution boundaries — from namespace-level isolation for local dev to policy-governed MicroVM environments for production. Let's talk.

Book a Free Consultation

Your Agent Has Access. The Question Is: How Much?

The Isolation Spectrum

Protect Your Machine

Isolate Workloads

Govern Access

Isolation Levels — The Full Taxonomy

Protects

When to Use

Trade-off

Protects

When to Use

Trade-off

Protects

When to Use

Trade-off

Protects

When to Use

Trade-off

Protects

When to Use

Trade-off

Protects

When to Use

Trade-off

Protects

When to Use

Trade-off

Every Tool Compared

Choose by Threat Model

Local filesystem access

Network exfiltration

Destructive commands

Privilege escalation

Kernel exploit

API abuse

State persistence

Latency-sensitive workflows

GPU inference

Considerations

Performance vs Security

OS Portability

Policy Maintenance

Escape Surface

Debugging Complexity

GPU Passthrough

Statefulness

Cold Start

DNS in Cluster Sandboxes

How Sandboxing Fits the Stack

Sandboxing is Layer 0, not the whole solution

Further Reading

The Sandbox Your AI Agents Should Be Running In

NVIDIA OpenShell and the Rise of Agent Sandboxes

Agentic DevOps Hub

Building Agent-Proof Architecture

Ready to Sandbox Your Agents?