GitHub - theonekeyg/boxer: Sandbox server for executing untrusted code in gVisor isolated containers

Sandboxed container execution powered by gVisor

Boxer is a sandboxed container execution service backed by gVisor. It exposes a simple HTTP API for running arbitrary commands inside any container image, with strong isolation guarantees and configurable resource limits.

Why Boxer

Running untrusted code is a hard problem. Docker alone provides namespace isolation but shares the host kernel - a compromised container can exploit kernel vulnerabilities and escape. Boxer wraps every execution in gVisor's user-space kernel (runsc), which intercepts and validates all system calls before they reach the host. The attack surface is dramatically reduced.

This makes Boxer a good fit for:

LLM training and inference pipelines - execute model-generated code safely without exposing your host to arbitrary syscalls
Decentralized oracle evaluations - run untrusted verification scripts submitted by network participants
Prediction markets and agent frameworks - evaluate outcomes by executing code from unknown sources
Code execution as a service - any scenario where you need to run unstructured, user-supplied, or LLM-generated code at scale

How It Works

A client sends a POST /run request with a container image, command, optional files, and resource limits.
Boxer pulls and caches the image rootfs locally (shared read-only across executions).
It constructs a hardened OCI bundle and spawns runsc (gVisor) to execute the command.
Stdout, stderr, wall time, and exit code are returned in the response.

Files can be uploaded before a run and bind-mounted read-only inside the container. Output files written to /output/ inside the container are captured and retrievable after the run.

Getting Started

Docker (recommended)

docker run -d --privileged -p 8080:8080 theonekeyg/boxer

Or with Docker Compose:

# Replace "main" with the version tag you want to deploy, e.g. refs/tags/v1.0.0
curl -fsSL https://raw.githubusercontent.com/theonekeyg/boxer/main/docker-compose.prod.yml -o docker-compose.prod.yml
docker compose -f docker-compose.prod.yml up -d

Why --privileged? Boxer manages cgroups and network namespaces and spawns gVisor to sandbox each execution. See the Docker installation guide for details.

Build from Source

Prerequisites: gVisor runsc in PATH, Go 1.22+

cd packages/core
go run . --config config.dev.json

The server listens on :8080 by default. Configuration can also be set via $BOXER_CONFIG or ~/.boxer/config.json.

Python SDK

pip install boxer-sdk

from boxer import BoxerClient

with BoxerClient("http://localhost:8080") as client:
    result = client.run(
        image="python:3.12-slim",
        cmd=["python3", "-c", "print('hello world')"],
    )
    print(result.stdout)    # hello world
    print(result.exit_code) # 0
    print(result.wall_ms)   # e.g. 312

See packages/sdk/python for the full SDK reference including async support, file upload/download, resource limits, and error handling.

TypeScript / Node.js SDK

npm install boxer-sdk

import { BoxerClient } from "boxer-sdk";

const client = new BoxerClient({ baseUrl: "http://localhost:8080" });
const result = await client.run(
  "python:3.12-slim",
  ["python3", "-c", "print('hello world')"],
);
console.log(result.stdout);    // hello world
console.log(result.exit_code); // 0
console.log(result.wall_ms);   // e.g. 312

See packages/sdk/typescript for the full SDK reference including file upload/download, resource limits, and error handling.

REST API

curl -s http://localhost:8080/run \
  -H 'Content-Type: application/json' \
  -d '{"image":"python:3.12-slim","cmd":["python3","-c","print(42)"]}'

Swagger UI is available at http://localhost:8080/swagger.

Examples

`examples/hello-world`

Minimal Python script that runs a sandboxed "hello world" via the Boxer SDK. Good starting point for understanding the basic client flow.

`examples/upload-and-run`

Uploads a local Python project (source + tests) to Boxer and runs its pytest suite inside a sandboxed container. Demonstrates the file upload workflow and output capture.

`examples/humaneval`

Evaluates OpenAI's o3-mini on the HumanEval benchmark (164 code-generation problems). Each LLM-generated solution is executed inside the Boxer sandbox and scored by exit code. A real-world example of using Boxer in an LLM evaluation pipeline.

Contributing

Before writing any code, please check the open issues. If your bug report, feature request, or proposal is not already tracked there, open an issue first and describe what you want to do. Once the issue is confirmed, open a pull request that references it.

This keeps discussion focused, avoids duplicate work, and ensures effort is spent on changes that will be accepted.

Name		Name	Last commit message	Last commit date
Latest commit History 239 Commits
.github/workflows		.github/workflows
.greptile		.greptile
.vscode		.vscode
_assets		_assets
docker		docker
docs		docs
examples		examples
packages		packages
scripts		scripts
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.prod.yml		docker-compose.prod.yml
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sandboxed container execution powered by gVisor

Why Boxer

How It Works

Getting Started

Docker (recommended)

Build from Source

Python SDK

TypeScript / Node.js SDK

REST API

Examples

`examples/hello-world`

`examples/upload-and-run`

`examples/humaneval`

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Sandboxed container execution powered by gVisor

Why Boxer

How It Works

Getting Started

Docker (recommended)

Build from Source

Python SDK

TypeScript / Node.js SDK

REST API

Examples

examples/hello-world

examples/upload-and-run

examples/humaneval

Contributing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`examples/hello-world`

`examples/upload-and-run`

`examples/humaneval`

Packages