92 CUDA kernels · GPU clustering, anomaly detection & PageRank · Multi-user immersive XR · 83 agent skills · OWL 2 ontology governance · Nostr DID identity · Solid Pod sovereignty
Why VisionClaw? · Quick Start · Capabilities · Architecture · Performance · Documentation · Contributing
They just haven't told you yet.
73% of frontline AI adoption happens without management sign-off. Your workforce is already building shadow workflows, stitching together AI agents, automating procurement shortcuts, inventing cross-functional pipelines that don't appear on any org chart. The question isn't whether your organisation is becoming an agentic mesh. It's whether you'll shape how it forms.
The personal agent revolution has a governance problem. Tools like Claude Code have shown that autonomous AI agents are powerful, popular, and ready to act. They've also shown what happens when agents operate without shared semantics, formal reasoning, or organisational guardrails: unauthorised actions, prompt injection attacks, and enterprises deploying security scanners just to detect rogue agent instances on their own networks.
VisionClaw takes the opposite approach. Governance isn't an inhibitor; it's an accelerant.
VisionClaw is an open-source platform that transforms organisations into governed agentic meshes where autonomous AI agents, human judgment, and institutional knowledge work together through a shared semantic substrate.
The platform ingests knowledge from Logseq notebooks via GitHub, reasons over it with an OWL 2 EL inference engine (Whelk-rs), renders the result as an interactive 3D graph where nodes attract or repel based on their semantic relationships, and exposes everything to AI agents through 7 Model Context Protocol tools. Users collaborate in the same space through multi-user XR presence, spatial voice, and immersive graph exploration. Every agent decision is semantically grounded, every mutation passes consistency checking, and every reasoning chain is auditable from edge case back to first principles.
VisionClaw is production-proven: currently operational at the DreamLab residential training lab, augmenting a 50-person creative technology team, and validated in partnership with a major UK creative studio and the University of Salford.
GPU-accelerated force-directed graph — 934 nodes responding to spring, repulsion, and ontology-driven semantic forces in real time
Chloe Nevitt interacting with Prof Rob Aspin's precursor to VisionClaw in the Octave Multimodal Lab, University of Salford, 2017
🧠 Semantic Governance · ⚡ GPU-Accelerated Physics · 🤖 83 Agent Skills · 🌐 Multi-User Immersive XR · 🔐 Self-Sovereign Identity · 🔊 Voice Routing (4-Plane Architecture)
```bash
git clone https://github.com/DreamLab-AI/VisionClaw.git
cd VisionClaw && cp .env.example .env
docker-compose --profile dev up -d
```

| Service | URL | Description |
|---|---|---|
| Frontend | http://localhost:3001 | 3D knowledge graph interface (via Nginx) |
| API (direct) | http://localhost:4000/api | REST + WebSocket endpoints (Rust/Actix-web) |
| Neo4j Browser | http://localhost:7474 | Graph database explorer |
| JSS Solid | http://localhost:3030 | Solid Pod server |
| Vircadia | ws://localhost:3020/world/ws | Multi-user WebSocket endpoint |
Enable voice routing (LiveKit + whisper + TTS)
```bash
docker-compose -f docker-compose.yml -f docker-compose.voice.yml --profile dev up -d
```

Adds LiveKit SFU (port 7880), turbo-whisper STT (CUDA), and Kokoro TTS. Requires GPU for real-time transcription.
Enable multi-user XR (Vircadia World Server)
```bash
docker-compose -f docker-compose.yml -f docker-compose.vircadia.yml --profile dev up -d
```

Adds Vircadia World Server with avatar sync, HRTF spatial audio, and collaborative graph editing.
Native Rust + CUDA build
```bash
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
git clone https://github.com/DreamLab-AI/VisionClaw.git
cd VisionClaw && cp .env.example .env
cargo build --release --features gpu
cd client && npm install && npm run build && cd ..
./target/release/webxr
```

Requires CUDA 13.1 toolkit. On CachyOS, set `CUDA_ROOT=/opt/cuda`. See Deployment Guide for full GPU setup.
VisionClaw implements a three-layer agentic mesh. Insights bubble up from frontline discovery, orchestrated through formal semantic pipelines, governed by declarative policy — with humans as the irreplaceable judgment layer at the top.
flowchart TB
subgraph Layer3["LAYER 3 — DECLARATIVE GOVERNANCE"]
JB["Judgment Broker\n(Human-in-the-Loop)"]
Policy["AI-Enforced Policies\nBias · Security · Alignment"]
Trust["Cascading Trust\nNostr DID Identity"]
end
subgraph Layer2["LAYER 2 — ORCHESTRATION"]
Skills["83 Agent Skills\nClaude-Flow DAG Pipelines"]
Ontology["OWL 2 EL Reasoning\nWhelk-rs Inference Engine"]
MCP["7 MCP Tools\nKnowledge Graph Read/Write"]
GPU["GPU Compute\n92 CUDA Kernels"]
end
subgraph Layer1["LAYER 1 — DISCOVERY ENGINE"]
Ingest["Knowledge Ingestion\nLogseq · GitHub · RSS"]
Graph["Neo4j Knowledge Graph\n+ RuVector pgvector Memory"]
Viz["3D Visualisation\nR3F · Babylon.js · WebXR"]
Voice["Voice Routing\n4-Plane Architecture"]
end
Layer1 -->|"Insights bubble up"| Layer2
Layer2 -->|"Exceptions surface"| Layer3
Layer3 -->|"Governance flows down"| Layer2
Layer2 -->|"Validated workflows deploy"| Layer1
style Layer3 fill:#1A0A2A,stroke:#8B5CF6
style Layer2 fill:#0A1A2A,stroke:#00D4FF
style Layer1 fill:#0A2A1A,stroke:#10B981
The discovery layer ingests, structures, and renders organisational knowledge as a navigable, interactive 3D space.
Ontology Pipeline — VisionClaw syncs Logseq markdown from GitHub, parses OWL 2 EL axioms embedded in `### OntologyBlock` sections, runs Whelk-rs inference for subsumption and consistency checking, and stores results in both Neo4j (persistent) and an in-memory OntologyRepository (fast access). GPU semantic forces use the ontology to drive layout physics — `subClassOf` creates attraction, `disjointWith` creates repulsion.
flowchart LR
GH["GitHub\n(Logseq pages)"] -->|SHA1 delta sync| Parser["OWL Parser\n(assembler → converter)"]
Parser --> Whelk["Whelk-rs Reasoner\n(EL++ inference)"]
Whelk --> Store["OntologyRepository\n(In-Memory + Neo4j)"]
Store --> Physics["Semantic Forces\n(GPU CUDA)"]
Store --> Agents["Agent Tools\n(7 MCP tools)"]
Store --> Client["3D Visualisation\n(binary WebSocket V2/V3)"]
Explore a live ontology dataset at narrativegoldmine.com — a 2D interactive graph built on the same ontology data VisionClaw renders in 3D.
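The axiom-to-force mapping (`subClassOf` pulls, `disjointWith` pushes) can be sketched in plain Rust. The enum, magnitudes, and force laws here are illustrative stand-ins, not the actual CUDA kernel:

```rust
// Illustrative only: maps ontology axiom types to force signs, the way the
// GPU semantic-force pass treats subClassOf as attraction and disjointWith
// as repulsion. Names and magnitudes are hypothetical.

#[derive(Clone, Copy, PartialEq, Debug)]
enum Axiom {
    SubClassOf,   // child is pulled toward parent
    DisjointWith, // disjoint classes push apart
}

/// Signed 1-D force magnitude for a pair of nodes at distance `d`.
/// Positive pulls the pair together; negative pushes them apart.
fn semantic_force(axiom: Axiom, d: f32) -> f32 {
    match axiom {
        Axiom::SubClassOf => 0.5 * d,                    // spring-like attraction
        Axiom::DisjointWith => -1.0 / (d * d).max(1e-6), // inverse-square repulsion
    }
}

fn main() {
    let pull = semantic_force(Axiom::SubClassOf, 2.0);
    let push = semantic_force(Axiom::DisjointWith, 2.0);
    println!("subClassOf: {pull}, disjointWith: {push}");
    assert!(pull > 0.0 && push < 0.0);
}
```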
Dual-Renderer 3D Visualisation — React Three Fiber (Three.js) powers the desktop graph view with InstancedMesh, SharedArrayBuffer zero-copy positions, and custom TSL/WebGL materials. Babylon.js powers the immersive XR mode — the two renderers coexist but never overlap, with R3F suspended when entering XR.
RuVector AI Memory Substrate — 1.17M+ agent memory entries in PostgreSQL + pgvector with HNSW indexing. MiniLM-L6-v2 generates 384-dim embeddings at ingestion. Semantic search at 61µs p50 via HNSW (vs ~5ms brute-force). All agent memory flows through MCP tools — raw SQL bypasses the embedding pipeline.
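What the HNSW index short-circuits is an exhaustive nearest-neighbour scan. A brute-force sketch, with toy 2-dim vectors standing in for the 384-dim MiniLM embeddings (`nearest` is our name, not a VisionClaw API):

```rust
// What HNSW avoids: exhaustive cosine similarity over every stored
// embedding, O(n·d) per query. The real search runs inside pgvector.

fn cosine(a: &[f32], b: &[f32]) -> f32 {
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let na: f32 = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let nb: f32 = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    dot / (na * nb)
}

/// Index of the most similar memory entry by brute force.
fn nearest(query: &[f32], memory: &[Vec<f32>]) -> usize {
    memory
        .iter()
        .enumerate()
        .max_by(|(_, a), (_, b)| cosine(query, a).total_cmp(&cosine(query, b)))
        .map(|(i, _)| i)
        .unwrap()
}

fn main() {
    let memory = vec![vec![1.0, 0.0], vec![0.0, 1.0], vec![0.7, 0.7]];
    assert_eq!(nearest(&[0.9, 0.1], &memory), 0); // closest to the x-axis entry
    assert_eq!(nearest(&[0.5, 0.6], &memory), 2); // closest to the diagonal entry
}
```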
Node geometry and material system
| Node Type | Geometry | Material | ID Encoding |
|---|---|---|---|
| Knowledge (public pages) | Icosahedron r=0.5 | GemNodeMaterial — analytics-driven colour | Bit 30 set (0x40000000) |
| Ontology | Sphere r=0.5 | CrystalOrbMaterial — depth-pulsing cosmic spectrum | Bits 26-28 set (0x1C000000) |
| Agent | Capsule r=0.3 h=0.6 | AgentCapsuleMaterial — bioluminescent heartbeat | Bit 31 set (0x80000000) |
| Linked pages | Icosahedron r=0.35 | GemNodeMaterial | No flag bits |
Agent visual states: #10b981 (idle) · #fbbf24 (spawning/active) · #ef4444 (error) · #f97316 (busy). Animation: breathing pulse (active), error flicker (failed), slow pulse (idle).
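The flag bits in the ID column can be checked with plain bit masks. A sketch — the constants mirror the documented masks, but `NodeKind` and `classify` are illustrative names, not the client's actual types:

```rust
// Node-ID flag scheme: bits 26–31 of the u32 node ID encode node type,
// the low bits carry the actual ID.

const AGENT_FLAG: u32 = 0x8000_0000;     // bit 31
const KNOWLEDGE_FLAG: u32 = 0x4000_0000; // bit 30
const ONTOLOGY_FLAGS: u32 = 0x1C00_0000; // bits 26–28

#[derive(Debug, PartialEq)]
enum NodeKind {
    Agent,
    Knowledge,
    Ontology,
    LinkedPage, // no flag bits set
}

fn classify(id: u32) -> NodeKind {
    if id & AGENT_FLAG != 0 {
        NodeKind::Agent
    } else if id & KNOWLEDGE_FLAG != 0 {
        NodeKind::Knowledge
    } else if id & ONTOLOGY_FLAGS == ONTOLOGY_FLAGS {
        NodeKind::Ontology
    } else {
        NodeKind::LinkedPage
    }
}

fn main() {
    assert_eq!(classify(AGENT_FLAG | 42), NodeKind::Agent);
    assert_eq!(classify(KNOWLEDGE_FLAG | 7), NodeKind::Knowledge);
    assert_eq!(classify(ONTOLOGY_FLAGS | 3), NodeKind::Ontology);
    assert_eq!(classify(99), NodeKind::LinkedPage);
}
```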
Voice routing (4-plane architecture)
| Plane | Direction | Scope | Trigger |
|---|---|---|---|
| 1 | User mic → turbo-whisper STT → Agent | Private | PTT held |
| 2 | Agent → Kokoro TTS → User ear | Private | Agent responds |
| 3 | User mic → LiveKit SFU → All users | Public (spatial) | PTT released |
| 4 | Agent TTS → LiveKit → All users | Public (spatial) | Agent configured public |
Opus 48kHz mono end-to-end. HRTF spatial panning from Vircadia entity positions.
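A minimal sketch of the plane-selection logic the table implies, assuming the pair (speaker, PTT state / agent visibility) fully determines the plane; the enum and function are ours, not the Audio Router's actual types:

```rust
// Hypothetical routing decision for the four voice planes.

#[derive(Debug, PartialEq, Clone, Copy)]
enum Speaker {
    User { ptt_held: bool },
    Agent { public: bool },
}

/// Returns the plane number (1–4) a voice frame is routed to.
fn plane(s: Speaker) -> u8 {
    match s {
        Speaker::User { ptt_held: true } => 1,  // mic → STT → agent (private)
        Speaker::User { ptt_held: false } => 3, // mic → LiveKit SFU → all (spatial)
        Speaker::Agent { public: false } => 2,  // TTS → user ear (private)
        Speaker::Agent { public: true } => 4,   // TTS → LiveKit → all (spatial)
    }
}

fn main() {
    assert_eq!(plane(Speaker::User { ptt_held: true }), 1);
    assert_eq!(plane(Speaker::Agent { public: true }), 4);
}
```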
Logseq ontology input (source data)
| Ontology metadata | Graph structure |
|---|---|
| OWL entity page with category, hierarchy, and source metadata | Graph view showing semantic clusters |
Dense knowledge graph in Logseq — the raw ontology VisionClaw ingests, reasons over, and renders in 3D
The orchestration layer is where agents reason, coordinate, and act — always against the shared semantic substrate of the OWL 2 ontology.
83 Specialist Agent Skills — Claude-Flow coordination with RAFT consensus hive-mind and 83 skill modules spanning creative production, research, knowledge codification, governance, workflow discovery, financial intelligence, spatial/immersive, and identity/trust domains. Agents are assigned cryptographic Nostr DID identities and appear as physics nodes in the 3D graph — their status (idle, working, error) drives visual state changes in real time via the Agent-Physics Bridge.
Why OWL 2 Is the Secret Weapon — Most agentic systems fail at scale because they lack a shared language. In VisionClaw, agents reason against a common OWL 2 ontology. The same concept of "deliverable" means the same thing to a Creative Production agent and a Governance agent. Agent skill routing isn't keyword matching — it's ontological subsumption. The orchestration layer knows that a "risk assessment" is a sub-task of "governance review", and routes accordingly.
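Subsumption-based routing can be illustrated with a transitive `subClassOf` walk. This toy single-parent hierarchy stands in for what Whelk-rs computes over full OWL 2 EL:

```rust
// Illustrative subsumption check behind skill routing: walk subClassOf
// edges transitively. The class names are made up; real reasoning handles
// multiple parents and the full EL++ profile.

use std::collections::HashMap;

fn is_subclass_of(sub: &str, sup: &str, parents: &HashMap<&str, &str>) -> bool {
    if sub == sup {
        return true; // subsumption is reflexive
    }
    let mut cur = sub;
    while let Some(&p) = parents.get(cur) {
        if p == sup {
            return true;
        }
        cur = p;
    }
    false
}

fn main() {
    let mut parents = HashMap::new();
    parents.insert("RiskAssessment", "GovernanceReview");
    parents.insert("GovernanceReview", "GovernanceTask");

    // "risk assessment" routes to any skill that handles "governance review"
    assert!(is_subclass_of("RiskAssessment", "GovernanceReview", &parents));
    assert!(is_subclass_of("RiskAssessment", "GovernanceTask", &parents));
    assert!(!is_subclass_of("GovernanceTask", "RiskAssessment", &parents));
}
```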
7 Ontology Agent Tools (MCP) — Read/write access to the knowledge graph via Model Context Protocol:
| Tool | Purpose |
|---|---|
| `ontology_discover` | Semantic keyword search with Whelk inference expansion |
| `ontology_read` | Enriched note with axioms, relationships, schema context |
| `ontology_query` | Validated Cypher execution with schema-aware label checking |
| `ontology_traverse` | BFS graph traversal from starting IRI |
| `ontology_propose` | Create/amend notes → consistency check → GitHub PR |
| `ontology_validate` | Axiom consistency check against Whelk reasoner |
| `ontology_status` | Service health and statistics |
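For orientation, the JSON-RPC envelope of an MCP `tools/call` request to one of these tools might look like the following; the `query` argument name is an assumption, not taken from VisionClaw's tool schema:

```rust
// Builds an MCP tools/call request by hand (no external crates).
// The jsonrpc/method/params envelope follows the Model Context Protocol
// convention; the argument payload is hypothetical.

fn tool_call(id: u32, tool: &str, args_json: &str) -> String {
    format!(
        r#"{{"jsonrpc":"2.0","id":{id},"method":"tools/call","params":{{"name":"{tool}","arguments":{args_json}}}}}"#
    )
}

fn main() {
    let req = tool_call(1, "ontology_discover", r#"{"query":"risk assessment"}"#);
    assert!(req.contains(r#""name":"ontology_discover""#));
    println!("{req}");
}
```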
GPU-Accelerated Compute — 92 CUDA kernel functions across 11 kernel files (6,585 LOC) run server-authoritative graph layout and analytics. The physics pipeline (force-directed layout, semantic forces, ontology constraints, stress majorisation) runs at 60 Hz with a periodic full broadcast every 300 iterations to keep late-connecting clients synchronised. The analytics pipeline (K-Means clustering, Louvain community detection, LOF anomaly detection, PageRank) runs on-demand via API, with results streamed in the V3 binary protocol's analytics fields.
| Metric | Result |
|---|---|
| CUDA kernel functions | 92 across 11 files |
| GPU vs CPU speedup | 55× |
| V2 protocol size | 36 bytes/node |
| V3 protocol size (+ analytics) | 48 bytes/node |
| WebSocket latency | 10ms |
| Binary vs JSON bandwidth | 80% reduction |
Binary WebSocket Protocol (V2/V3)
High-frequency position updates use a compact binary protocol instead of JSON, achieving 80% bandwidth reduction.
V2 Standard (36 bytes/node) — production default:
| Bytes | Field | Type | Description |
|---|---|---|---|
| 0–3 | Node ID | u32 | Flag bits 26-31 encode node type |
| 4–15 | Position (X/Y/Z) | f32×3 | World-space position |
| 16–27 | Velocity (X/Y/Z) | f32×3 | Physics velocity |
| 28–31 | SSSP distance | f32 | Shortest-path from source |
| 32–35 | Timestamp | u32 | ms since session start |
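The V2 byte map decodes mechanically; this sketch assumes little-endian wire order, which the table does not state:

```rust
// Decodes one 36-byte V2 node record per the documented byte map.
// Little-endian is an assumption.

#[derive(Debug, PartialEq)]
struct NodeV2 {
    id: u32,              // flag bits 26–31 encode node type
    position: [f32; 3],   // world-space position
    velocity: [f32; 3],   // physics velocity
    sssp_distance: f32,   // shortest-path from source
    timestamp_ms: u32,    // ms since session start
}

fn f32_at(buf: &[u8], off: usize) -> f32 {
    f32::from_le_bytes(buf[off..off + 4].try_into().unwrap())
}

fn parse_v2(buf: &[u8; 36]) -> NodeV2 {
    NodeV2 {
        id: u32::from_le_bytes(buf[0..4].try_into().unwrap()),
        position: [f32_at(buf, 4), f32_at(buf, 8), f32_at(buf, 12)],
        velocity: [f32_at(buf, 16), f32_at(buf, 20), f32_at(buf, 24)],
        sssp_distance: f32_at(buf, 28),
        timestamp_ms: u32::from_le_bytes(buf[32..36].try_into().unwrap()),
    }
}

fn main() {
    let mut buf = [0u8; 36];
    buf[0..4].copy_from_slice(&7u32.to_le_bytes());
    buf[4..8].copy_from_slice(&1.5f32.to_le_bytes());
    let n = parse_v2(&buf);
    assert_eq!(n.id, 7);
    assert_eq!(n.position[0], 1.5);
}
```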
V3 Analytics (48 bytes/node) — includes GPU analytics:
Adds `cluster_id` (u16), `anomaly_score` (f32), `community_id` (u16), `page_rank` (f32) at bytes 36–47.
V4 Delta (16 bytes/changed node) — experimental, not production-ready. See Known Issues → WS-001.
BroadcastOptimizer (V2 production): server-side culling that skips unchanged nodes. Distinct from V4 delta encoding — same pipeline, different concept. See websocket-binary.md for the full disambiguation.
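The BroadcastOptimizer's culling, plus the 300-iteration full broadcast from the physics pipeline, can be sketched as follows; the movement threshold and data shapes are illustrative, not the production values:

```rust
// Sketch of server-side culling: skip nodes whose position moved less than
// a threshold since the last broadcast, and force a full broadcast every
// 300 iterations so late-connecting clients catch up.

const FULL_BROADCAST_EVERY: u64 = 300;
const EPSILON: f32 = 1e-3; // illustrative movement threshold

fn changed(prev: [f32; 3], cur: [f32; 3]) -> bool {
    prev.iter().zip(&cur).any(|(a, b)| (a - b).abs() > EPSILON)
}

/// Indices of nodes to send this tick.
fn to_broadcast(tick: u64, prev: &[[f32; 3]], cur: &[[f32; 3]]) -> Vec<usize> {
    if tick % FULL_BROADCAST_EVERY == 0 {
        return (0..cur.len()).collect(); // periodic full sync
    }
    (0..cur.len()).filter(|&i| changed(prev[i], cur[i])).collect()
}

fn main() {
    let prev = [[0.0, 0.0, 0.0], [1.0, 1.0, 1.0]];
    let cur = [[0.0, 0.0, 0.0], [1.0, 1.5, 1.0]];
    assert_eq!(to_broadcast(1, &prev, &cur), vec![1]);     // only the moved node
    assert_eq!(to_broadcast(300, &prev, &cur).len(), 2);   // full broadcast
}
```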
Agent skill domains (83 skills)
Creative Production — Script, storyboard, shot-list, grade & publish workflows. ComfyUI orchestration for image, video, and 3D asset generation via containerised API middleware.
Research & Synthesis — Multi-source ingestion, GraphRAG, semantic clustering, Perplexity integration.
Knowledge Codification — Tacit-to-explicit extraction; OWL concept mapping; Logseq-formatted output.
Governance & Audit — Bias detection, provenance chains (content-addressed Nostr beads), declarative policy enforcement.
Workflow Discovery — Shadow workflow detection; DAG proposal & validation against ontology.
Financial Intelligence — R&D tax modelling, grant pipeline, ROI attribution.
Spatial & Immersive — XR scene graph, light field, WebXR rendering agent, Blender MCP, ComfyUI SAM3D.
Identity & Trust — DID management, key rotation, Nostr agent communications, NIP-26 delegation.
Development & Quality — Rust development, pair programming, agentic QE fleet (111+ sub-agents), GitHub code review, performance analysis.
Infrastructure & DevOps — Docker management, Kubernetes ops, Linux admin, network analysis, monitoring.
Document Processing — LaTeX, DOCX, XLSX, PPTX, PDF generation and manipulation.
The governance layer is what separates VisionClaw from every "move fast and break things" agent framework. Policies are code. Bias thresholds, access controls, and audit trails are embedded into every DAG transition — not bolted on afterwards.
The Judgment Broker — Freed from reporting, forecasting, and coordination (all automated), the Judgment Broker focuses on three irreplaceable human capacities:
- Strategic direction — Only humans decide what the organisation should be doing next year, and whether the mesh is pointed at it.
- Ethical adjudication — Bias, fairness, and consequence live in human judgment. No agent is the final word on edge cases.
- Relational intelligence — Trust, culture, and coalition-building are the lubrication layer no algorithm can replicate.
HITL by Design — The Human-in-the-Loop is an architectural feature. Agents know their authority boundary and surface exceptions cleanly. Every ontology mutation passes through a GitHub pull request, giving human reviewers full visibility and veto over structural changes before commit.
Ontological Provenance — Every agent decision traces back through the OWL 2 knowledge graph. Auditors can traverse the full reasoning chain agent-by-agent, task-by-task. Every action is recorded as an immutable bead — content-addressed, cryptographically verifiable (Nostr NIP-33) — with a deterministic lifecycle state machine (ADR-034), exhaustive outcome classification, retry with exponential backoff, and structured learning capture. Inspired by NEEDLE's trait-based architecture. See the Bead Provenance PRD and DDD Bounded Context.
Cascading Trust Hierarchies — Nostr DID-based agent identities with W3C-compliant key rotation. NIP-26 delegation allows agents to act on behalf of users within scoped permissions. When an agent is revoked, the revocation cascades through dependent agents automatically.
Mesh KPIs — measuring what matters in a governed agentic organisation
| KPI | Formula | Target | What It Measures |
|---|---|---|---|
| Mesh Velocity | Δt(insight → codified workflow) | < 48h | How fast a discovered shortcut becomes a sanctioned, reusable DAG |
| Augmentation Ratio | Cognitive load offloaded ÷ Total cognitive load | > 65% | Percentage of decision-making handled by agents without human escalation |
| Trust Variance | σ(Agent Decision Quality) over 30-day window | < 0.12σ | Drift or bias monitoring in the automated task layer |
| HITL Precision | Correct escalations ÷ Total escalations | > 90% | Are the edge cases the mesh flags actually requiring human intervention? |
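Two of the formulas above, worked with made-up inputs (the functions and numbers are illustrative, not platform code):

```rust
// Worked examples of two mesh KPI formulas.

/// HITL Precision = correct escalations / total escalations (target > 0.90).
fn hitl_precision(correct: u32, total: u32) -> f32 {
    correct as f32 / total as f32
}

/// Augmentation Ratio = offloaded cognitive load / total load (target > 0.65).
fn augmentation_ratio(offloaded: f32, total: f32) -> f32 {
    offloaded / total
}

fn main() {
    assert!(hitl_precision(46, 50) > 0.90);        // 46/50 = 0.92 — target met
    assert!(augmentation_ratio(70.0, 100.0) > 0.65); // 0.70 — target met
}
```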
How shadow workflows become sanctioned organisational intelligence:
flowchart LR
D["DISCOVERY\nPassive agent monitoring\ndetects the pattern"]
C["CODIFICATION\nIRIS maps the new path\nas a proposed DAG —\nOWL 2 formalised\nwith provenance"]
V["VALIDATION\nThe Judgment Broker\nreviews for strategic\nfit & bias"]
I["INTEGRATION\nPromoted to live mesh\nwith SLAs, ownership,\nquality"]
A["AMPLIFICATION\nMesh propagates\npattern to other\nteams where it applies"]
D --> C --> V --> I --> A
style D fill:#0A2A1A,stroke:#10B981
style C fill:#0A1A2A,stroke:#00D4FF
style V fill:#1A0A2A,stroke:#8B5CF6
style I fill:#0A1A2A,stroke:#00D4FF
style A fill:#0A2A1A,stroke:#10B981
flowchart TB
subgraph Client["Browser Client (React 19 + Three.js / Babylon.js)"]
R3F["React Three Fiber\n(desktop graph)"]
BabylonXR["Babylon.js\n(immersive XR)"]
BinProto["Binary Protocol V2/V3"]
Voice["Voice Orchestrator"]
WasmFX["WASM Scene Effects\n(zero-copy Float32Array)"]
end
subgraph Server["Rust Backend (Actix-web · Hexagonal · CQRS)"]
Handlers["HTTP/WS Handlers\n(9 ports · 12 adapters)"]
Actors["21 Actix Actors\n(supervised concurrency)"]
Services["OWL Ontology Pipeline\n(Whelk-rs EL++)"]
AudioRouter["Audio Router\n(LiveKit SFU)"]
MCP["MCP Tool Server\n(:9500 TCP)"]
end
subgraph Data["Data Layer"]
Neo4j[("Neo4j 5.13\n(primary graph store)")]
RuVector[("RuVector PostgreSQL\n(pgvector + HNSW)")]
Solid["Solid Pod\n(JSS sidecar · user data)"]
end
subgraph GPU["GPU Compute (CUDA 13.1)"]
Kernels["92 CUDA Kernels"]
Physics["Force Physics\n+ Semantic Forces"]
Analytics["K-Means · Louvain\nPageRank · LOF Anomaly"]
end
subgraph Agents["Multi-Agent Stack"]
Skills["83 Agent Skills"]
ClaudeFlow["Claude-Flow\n(RAFT hive-mind)"]
NostrDID["Nostr DID Identities\n(NIP-26 delegation)"]
end
Client <-->|"Binary V2/V3 + REST"| Server
Voice <-->|"LiveKit SFU + Opus"| AudioRouter
Server <--> Neo4j
Server <--> RuVector
Server <--> Solid
Server <--> GPU
MCP <--> Agents
Agents -->|"GitHub PRs (ontology mutations)"| Services
style Client fill:#e1f5ff,stroke:#0288d1
style Server fill:#fff3e0,stroke:#ff9800
style Data fill:#f3e5f5,stroke:#9c27b0
style GPU fill:#e8f5e9,stroke:#4caf50
style Agents fill:#fce4ec,stroke:#e91e63
Hexagonal architecture (9 ports · 12 adapters · 114 CQRS handlers)
VisionClaw follows strict hexagonal architecture. Business logic in `src/services/` depends only on port traits in `src/ports/`. Concrete implementations live in `src/adapters/`, swapped at startup via dependency injection. All mutations flow through one of 114 command handlers; all reads through query handlers. No direct database access from handlers.
flowchart LR
subgraph Ports["src/ports/ (Traits)"]
GP[GraphRepository]
OR[OntologyRepository]
IE[InferenceEngine]
GPA[GpuPhysicsAdapter]
GSA[GpuSemanticAnalyzer]
SR[SettingsRepository]
SP[SolidPodRepository]
NR[NostrRelay]
VR[VectorRepository]
end
subgraph Adapters["src/adapters/ (Implementations)"]
Neo4jGraph[Neo4jGraphRepository]
Neo4jOntology[Neo4jOntologyRepository]
Whelk[WhelkInferenceEngine]
CudaPhysics[PhysicsOrchestratorAdapter]
JSSPod[JssSolidPodAdapter]
RuVectorAdapter[RuVectorAdapter]
end
subgraph Services["src/services/ (Business Logic)"]
OQS[OntologyQueryService]
OMS[OntologyMutationService]
GPS[GitHubPRService]
OPS[OntologyPipelineService]
end
Services --> Ports
Adapters -.->|implements| Ports
style Ports fill:#e8f5e9,stroke:#4caf50
style Adapters fill:#fff3e0,stroke:#ff9800
style Services fill:#e1f5ff,stroke:#0288d1
| Port Trait | Adapter | Purpose |
|---|---|---|
| `GraphRepository` | `ActorGraphRepository` | Graph CRUD via actor messages |
| `OntologyRepository` | `Neo4jOntologyRepository` | OWL class/axiom storage |
| `InferenceEngine` | `WhelkInferenceEngine` | OWL 2 EL reasoning |
| `GpuPhysicsAdapter` | `PhysicsOrchestratorAdapter` | CUDA force simulation |
| `GpuSemanticAnalyzer` | `GpuSemanticAnalyzerAdapter` | GPU semantic forces |
| `SolidPodRepository` | `JssSolidPodAdapter` | User Pod CRUD via JSS |
| `VectorRepository` | `RuVectorAdapter` | pgvector HNSW semantic search |
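The port/adapter/service split can be sketched as follows; the trait and method are simplified stand-ins for the real `GraphRepository` port:

```rust
// Minimal hexagonal-architecture sketch: the service depends on a port
// trait, a concrete adapter implements it, and the two are wired at startup.

trait GraphRepository {
    fn node_count(&self) -> usize;
}

// In-memory adapter — the production equivalent would be the
// actor-backed Neo4j implementation.
struct InMemoryGraph {
    nodes: Vec<String>,
}

impl GraphRepository for InMemoryGraph {
    fn node_count(&self) -> usize {
        self.nodes.len()
    }
}

// Business logic sees only the trait object, never the adapter type.
struct GraphStatsService {
    repo: Box<dyn GraphRepository>,
}

impl GraphStatsService {
    fn summary(&self) -> String {
        format!("{} nodes", self.repo.node_count())
    }
}

fn main() {
    let service = GraphStatsService {
        repo: Box::new(InMemoryGraph { nodes: vec!["a".into(), "b".into()] }),
    };
    assert_eq!(service.summary(), "2 nodes");
}
```

Swapping `InMemoryGraph` for a database-backed adapter changes no service code, which is what makes the 114 CQRS handlers testable in isolation.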
21-Actor supervision tree
The backend uses Actix actors for supervised concurrency. GPU actors form a hierarchy: GraphServiceSupervisor → PhysicsOrchestratorActor → ForceComputeActor. All actors restart automatically on failure per their supervision strategy.
GPU Physics Actors:
| Actor | Purpose |
|---|---|
| `ForceComputeActor` | Core force-directed layout (CUDA) — 60Hz |
| `StressMajorizationActor` | Stress majorisation algorithm |
| `ClusteringActor` | K-Means + Louvain community detection (GPU) |
| `PageRankActor` | GPU PageRank centrality computation |
| `ShortestPathActor` | Delta-stepping SSSP (GPU) |
| `ConnectedComponentsActor` | Label propagation component detection (GPU) |
| `AnomalyDetectionActor` | LOF / Z-score anomaly detection (GPU) |
| `SemanticForcesActor` | OWL-driven attraction/repulsion constraints |
| `ConstraintActor` | Layout constraint solving |
| `AnalyticsSupervisor` | GPU analytics orchestration |
| `BroadcastOptimizerActor` | Delta-filter + periodic full-broadcast (300 iters) |
Service Actors:
| Actor | Purpose |
|---|---|
| `GraphStateActor` | Canonical graph state — single source of truth |
| `OntologyActor` | OWL class management and Whelk bridge |
| `ClientCoordinatorActor` | Per-client session management + WebSocket |
| `PhysicsOrchestratorActor` | Delegates to GPU actors, manages convergence |
| `SemanticProcessorActor` | NLP query processing |
| `VoiceCommandsActor` | Voice-to-action routing |
| `TaskOrchestratorActor` | Background task scheduling |
| `GitHubSyncActor` | Incremental GitHub sync (SHA1 delta) |
| `OntologyPipelineActor` | Assembler → converter → Whelk pipeline |
| `GraphServiceSupervisor` | Top-level GPU supervision and restart |
DDD bounded contexts (10 contexts)
VisionClaw implements Domain-Driven Design with 10 bounded contexts across three domain rings:
Core Domain: Knowledge Graph · Ontology Governance · Physics Simulation
Supporting Domain: Authentication (Nostr NIP-98) · Identity (DID/Solid) · Agent Orchestration · Semantic Analysis
Generic Domain: User Management · Bead Provenance (ADR-034) · Configuration
Each context has its own aggregate roots, domain events, and anti-corruption layers. Cross-context communication uses domain events, never direct model sharing. See DDD Bounded Contexts.
| Deployment | Context | Scale |
|---|---|---|
| DreamLab Creative Hub | 50-person creative technology team — live production deployment | ~998 knowledge graph nodes, daily ontology mutations |
| University of Salford | Research partnership validating semantic force-directed layout for academic knowledge graphs | Multi-institution ontology |
| THG World Record | Large-scale multi-user immersive data visualisation event — see THG project | 250+ concurrent XR users |
| Metric | Result | Conditions |
|---|---|---|
| GPU physics speedup | 55× | vs single-threaded CPU |
| HNSW semantic search | 61µs p50 | RuVector pgvector, 1.17M entries |
| WebSocket latency | 10ms | Local network, V2 binary |
| Bandwidth reduction | 80% | Binary V2 vs JSON |
| Concurrent XR users | 250+ | Vircadia World Server |
| Position update size | 36 bytes (V2) / 48 bytes (V3) | Per node |
| CUDA kernels | 92 | 6,585 LOC across 11 files |
| Agent concurrency | 50+ | Via actor supervisor tree |
| Physics convergence | ~600 frames (~10s) | Typical graph at rest |
Full technology breakdown
| Layer | Technology | Detail |
|---|---|---|
| Backend | Rust 2021 · Actix-web | 427 files, 175K LOC · hexagonal CQRS · 9 ports · 12 adapters · 114 handlers |
| Frontend (desktop) | React 19 · Three.js 0.182 · R3F | 370 files, 96K LOC · TypeScript 5.9 · InstancedMesh · SAB zero-copy |
| Frontend (XR) | Babylon.js | Immersive/VR mode — Quest 3 foveated rendering, hand tracking |
| WASM | Rust → wasm-pack | scene-effects crate: zero-copy Float32Array view over WebAssembly.Memory |
| Graph DB | Neo4j 5.13 | Primary store · Cypher queries · bolt protocol |
| Vector Memory | RuVector PostgreSQL · pgvector | 1.17M+ entries · HNSW 384-dim · MiniLM-L6-v2 embeddings · 61µs search |
| GPU | CUDA 13.1 · cudarc | 92 kernel functions · 6,585 LOC · PTX ISA auto-downgrade in build.rs |
| Ontology | OWL 2 EL · Whelk-rs | EL++ subsumption · consistency checking · 20 source files |
| XR | WebXR · Babylon.js | Meta Quest 3 · hand tracking · foveated rendering · ?force=quest3 |
| Multi-User | Vircadia World Server | Avatar sync · spatial HRTF audio · entity CRUD · collaborative editing |
| Voice | LiveKit SFU · turbo-whisper · Kokoro | CUDA STT · TTS · Opus 48kHz · 4-plane routing |
| Identity | Nostr NIP-07/NIP-98 · DID | Browser extension signing · NIP-26 delegation · W3C key rotation |
| User Data | Solid Pods · JSS sidecar | Per-user data sovereignty · WAC access control · JSON-LD |
| Agents | Claude-Flow · MCP · RAFT | 83 skills · 7 ontology tools · hive-mind consensus |
| AI/ML | GraphRAG · RAGFlow | Knowledge retrieval · semantic inference |
| Build | Vite 6 · Vitest · Playwright | Frontend build · unit tests · E2E tests |
| Infra | Docker Compose | 15+ services · multi-profile (dev/prod/voice/xr) |
| CI | GitHub Actions | Build · test · docs quality · ontology federation |
VisionClaw uses the Diataxis framework — 106 markdown files across four categories, 46 with embedded Mermaid diagrams:
| Category | Path | Content |
|---|---|---|
| Tutorials | `docs/tutorials/` | First graph, Neo4j basics, platform overview |
| How-To Guides | `docs/how-to/` | Deployment, agents, XR setup, performance profiling, features, operations |
| Explanation | `docs/explanation/` | Architecture, DDD, ontology, GPU physics, XR, security, Solid/Nostr, deployment topology |
| Reference | `docs/reference/` | REST API, WebSocket protocol, Neo4j schema, agents catalog, error codes |
Key entry points:
- Full Documentation Hub — all 106 docs indexed
- Known Issues — active P1/P2 bugs (read before debugging)
- Architecture Overview
- Deployment Guide
- Deployment Topology — 15-service map, network architecture, dependency chain
- Quest 3 VR Setup
- Agent Orchestration
- REST API Reference
- WebSocket Binary Protocol
- Performance Profiling
| Tool | Version | Purpose |
|---|---|---|
| Rust | 2021 edition | Backend |
| Node.js | 20+ | Frontend |
| Docker + Docker Compose | — | Services |
| CUDA Toolkit | 13.1 | GPU acceleration (optional) |
```bash
# Backend
cargo build --release
cargo test

# Frontend
cd client && npm install && npm run build && npm test

# Integration tests
cargo test --test ontology_agent_integration_test
```

| Tier | CPU | RAM | GPU | Use Case |
|---|---|---|---|---|
| Minimum | 4-core 2.5GHz | 8 GB | Integrated | Development · < 10K nodes |
| Recommended | 8-core 3.0GHz | 16 GB | GTX 1060 / RX 580 | Production · < 50K nodes |
| Enterprise | 16+ cores | 32 GB+ | RTX 4080+ (16GB VRAM) | Large graphs · multi-user XR |
Platform support: Linux (full GPU) · macOS (CPU-only) · Windows (WSL2) · Meta Quest 3 (Beta)
Key environment variables
Copy `.env.example` and configure:
| Variable | Description |
|---|---|
| `NEO4J_URI` | Neo4j bolt connection (default: `bolt://localhost:7687`) |
| `NEO4J_USER` / `NEO4J_PASSWORD` | Neo4j credentials |
| `RUVECTOR_PG_CONNINFO` | RuVector PostgreSQL connection string |
| `NOSTR_PRIVATE_KEY` | Server-side Nostr signing key (hex) |
| `GITHUB_TOKEN` | GitHub token for ontology PR creation |
| `GITHUB_OWNER` / `GITHUB_REPO` / `GITHUB_BASE_PATH` | Logseq source repository |
| `VITE_VIRCADIA_ENABLED` | Enable Vircadia multi-user (`true`/`false`) |
| `VITE_VIRCADIA_SERVER_URL` | Vircadia World Server WebSocket URL |
| `LIVEKIT_URL` / `LIVEKIT_API_KEY` / `LIVEKIT_API_SECRET` | Voice routing |
| `VITE_QUEST3_ENABLE_HAND_TRACKING` | Enable Quest 3 hand tracking |
Full reference: Environment Variables
```
VisionClaw/
├── src/                      # Rust backend (427 files, 175K LOC)
│   ├── actors/               # 21 Actix actors (GPU compute + services)
│   ├── adapters/             # Neo4j, Whelk, CUDA, JSS, RuVector adapters
│   ├── handlers/             # HTTP/WebSocket request handlers (CQRS)
│   ├── services/             # Business logic (ontology, voice, agents)
│   ├── ports/                # Trait definitions (9 hexagonal boundaries)
│   ├── gpu/                  # CUDA kernel bridge, memory, streaming
│   ├── ontology/             # OWL parser, Whelk bridge, physics integration
│   └── config/               # Configuration management
├── client/                   # React frontend (370 files, 96K LOC)
│   ├── src/
│   │   ├── features/         # 13 feature modules (graph, settings, etc.)
│   │   ├── services/         # Voice, WebSocket, Nostr auth, Solid integration
│   │   ├── rendering/        # Custom TSL/WebGL materials, post-processing
│   │   └── immersive/        # Babylon.js XR mode
│   └── crates/scene-effects/ # Rust WASM crate — zero-copy scene FX
├── multi-agent-docker/       # AI agent orchestration container
│   ├── skills/               # 83 agent skill modules
│   ├── mcp-infrastructure/   # MCP servers, config, tool registration
│   └── management-api/       # Agent lifecycle management
├── docs/                     # Diataxis documentation (106 files, 46 with Mermaid)
│   ├── tutorials/            # Getting started
│   ├── how-to/               # Operational guides
│   ├── explanation/          # Architecture deep-dives
│   ├── reference/            # API, protocol, schema specs
│   ├── adr/                  # Architecture Decision Records
│   └── KNOWN_ISSUES.md       # Active P1/P2 bugs
├── tests/                    # Integration tests
├── config/                   # LiveKit, deployment config
└── scripts/                  # Build, migration, embedding ingestion scripts
```
See the Contributing Guide for development workflow, branching conventions, and coding standards.
Before contributing: Check Known Issues — the Ontology Edge Gap (ONT-001) and V4 delta instability (WS-001) are active P1/P2 bugs that may affect your work area.
Mozilla Public License 2.0 — Use commercially, modify freely, share changes to MPL files.
VisionClaw is built by DreamLab AI Consulting.



