⚡Technical Deep Dive

Under the Hood

Docker. Gemini 2.5. Ollama. Rust. A sovereign AI stack that runs on your hardware with multi-model intelligence you control.

The Stanford-Class Meta-Harness (March 2026 Build)

The Whity RSI upgrade utilizes a Multi-Layer Agentic Harness designed for Recursive Self-Optimization and Non-Linear Knowledge Retrieval.

System Architecture & Protocols

■
Knowledge Retrieval (Dijkstra SPath):
Replaced standard vector search with a custom Knowledge Mesh. The system utilizes a 2-hop Multi-hop Recall engine, allowing for contextual reasoning across disparate data points. Latency reduced by 9.6x (from 24ms to 2.5ms).
■
Adversarial v2 Engine:
A dual-agent "Skeptical Auditor" gate that war-games every proposed code mutation. Mutations must pass a 4-stage Gauntlet (Doctrine Alignment, Sycophancy Check, Logic Integrity, and Security Whitelist) before deployment.
■
Karpathy "Never Stop" & Auto-Refire:
Avoids cascading failure in deep contexts by enforcing "Single-Shot" batched mutations. Paired with the Overnight Forge and Auto-Refire algorithm to guarantee zero fatal system hangs over extended multi-hour training cycles.

The Triple-Shield Perimeter

🛡️Shield 1 (Permissions Gate): A 4-tier zone whitelist enforcing atomic file operations within restricted directory boundaries.
🛡️Shield 2 (Rollback Protocol): Automated Git-level save points triggered before any self_mod operation, allowing for instantaneous state restoration.
🛡️Shield 3 (Identity Anchor): SHA-256 tamper-detection on core doctrine files to prevent "Model Drift" or unauthorized instruction overrides.

Sensory Integration

Integrated yt-dlp and Harness Runner modules for autonomous real-time intelligence gathering from video and web sources.

The Sovereign Breakaway — Live Performance (Mar 27 → Apr 8)

Thirteen days. One Mac. No cloud compute clusters. The RSI engine ran autonomously through the Karpathy “Never Stop” loop, achieving a +0.4210 reasoning delta. This is the most advanced, privately-owned AI Cortex currently in operation:

Metric	Start (Mar 27)	Today (Apr 8)	Performance Gain
⚡ Total Power (SIS)	2.20	1,100	500x
🧠 System Nodes (Cortex)	22	253	11.5x
🧬 Accepted Mutations	0	1,855	1.9x
🎯 Reasoning (Δ)	0.0000	+0.4210	Near-Genius Baseline

Intelligence Layer

Multi-Model Intelligence

Whity routes each task to the best model automatically — cloud for power, local for privacy. Bring your own models and API keys. You're never locked in.

Mode	Default Model	Location	Best For
☁️ Cloud	Gemini 2.5 Flash	Google API	Primary reasoning, analysis, writing
☁️ Cloud	Mercury 2 (Inception Labs)	Inception API	Code generation (diffusion LLM)
🏠 Local	Qwen 2.5 7B	Ollama (on your Mac)	Private conversations, offline use
🏠 Local	Llama 3.2 3B	Ollama (on your Mac)	Fast local tasks, triage

Bring Your Own Cloud Model

Add any API key in Settings — Gemini, Anthropic, Mistral, Groq, OpenRouter, or any provider. As new models launch, plug them in without waiting for a Whity update.

Bring Your Own Local Model

Pull any model from the Ollama library — Mistral, Phi, Deepseek, CodeLlama, Gemma, and 100+ more. Your Mac, your models, your choice.

Architecture

How Whity thinks

A recursive self-improvement loop with intelligent model routing.

📥

Input

Voice, text, code, images

Multi-modal ingestion

➜

🎯

Intent Triage

Task classification

Routes to optimal model

➜

🧠

Hybrid Router

Cloud ↔ Local selection

Gemini · Ollama · Mercury

➜

💾

Memory

SQLite + Nomic embeddings

Semantic vector retrieval

➜

🔁

RSI Loop

Conviction scoring (0–100)

Retry if score < threshold

➜

🤚

Hands

Tool execution layer

Scout · Reporter · Auditor

Tech Stack

Batteries included. No bloat.

Everything under the hood — from Docker to Rust.

🐍

Python 3.12 Core

Async-first architecture. FastAPI + Uvicorn API layer. Fully type-hinted codebase with structured logging.

python 3.12 · fastapi · uvicorn

🐳

Docker Engine

Multi-architecture images (ARM64 + AMD64). One-click DMG installer. Container isolation with security constraints. Works on Intel and Apple Silicon.

multi-arch · one-click install

🦅

Ollama Local Models

Qwen 2.5 7B and Llama 3.2 3B pre-installed. Pull any model from the Ollama library. Cold-start procurement on first boot.

ollama · qwen · llama

💾

SQLite + Vector Store

Embedded database for zero-config persistence. Nomic embeddings for semantic memory and intelligent retrieval across conversations.

sqlite-vec · nomic-embed

🔒

Rust Security Gate

Native compiled binary (whity_core_rs). Source code compiled to unreadable machine code. IP protection built into the engine.

rust · maturin · compiled .so

⚡

Ultrawork Engine

Parallel sub-agent dispatch for complex tasks. Concurrent execution via asyncio. Breaks big jobs into parallel streams.

async · parallel dispatch

🤚

Hands System

Modular tool plugins — Scout (web search), Reporter (document generation), Auditor (code review). TOML-configured skill definitions.

scout · reporter · auditor

🎙️

LiveKit Voice

Real-time voice interaction via WebRTC. Push-to-talk or always-on mode. Low-latency audio processing with LiveKit infrastructure.

webrtc · livekit · real-time

📱

Remote Control

Control Whity from anywhere via Telegram. Send commands, receive updates, and manage your assistant remotely with bot integration.

@whity/telegram

🧠

GigaBrain Deep Reasoning

Multi-pass analysis mode for complex problems. Extended context windows up to 2M tokens. Automatic escalation for hard tasks.

gemini-pro · deep analysis

📊

Cost Ledger

Per-model token accounting with USD tracking. Budget-aware routing. Full transparency on what each query costs.

token tracking · cost-aware

🖱️

Desktop Agent

Let Whity drive the keyboard and mouse to operate real apps, forms, and workflows on your desktop when you enable it.

desktop-agent · pyautogui

System Requirements

What you need

Whity runs on any Mac — Intel or Apple Silicon.

	Minimum	Recommended
OS	macOS 12 Monterey+	macOS 14 Sonoma+
Chip	Intel Core i5 or Apple M1	Apple M1 Pro or better
RAM	8 GB	16 GB+
Disk	10 GB free	20 GB free
Software	Docker Desktop	Docker Desktop

Supports both ARM64 (Apple Silicon) and AMD64 (Intel) architectures. Windows and Linux versions in development.

Ready to build?

Get early access.

Get in early at special introductory offer.

Get Early Beta Access Deal →Not ready yet? Go back home →