Under the Hood
Docker. Gemini 2.5. Ollama. Rust. A sovereign AI stack that runs on your hardware with multi-model intelligence you control.
Intelligence Layer
Multi-Model Intelligence
Whity routes each task to the best model automatically β cloud for power, local for privacy. Bring your own models and API keys. You're never locked in.
Add any API key in Settings β Gemini, Anthropic, Mistral, Groq, OpenRouter, or any provider. As new models launch, plug them in without waiting for a Whity update.
Pull any model from the Ollama library β Mistral, Phi, Deepseek, CodeLlama, Gemma, and 100+ more. Your Mac, your models, your choice.
Architecture
How Whity thinks
A recursive self-improvement loop with intelligent model routing.
Tech Stack
Batteries included. No bloat.
Everything under the hood β from Docker to Rust.
Python 3.12 Core
Async-first architecture. FastAPI + Uvicorn API layer. Fully type-hinted codebase with structured logging.
python 3.12 Β· fastapi Β· uvicornDocker Engine
Multi-architecture images (ARM64 + AMD64). One-click DMG installer. Container isolation with security constraints. Works on Intel and Apple Silicon.
multi-arch Β· one-click installOllama Local Models
Qwen 2.5 7B and Llama 3.2 3B pre-installed. Pull any model from the Ollama library. Cold-start procurement on first boot.
ollama Β· qwen Β· llamaSQLite + Vector Store
Embedded database for zero-config persistence. Nomic embeddings for semantic memory and intelligent retrieval across conversations.
sqlite-vec Β· nomic-embedRust Security Gate
Native compiled binary (whity_core_rs). Source code compiled to unreadable machine code. IP protection built into the engine.
rust Β· maturin Β· compiled .soUltrawork Engine
Parallel sub-agent dispatch for complex tasks. Concurrent execution via asyncio. Breaks big jobs into parallel streams.
async Β· parallel dispatchHands System
Modular tool plugins β Scout (web search), Reporter (document generation), Auditor (code review). TOML-configured skill definitions.
scout Β· reporter Β· auditorLiveKit Voice
Real-time voice interaction via WebRTC. Push-to-talk or always-on mode. Low-latency audio processing with LiveKit infrastructure.
webrtc Β· livekit Β· real-timeRemote Control
Control Whity from anywhere via Telegram. Send commands, receive updates, and manage your assistant remotely with bot integration.
@whity/telegramGigaBrain Deep Reasoning
Multi-pass analysis mode for complex problems. Extended context windows up to 2M tokens. Automatic escalation for hard tasks.
gemini-pro Β· deep analysisCost Ledger
Per-model token accounting with USD tracking. Budget-aware routing. Full transparency on what each query costs.
token tracking Β· cost-awareDesktop Agent
Let Whity drive the keyboard and mouse to operate real apps, forms, and workflows on your desktop when you enable it.
desktop-agent Β· pyautoguiSystem Requirements
What you need
Whity runs on any Mac β Intel or Apple Silicon.
Supports both ARM64 (Apple Silicon) and AMD64 (Intel) architectures. Windows and Linux versions in development.
Ready to build?
Get early access.
Get in early at special introductory offer.