Own your intelligence.

Open-source tools for self-hosted AI infrastructure.

shipping

nollama

Sovereign inference

The model runner Ollama should have been. Plain GGUFs. Zero overhead. Multi-GPU. Federation across your fleet.

View on GitHub →
launching

seidr

Sovereign memory

Cognitive memory system with A2M protocol support. Hybrid vector search, GPU-accelerated embeddings, and PostgreSQL at the core.

View on GitHub →
in development

hirdforge

Sovereign agents

Multi-agent orchestration under human sovereignty. Kubernetes-native warbands, GitOps pipelines, and infrastructure-enforced authority.

Coming soon →

The stack

Three layers. One philosophy. Everything runs on your hardware.

Click a layer to explore.

Your hardware. Your network. Your authority.

Three commands to inference

No Docker. No account. No blob store.

nollama
# Install the runtime$ nollama runtime install# Pull a model from HuggingFace$ nollama pull unsloth/Qwen3-8B-GGUF:Q4_K_M# Run inference$ nollama serve && nollama load Qwen3-8B-Q4_K_M.gguf✓ Model loaded on GPU 0 (RTX 3090, 24GB) — port 11435✓ OpenAI-compatible API ready at http://localhost:11434

Secure user directed artificially intelligent action.

Your models on your hardware. Your memory in your database. Your agents under your authority. No cloud dependency. No vendor lock-in. No permission needed.