Sovereignty Labs

Own your intelligence.

Open-source tools for self-hosted AI infrastructure

shipping

nollama

Sovereign inference

The model runner Ollama should have been. Plain GGUFs. Zero overhead. Multi-GPU. Federation across your fleet.

View on GitHub
in development

seidr

Sovereign memory

Hybrid vector search with GPU-accelerated embeddings, four-signal ranking, and PostgreSQL at the core.

Coming soon
in development

hirdforge

Sovereign agents

Multi-agent orchestration under human sovereignty. Kubernetes-native. Your warbands, your workflows, your authority.

Coming soon
~
# Install the runtime
$ nollama runtime install

# Pull a model from HuggingFace
$ nollama pull unsloth/Qwen3-8B-GGUF:Q4_K_M

# Run inference
$ nollama serve && nollama load Qwen3-8B-Q4_K_M.gguf
✓ Model loaded on GPU 0 (RTX 3090, 24GB) — port 11435
✓ OpenAI-compatible API ready at http://localhost:11434

# No Docker. No account. No blob store. Just inference.

Secure user directed artificially intelligent action.

Your models on your hardware. Your memory in your database. Your agents under your authority.

No cloud dependency. No vendor lock-in.
No permission needed.

— Sovereignty Labs