Repository profile

defilantech/LLMKube

Kubernetes operator for self-hosted LLM inference across a heterogeneous GPU fleet: NVIDIA CUDA, AMD Vulkan, and Apple Silicon Metal. Runtimes: llama.cpp, vLLM, TGI, mlx-server. Multi-GPU sharding, model caching, OpenAI-compatible endpoints. Apache-2.0, run across homelab and on-prem fleets, actively developed.

Go · Apache-2.0 · main · Stack scanned · README.md

Stars: 162
Forks: 24
Watchers: 1
Issues: 60
Commits: 630
Awesome lists: 1

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-07-07 03:12

Star growth, last 7 days: No 7-day history
Commit velocity, last 7 days: No 7-day history
Stars since baseline: +56
Snapshot coverage: 5

Tracked growth

5 captures since 2026-05-23

Stars from baseline +56

Charts (all tracked data): stars history (total stars) and commits history (default-branch commits).

Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-07-07 03:12

Stack signals: 2
Package managers: 1
Manifest files: 3
Dependencies: 217

Frameworks and tools

Cobra: CLI framework · high confidence
gRPC Go: RPC framework · high confidence

Package manager: Go modules (Go)

Dependency files

3 manifests

go.mod: Go ecosystem, 93 dependencies
go.sum: Go ecosystem, 124 dependencies
pkg/foreman/agent/testdata/mutation/weak/go.mod: Go ecosystem, 0 dependencies

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics: 20
Tags: 0
Stacks: 2

Topics

#ai #apple-silicon #autoscaling #edge-computing #gguf #gpu #homelab #inference #kubernetes #kubernetes-operator #llama-cpp #llm #local-llm #metal #mlx #multi-gpu #nvidia #self-hosted #tgi #vllm

Generated tags

No generated tags yet.

Stack labels

Cobra gRPC Go

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

1 path

AI agent config detected

1 config path · 1 file · 0 directories

Agent instructions

Key config paths

file AGENTS.md

Similar repositories

Nearest indexed repositories by embedding similarity.

llm-d/llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

3,800 stars

Shell · 1 awesome list

ggml-org/llama.cpp

LLM inference in C/C++

119,175 stars

C++ · 2 awesome lists

kaito-project/aikit

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

531 stars

Go · 2 awesome lists

EricLBuehler/candle-vllm

Efficient platform for inference and serving local LLMs including an OpenAI compatible API server.

691 stars

Rust · 1 awesome list

mostlygeek/llama-swap

Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc

5,001 stars

Go · 2 awesome lists

paperclipinc/openclaw-operator

Kubernetes operator for deploying and managing OpenClaw AI agent instances with production-grade security, observability, and lifecycle management.

388 stars

Go · 1 awesome list

Metadata

Language: Go
License: Apache-2.0
Default branch: main
Created: 2025-11-12
First commit: 2025-11-17
Last pushed: 2026-07-06
GitHub updated: 2026-07-06
Last synced: 2026-07-07 03:12
Stack detected: 2026-07-07 03:12
Archived: no

Links and files

GitHub: defilantech/LLMKube
Website: https://llmkube.com
README: README.md

Appears in

Awesome Selfhosted
