defilantech/LLMKube

Kubernetes operator for local LLM inference with llama.cpp, vLLM, TGI, and mlx-server — multi-GPU NVIDIA + Apple Silicon Metal, autoscaling, air-gapped, production-ready

  • Stars: 112
  • Forks: 17
  • Commits: 375
  • Language: Go
  • Awesome lists: 1

Similar repositories

llm-d/llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

3250 stars · Shell · 1 awesome list

kaito-project/aikit

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

524 stars · Go · 2 awesome lists

llama-farm/llamafarm

Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes

829 stars · Python · 1 awesome list

mudler/LocalAI

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

46491 stars · Go · 5 awesome lists

Tracked growth

2 captures since 2026-05-23

Latest capture 2026-05-28 03:05

[Chart: Stars history (total stars)]

[Chart: Commits history (default branch commits)]

Metadata

  • Created: 2025-11-12
  • First commit: 2025-11-17
  • Last pushed: 2026-05-28
  • Website: https://llmkube.com
  • Archived: no
  • Stack detected: —
  • License: Apache-2.0

AI development signals

AI agent config detected

1 config path, 1 file, 0 directories
Agent instructions

Key config paths

  • file AGENTS.md

Appears in