Configuring Caddy as a gRPC reverse proxy for self-hosted AI inference APIs with automatic TLS and load balancing
Why I Built This Setup

I run several AI inference services on my Proxmox cluster, mostly LLM APIs and embedding models exposed over gRPC. The problem was...