Running a vLLM GPU Workload on k3s
A record of preparing an operations cluster to run the Code Place AI assistant, covering the NVIDIA driver, the NVIDIA container runtime, the k3s containerd configuration, the RuntimeClass, and the device plugin.
Featured, Kubernetes, k3s, vLLM, CUDA, Infra