Deploy Aspose.LLM for .NET on NVIDIA GPUs — select CUDA, offload all layers, configure multi-GPU split, and size VRAM for model + KV cache....full offload The common case: one GPU, whole model on it. using...SendMessageAsync ( "Explain Kubernetes in one paragraph." ); Console . WriteLine...