Deploy Aspose.LLM for .NET on NVIDIA GPUs — select CUDA, offload all layers, configure multi-GPU split, and size VRAM for model + KV cache....var to hide GPUs from the process: CUDA_VISIBLE_DEVICES = 1 dotnet...180-250+ Numbers vary with batch size, context depth, driver...