Run Aspose.LLM for .NET on NVIDIA GPUs with CUDA acceleration — driver requirements, compute capability, single and multi-GPU setup....engine places the offloaded layers on GPU 0. preset . BinaryManagerParamet...LlamaSplitMode . LLAMA_SPLIT_MODE_LAYER ; preset . BaseModelInferencePa...