Fit Aspose.LLM for .NET into a tight memory budget — small model, short context, KV cache quantization, aggressive offload, and memory mapping.... Apply a license . Pick a small model Start...Quantize the KV cache using Aspose.LLM.Abstractions.Models ; preset...