Reference for Aspose.LLM for .NET context parameters: context size, batch, threading, RoPE/YaRN scaling, flash attention, KV cache types. ... attention is a fused-kernel optimization that reduces memory and ... performance timings. A micro-optimization for high-throughput production ...