Five compact recipes for common Aspose.LLM for .NET tasks — first message, image input, session save/restore, CPU-only run, and CUDA GPU run....your first message Ask the model a single question and print...than GPU inference; a 7B Q4 model typically produces 5-15 tokens/second...