Primer on GGUF quantization levels — Q4, Q5, Q8, F16, BF16, IQ variants. What to pick for your memory budget and quality target....Quantization reduces the precision of model weights from the full-precision...approximate; actual size depends on model architecture and specific quantizer...