Primer on GGUF quantization levels — Q4, Q5, Q8, F16, BF16, IQ variants. What to pick for your memory budget and quality target....or BF16) to fewer bits per value. Smaller weights mean smaller...(very aggressive) ~0.15× Large Values are approximate; actual size...