We’re releasing Gemma 4 quantization-aware training checkpoints, reducing memory requirements and improving on-device performance.

Q4_0 and mobile

An RTX 3050 paired with 16 GB of RAM or more now seems to be very usable, mainly with Unsloth's 26B A4B.
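As a rough sanity check on the 16 GB figure, here is a back-of-the-envelope sketch of the weight footprint at Q4_0. It relies on the GGUF Q4_0 layout (blocks of 32 weights stored as 16 bytes of 4-bit values plus one 2-byte fp16 scale, so 4.5 bits per weight). Everything else is an assumption for illustration: "26B" is read as 26e9 parameters, every tensor is treated as Q4_0, and the helper name `q4_0_weight_bytes` is made up. Real checkpoints typically keep some tensors at higher precision, and the KV cache and activations come on top, so treat the result as an approximation rather than a measured number.

```python
def q4_0_weight_bytes(n_params: int) -> int:
    """Approximate bytes needed to store n_params weights in GGUF Q4_0."""
    block_size = 32       # weights per Q4_0 block
    bytes_per_block = 18  # 16 bytes of packed 4-bit values + 2-byte fp16 scale
    n_blocks = -(-n_params // block_size)  # ceiling division
    return n_blocks * bytes_per_block


# Assumption: "26B" means 26e9 total parameters, all stored as Q4_0.
total_params = 26_000_000_000
size_gb = q4_0_weight_bytes(total_params) / 1e9
print(f"approx. weight size: {size_gb:.1f} GB")
```

Under these assumptions the weights alone come out a little under 15 GB, which would fit with the observation that 16 GB is about the point where the model becomes usable, with part of it offloaded to the GPU.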