<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Model Optimization on AI VOID</title><link>https://ai-blog.noorshomelab.dev/categories/model-optimization/</link><description>Recent content in Model Optimization on AI VOID</description><generator>Hugo</generator><language>en</language><lastBuildDate>Sun, 07 Jun 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://ai-blog.noorshomelab.dev/categories/model-optimization/index.xml" rel="self" type="application/rss+xml"/><item><title>Quantization-Aware Training (QAT): Preserving Accuracy at the Edge</title><link>https://ai-blog.noorshomelab.dev/gemma-4-qat-guide-2026/quantization-aware-training-deep-dive/</link><pubDate>Sun, 07 Jun 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/gemma-4-qat-guide-2026/quantization-aware-training-deep-dive/</guid><description>&lt;p&gt;Deploying powerful AI models like Google&amp;rsquo;s Gemma 4 on everyday devices such as mobile phones and laptops presents a significant challenge. These environments often lack the vast computational resources of data centers. How can we make large language models (LLMs) both powerful and practical for edge deployment without sacrificing their intelligence?&lt;/p&gt;
&lt;p&gt;This chapter introduces you to Quantization-Aware Training (QAT), a critical technique that optimizes AI models for efficiency while preserving their accuracy. We&amp;rsquo;ll explore QAT&amp;rsquo;s core principles, understand why it&amp;rsquo;s superior for complex models like Gemma 4, and guide you through practical steps to leverage Gemma 4 QAT checkpoints for your own high-performance, edge-ready applications.&lt;/p&gt;</description></item><item><title>Gemma 4 QAT: Efficient AI for Edge Devices</title><link>https://ai-blog.noorshomelab.dev/gemma-4-qat-guide-2026/</link><pubDate>Sun, 07 Jun 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/gemma-4-qat-guide-2026/</guid><description>&lt;p&gt;This comprehensive guide empowers developers to leverage Gemma 4 QAT models for optimized on-device AI. Dive into Quantization-Aware Training (QAT) from its foundational principles, understanding how it dramatically compresses models for mobile and laptop efficiency. Explore practical steps, benchmarks, and real-world use cases to seamlessly integrate these new, high-performance checkpoints into your applications.&lt;/p&gt;</description></item></channel></rss>