<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Memory Footprint on AI VOID</title><link>https://ai-blog.noorshomelab.dev/tags/memory-footprint/</link><description>Recent content in Memory Footprint on AI VOID</description><generator>Hugo</generator><language>en</language><lastBuildDate>Wed, 21 Jan 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://ai-blog.noorshomelab.dev/tags/memory-footprint/index.xml" rel="self" type="application/rss+xml"/><item><title>How AI Model Quantization Works: Deep Dive into Internals</title><link>https://ai-blog.noorshomelab.dev/how-it-works/ai-model-quantization/</link><pubDate>Wed, 21 Jan 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/how-it-works/ai-model-quantization/</guid><description>&lt;h2 id="introduction"&gt;Introduction&lt;/h2&gt;
&lt;p&gt;In the rapidly evolving world of artificial intelligence, the deployment of powerful neural networks into real-world applications often hits a bottleneck: their immense computational and memory requirements. AI model quantization is a critical optimization technique designed to address this challenge. It allows large, complex models—trained using high-precision floating-point numbers—to be compressed and executed efficiently on resource-constrained devices, from smartphones and IoT sensors to specialized AI accelerators.&lt;/p&gt;
&lt;p&gt;Understanding the internals of quantization is no longer a niche skill but a fundamental requirement for AI engineers and researchers aiming to build performant and deployable AI systems. It bridges the gap between theoretical model development and practical application, enabling faster inference times, reduced memory footprints, and lower power consumption.&lt;/p&gt;</description></item></channel></rss>