<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>PyTorch on AI VOID</title><link>https://ai-blog.noorshomelab.dev/tags/pytorch/</link><description>Recent content in PyTorch on AI VOID</description><generator>Hugo</generator><language>en</language><lastBuildDate>Fri, 20 Mar 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://ai-blog.noorshomelab.dev/tags/pytorch/index.xml" rel="self" type="application/rss+xml"/><item><title>Representing Reality: From Raw Data to Embeddings</title><link>https://ai-blog.noorshomelab.dev/multimodal-ai-guide-2026/representing-reality-raw-data-to-embeddings/</link><pubDate>Fri, 20 Mar 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/multimodal-ai-guide-2026/representing-reality-raw-data-to-embeddings/</guid><description>&lt;p&gt;Welcome back, future multimodal AI maestros! In our previous chapter, we explored the exciting world of multimodal AI and its incredible potential. Now, it&amp;rsquo;s time to dive deeper and understand the fundamental step that makes all this magic possible: transforming the messy, diverse &amp;ldquo;real world&amp;rdquo; data into a language our AI models can understand.&lt;/p&gt;
&lt;p&gt;This chapter is all about &lt;strong&gt;representing reality&lt;/strong&gt;. We&amp;rsquo;ll learn how raw inputs like text, images, audio, and video, which seem so different to us, are converted into a common, numerical format called &lt;strong&gt;embeddings&lt;/strong&gt;. Think of it as teaching your AI system to &amp;ldquo;see,&amp;rdquo; &amp;ldquo;hear,&amp;rdquo; and &amp;ldquo;read&amp;rdquo; by giving it a universal dictionary of meaning. Mastering this concept is crucial, as it forms the bedrock for any multimodal system you&amp;rsquo;ll ever build.&lt;/p&gt;</description></item><item><title>Chapter 5: Model Training, Evaluation &amp;amp; Hyperparameter Tuning</title><link>https://ai-blog.noorshomelab.dev/ai-ml-career-path-2026/model-training-evaluation/</link><pubDate>Sat, 17 Jan 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/ai-ml-career-path-2026/model-training-evaluation/</guid><description>&lt;h2 id="introduction-sharpening-your-models-skills"&gt;Introduction: Sharpening Your Model&amp;rsquo;s Skills&lt;/h2&gt;
&lt;p&gt;Welcome back, future AI/ML expert! In previous chapters, we laid the groundwork by understanding the mathematical and programming foundations, exploring data, and even building our first simple models. But a model, no matter how well-designed, is just potential until it&amp;rsquo;s properly trained and evaluated.&lt;/p&gt;
&lt;p&gt;This chapter is where your models truly come to life. We&amp;rsquo;ll embark on a journey through the heart of machine learning: the training process. You&amp;rsquo;ll learn how to teach your models to identify patterns, how to objectively measure their performance, and most importantly, how to fine-tune them to achieve peak effectiveness. Think of it as guiding your model through a rigorous education, complete with exams and personalized study plans!&lt;/p&gt;</description></item><item><title>Integrating with ML Frameworks (PyTorch/TensorFlow)</title><link>https://ai-blog.noorshomelab.dev/metadataflow-guide-2026/08-integrating-ml-frameworks/</link><pubDate>Wed, 28 Jan 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/metadataflow-guide-2026/08-integrating-ml-frameworks/</guid><description>&lt;h2 id="integrating-with-ml-frameworks-pytorchtensorflow"&gt;Integrating with ML Frameworks (PyTorch/TensorFlow)&lt;/h2&gt;
&lt;p&gt;Welcome back, data adventurers! In our previous chapters, you&amp;rsquo;ve mastered the fundamentals of Meta AI&amp;rsquo;s powerful new dataset management library, understanding how it helps organize, clean, and version your precious data. You&amp;rsquo;ve seen its robust features for handling various data types and preparing them for the machine learning journey. But what&amp;rsquo;s the ultimate goal of perfectly managed data? To feed it into your machine learning models, of course!&lt;/p&gt;</description></item><item><title>Chapter 8: Recurrent Neural Networks (RNNs) for Sequence Data</title><link>https://ai-blog.noorshomelab.dev/ai-ml-career-path-2026/recurrent-neural-networks/</link><pubDate>Sat, 17 Jan 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/ai-ml-career-path-2026/recurrent-neural-networks/</guid><description>&lt;h2 id="chapter-8-recurrent-neural-networks-rnns-for-sequence-data"&gt;Chapter 8: Recurrent Neural Networks (RNNs) for Sequence Data&lt;/h2&gt;
&lt;p&gt;Welcome back, future AI engineer! In our previous chapters, we mastered the fundamentals of deep learning with feedforward neural networks (FNNs). We learned how these networks excel at tasks where inputs are independent and fixed in size, like classifying images or predicting a single value from a structured dataset.&lt;/p&gt;
&lt;p&gt;But what happens when the order of your data matters? What if your input isn&amp;rsquo;t a single, fixed-size vector, but a sequence of varying length, where each element&amp;rsquo;s meaning is influenced by what came before it? Think about natural language, where the meaning of a word depends on the preceding words, or time series data, where future values are influenced by past observations. Traditional FNNs hit a wall here because they lack &amp;ldquo;memory&amp;rdquo; and treat each input independently.&lt;/p&gt;</description></item><item><title>Chapter 14: Model Training Workflows &amp;amp; Optimization Techniques</title><link>https://ai-blog.noorshomelab.dev/ai-ml-career-path-2026/training-workflows-optimization/</link><pubDate>Sat, 17 Jan 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/ai-ml-career-path-2026/training-workflows-optimization/</guid><description>&lt;h2 id="introduction-to-model-training-workflows--optimization"&gt;Introduction to Model Training Workflows &amp;amp; Optimization&lt;/h2&gt;
&lt;p&gt;Welcome back, future AI engineer! In the previous chapters, we laid the groundwork by understanding the mathematical foundations of AI, classic machine learning algorithms, and delving into the fascinating world of neural networks and their diverse architectures. You&amp;rsquo;ve learned how to construct these powerful models. But a model, no matter how well-designed, is useless until it learns from data. That&amp;rsquo;s where &lt;strong&gt;model training workflows&lt;/strong&gt; come in.&lt;/p&gt;</description></item><item><title>Chapter 17: Distributed Training &amp;amp; Scaling Deep Learning</title><link>https://ai-blog.noorshomelab.dev/ai-ml-career-path-2026/distributed-training/</link><pubDate>Sat, 17 Jan 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/ai-ml-career-path-2026/distributed-training/</guid><description>&lt;h2 id="chapter-17-distributed-training--scaling-deep-learning"&gt;Chapter 17: Distributed Training &amp;amp; Scaling Deep Learning&lt;/h2&gt;
&lt;p&gt;Welcome back, future AI architect! In our journey so far, we&amp;rsquo;ve built a strong foundation in deep learning, mastering neural network architectures, understanding training workflows, and optimizing models. We&amp;rsquo;ve even considered how powerful hardware like GPUs accelerate our tasks. But what happens when your model becomes so massive it won&amp;rsquo;t fit on a single GPU? Or when your dataset is so enormous that training takes weeks, even on the most powerful single machine?&lt;/p&gt;</description></item><item><title>Chapter 21: Project: Building a Custom Image Classifier</title><link>https://ai-blog.noorshomelab.dev/ai-ml-career-path-2026/project-image-classifier/</link><pubDate>Sat, 17 Jan 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/ai-ml-career-path-2026/project-image-classifier/</guid><description>&lt;h2 id="introduction"&gt;Introduction&lt;/h2&gt;
&lt;p&gt;Welcome to Chapter 21! After exploring the theoretical foundations of deep learning, neural networks, and various architectures, it&amp;rsquo;s time to get your hands dirty with a complete, practical project. In this chapter, we&amp;rsquo;ll build a custom image classifier from scratch, leveraging the power of modern deep learning frameworks and techniques.&lt;/p&gt;
&lt;p&gt;This project will guide you through the entire lifecycle of an image classification task: from preparing your own dataset, to selecting and modifying a pre-trained model, training it, and evaluating its performance. By the end, you&amp;rsquo;ll not only have a working image classifier but also a much deeper understanding of the practical considerations involved in real-world deep learning applications. This is a foundational skill for any aspiring AI/ML engineer or researcher, opening doors to advanced computer vision tasks.&lt;/p&gt;</description></item><item><title>Chapter 23: Project: Fine-Tuning an LLM for a Specific Task</title><link>https://ai-blog.noorshomelab.dev/ai-ml-career-path-2026/project-llm-fine-tuning/</link><pubDate>Sat, 17 Jan 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/ai-ml-career-path-2026/project-llm-fine-tuning/</guid><description>&lt;h2 id="chapter-23-project-fine-tuning-an-llm-for-a-specific-task"&gt;Chapter 23: Project: Fine-Tuning an LLM for a Specific Task&lt;/h2&gt;
&lt;h3 id="introduction"&gt;Introduction&lt;/h3&gt;
&lt;p&gt;Welcome to an exciting hands-on chapter where we&amp;rsquo;ll dive deep into the practical art of fine-tuning Large Language Models (LLMs)! You&amp;rsquo;ve learned about the power of these models, their architectures, and how they process language. Now, it&amp;rsquo;s time to make them truly yours by adapting them to perform a specific task that their general pre-training might not have fully covered.&lt;/p&gt;</description></item><item><title>UniFace Concepts: Face Biometrics</title><link>https://ai-blog.noorshomelab.dev/guides/uniface-mastery-guide/</link><pubDate>Wed, 11 Mar 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/guides/uniface-mastery-guide/</guid><description>&lt;h2 id="welcome-to-the-uniface-concepts-mastery-guide"&gt;Welcome to the UniFace Concepts Mastery Guide!&lt;/h2&gt;
&lt;p&gt;Are you fascinated by the power of face biometrics? Do you want to understand how cutting-edge systems recognize faces, verify identities, and build secure applications? This guide is your comprehensive pathway to mastering the advanced techniques and principles embodied by &amp;ldquo;UniFace&amp;rdquo; in the realm of open-source face biometrics.&lt;/p&gt;
&lt;h3 id="what-are-uniface-concepts"&gt;What are UniFace Concepts?&lt;/h3&gt;
&lt;p&gt;The term &amp;ldquo;UniFace&amp;rdquo; primarily refers to innovative &lt;em&gt;concepts&lt;/em&gt; and &lt;em&gt;algorithms&lt;/em&gt;, particularly the &lt;strong&gt;Unified Cross-Entropy Loss&lt;/strong&gt;, which has significantly advanced the field of deep face recognition. Unlike a single, monolithic software toolkit with a standalone installation, UniFace represents a collection of state-of-the-art methodologies for training highly accurate and robust face recognition models.&lt;/p&gt;</description></item><item><title>AI/ML Engineering: A Zero-to-Advanced Career Path</title><link>https://ai-blog.noorshomelab.dev/guides/ai-ml-career-path-guide/</link><pubDate>Sat, 17 Jan 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/guides/ai-ml-career-path-guide/</guid><description>&lt;h1 id="mastering-aiml-engineering-a-zero-to-advanced-career-path"&gt;Mastering AI/ML Engineering: A Zero-to-Advanced Career Path&lt;/h1&gt;
&lt;p&gt;Welcome, future AI/ML engineer or researcher! You&amp;rsquo;re about to embark on an exhilarating journey into the world of Artificial Intelligence and Machine Learning. This comprehensive guide is meticulously designed to take you from foundational concepts to advanced practical applications, equipping you with the knowledge, skills, and confidence to thrive in this rapidly evolving field.&lt;/p&gt;
&lt;h3 id="what-is-this-guide-about"&gt;What is This Guide About?&lt;/h3&gt;
&lt;p&gt;This learning path is a complete, step-by-step roadmap for anyone aspiring to build a career in core AI and Machine Learning development. We&amp;rsquo;ll start with the essential mathematical and programming foundations, gradually progressing through classical machine learning, deep learning, and cutting-edge neural network architectures. You&amp;rsquo;ll learn about entire training workflows, meticulous data preparation, advanced optimization techniques, robust model evaluation, and specialized topics like fine-tuning large language models (LLMs), understanding embeddings, and working with multimodal models. We&amp;rsquo;ll dive into inference optimization, hardware considerations (CPU/GPU/accelerators), distributed training, experimentation tracking, and crucial debugging strategies. Finally, we&amp;rsquo;ll foster research literacy and instill best practices for responsible AI. Throughout this journey, you&amp;rsquo;ll engage in extensive hands-on projects, utilizing real-world datasets, building and training models from scratch, and developing your independent problem-solving skills.&lt;/p&gt;</description></item><item><title>LLM Quantization: Making Models Lean for Local Deployment</title><link>https://ai-blog.noorshomelab.dev/ai/llm-quantization-mastery/</link><pubDate>Fri, 22 Aug 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/ai/llm-quantization-mastery/</guid><description>&lt;h1 id="llm-quantization-making-models-lean-for-local-deployment"&gt;LLM Quantization: Making Models Lean for Local Deployment&lt;/h1&gt;
&lt;h2 id="table-of-contents"&gt;Table of Contents&lt;/h2&gt;
&lt;ol&gt;
&lt;li&gt;&lt;a href="#introduction-the-need-for-lean-llms"&gt;Introduction: The Need for Lean LLMs&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#what-are-llms-and-why-are-they-so-large"&gt;What are LLMs and Why Are They So Large?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#the-challenge-of-local-deployment"&gt;The Challenge of Local Deployment&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#enter-quantization-a-solution-for-resource-constrained-environments"&gt;Enter Quantization: A Solution for Resource-Constrained Environments&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#understanding-the-basics-what-is-quantization"&gt;Understanding the Basics: What is Quantization?&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#floating-point-numbers-fp32-in-llms"&gt;Floating-Point Numbers (FP32) in LLMs&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#the-concept-of-reduced-precision"&gt;The Concept of Reduced Precision&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#analogy-from-high-definition-to-standard-definition"&gt;Analogy: From High-Definition to Standard-Definition&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#benefits-of-quantization-size-speed-and-energy-efficiency"&gt;Benefits of Quantization: Size, Speed, and Energy Efficiency&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#the-trade-off-accuracy-vs-efficiency"&gt;The Trade-Off: Accuracy vs. Efficiency&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#quantization-techniques-a-deep-dive"&gt;Quantization Techniques: A Deep Dive&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#post-training-quantization-ptq-vs-quantization-aware-training-qat"&gt;Post-Training Quantization (PTQ) vs. Quantization-Aware Training (QAT)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#symmetric-vs-asymmetric-quantization"&gt;Symmetric vs. Asymmetric Quantization&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#per-tensor-vs-per-channel-quantization"&gt;Per-Tensor vs. Per-Channel Quantization&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#common-quantization-bit-widths"&gt;Common Quantization Bit-Widths&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#8-bit-quantization-int8"&gt;8-bit Quantization (INT8)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#4-bit-quantization-int4"&gt;4-bit Quantization (INT4)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#other-bit-widths-eg-2-bit-3-bit-5-bit"&gt;Other Bit-Widths (e.g., 2-bit, 3-bit, 5-bit)&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#specific-quantization-algorithms-and-formats"&gt;Specific Quantization Algorithms and Formats&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#gptq-general-purpose-parameter-quantization"&gt;GPTQ (General-purpose Parameter Quantization)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#awq-activation-aware-weight-quantization"&gt;AWQ (Activation-aware Weight Quantization)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#gguf-gpt-generated-unified-format-a-key-for-llamacpp-and-ollama"&gt;GGUF (GPT-Generated Unified Format): A Key for &lt;code&gt;llama.cpp&lt;/code&gt; and Ollama&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#gguf-quantization-types-q2_k-q3_k-q4_k-q5_k-q6_k-q8_0"&gt;GGUF Quantization Types (Q2_K, Q3_K, Q4_K, Q5_K, Q6_K, Q8_0)&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#practical-implementation-quantizing-llms"&gt;Practical Implementation: Quantizing LLMs&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#using-bitsandbytes-for-quantization-aware-training-and-inference-pytorch"&gt;Using &lt;code&gt;bitsandbytes&lt;/code&gt; for Quantization-Aware Training and Inference (PyTorch)&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#installation"&gt;Installation&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#loading-8-bit-models"&gt;Loading 8-bit Models&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#loading-4-bit-models-nf4"&gt;Loading 4-bit Models (NF4)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#integrating-with-hugging-face-transformers"&gt;Integrating with Hugging Face Transformers&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#fine-tuning-4-bit-models-qlora"&gt;Fine-tuning 4-bit Models (QLoRA)&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#leveraging-llamacpp-and-gguf-for-cpu-friendly-inference"&gt;Leveraging &lt;code&gt;llama.cpp&lt;/code&gt; and GGUF for CPU-friendly Inference&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#introduction-to-llamacpp"&gt;Introduction to &lt;code&gt;llama.cpp&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#building-llamacpp"&gt;Building &lt;code&gt;llama.cpp&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#converting-models-to-gguf-format"&gt;Converting Models to GGUF Format&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#quantizing-gguf-models-with-llamacpps-quantize-tool"&gt;Quantizing GGUF Models with &lt;code&gt;llama.cpp&lt;/code&gt;&amp;rsquo;s &lt;code&gt;quantize&lt;/code&gt; tool&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#running-gguf-models-with-llamacpp"&gt;Running GGUF Models with &lt;code&gt;llama.cpp&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#ollama-simplified-local-llm-deployment"&gt;Ollama: Simplified Local LLM Deployment&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#how-ollama-utilizes-gguf"&gt;How Ollama Utilizes GGUF&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#downloading-and-running-quantized-models-with-ollama"&gt;Downloading and Running Quantized Models with Ollama&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#creating-custom-modelfiles-for-quantized-models"&gt;Creating Custom Modelfiles for Quantized Models&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#evaluating-quantization-trade-offs"&gt;Evaluating Quantization Trade-offs&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#model-size-reduction"&gt;Model Size Reduction&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#inference-speed-latency"&gt;Inference Speed (Latency)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#accuracy-metrics-and-evaluation"&gt;Accuracy Metrics and Evaluation&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#perplexity"&gt;Perplexity&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#benchmark-tasks-eg-helm-mmlu"&gt;Benchmark Tasks (e.g., HELM, MMLU)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#qualitative-evaluation"&gt;Qualitative Evaluation&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#hardware-considerations-cpu-vs-gpu"&gt;Hardware Considerations (CPU vs. GPU)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#choosing-the-right-quantization-scheme-for-your-use-case"&gt;Choosing the Right Quantization Scheme for Your Use Case&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#advanced-topics-and-future-directions"&gt;Advanced Topics and Future Directions&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#dynamic-vs-static-quantization"&gt;Dynamic vs. Static Quantization&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#mixed-precision-training-and-inference"&gt;Mixed-Precision Training and Inference&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#fine-grained-quantization-techniques"&gt;Fine-grained Quantization Techniques&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#emerging-quantization-research"&gt;Emerging Quantization Research&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#conclusion"&gt;Conclusion&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#recap-of-key-concepts"&gt;Recap of Key Concepts&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#the-future-of-lean-llms"&gt;The Future of Lean LLMs&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#further-learning-resources"&gt;Further Learning Resources&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;hr&gt;
&lt;h2 id="1-introduction-the-need-for-lean-llms"&gt;1. Introduction: The Need for Lean LLMs&lt;/h2&gt;
&lt;p&gt;The advent of Large Language Models (LLMs) has revolutionized various fields, from natural language processing to creative content generation. Models like GPT-3, LLaMA, Mistral, and many others have demonstrated unprecedented capabilities in understanding and generating human-like text. However, this power comes at a significant cost: immense model size and computational requirements.&lt;/p&gt;</description></item><item><title>Mastering Deep Learning with PyTorch: From Tensors to Advanced Neural Networks for LLMs</title><link>https://ai-blog.noorshomelab.dev/ai/deep-learning-frameworks/</link><pubDate>Fri, 22 Aug 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/ai/deep-learning-frameworks/</guid><description>&lt;h1 id="mastering-deep-learning-with-pytorch-from-tensors-to-advanced-neural-networks-for-llms"&gt;Mastering Deep Learning with PyTorch: From Tensors to Advanced Neural Networks for LLMs&lt;/h1&gt;
&lt;hr&gt;
&lt;h2 id="1-introduction-to-deep-learning-and-pytorch"&gt;1. Introduction to Deep Learning and PyTorch&lt;/h2&gt;
&lt;h3 id="what-is-deep-learning"&gt;What is Deep Learning?&lt;/h3&gt;
&lt;p&gt;Deep learning is a subfield of machine learning inspired by the structure and function of the human brain&amp;rsquo;s neural networks. Instead of explicit programming, deep learning models learn from vast amounts of data, automatically discovering intricate patterns and representations. These models are characterized by their &amp;ldquo;deep&amp;rdquo; architecture, consisting of multiple layers, which allows them to extract hierarchical features from raw data. From recognizing objects in images to understanding human language and generating creative content, deep learning has revolutionized numerous domains.&lt;/p&gt;</description></item><item><title>Mastering LLM Fine-tuning: Pre-training, SFT, and PEFT for Custom Models</title><link>https://ai-blog.noorshomelab.dev/ai/llm-fine-tuning/</link><pubDate>Fri, 22 Aug 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/ai/llm-fine-tuning/</guid><description>&lt;h1 id="llm-pre-training-and-fine-tuning-concepts"&gt;LLM Pre-training and Fine-tuning Concepts&lt;/h1&gt;
&lt;hr&gt;
&lt;h2 id="introduction"&gt;Introduction&lt;/h2&gt;
&lt;p&gt;Large Language Models (LLMs) have revolutionized the field of Artificial Intelligence, demonstrating remarkable capabilities in understanding, generating, and processing human language. These powerful models are at the heart of many cutting-edge applications, from sophisticated chatbots and content generators to complex code assistants. This document serves as a comprehensive guide to understanding the lifecycle of LLMs, from their initial pre-training to the crucial process of fine-tuning them for specific tasks and data.&lt;/p&gt;</description></item><item><title>NLP Fundamentals: Mastering Attention and Transformers for Large Language Models</title><link>https://ai-blog.noorshomelab.dev/ai/natural-language-processing-fundamentals/</link><pubDate>Fri, 22 Aug 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/ai/natural-language-processing-fundamentals/</guid><description>&lt;h1 id="natural-language-processing-fundamentals-from-text-preprocessing-to-transformers"&gt;Natural Language Processing Fundamentals: From Text Preprocessing to Transformers&lt;/h1&gt;
&lt;hr&gt;
&lt;h2 id="1-introduction-to-natural-language-processing"&gt;1. Introduction to Natural Language Processing&lt;/h2&gt;
&lt;h3 id="what-is-nlp"&gt;What is NLP?&lt;/h3&gt;
&lt;p&gt;Natural Language Processing (NLP) is a subfield of artificial intelligence (AI) that focuses on enabling computers to understand, interpret, and generate human language. It&amp;rsquo;s the technology behind everyday applications like spam filters, virtual assistants (Siri, Alexa), machine translation (Google Translate), and sentiment analysis. NLP combines computational linguistics—rule-based modeling of human language—with AI, machine learning, and deep learning models to process vast amounts of text and speech data.&lt;/p&gt;</description></item></channel></rss>