<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Ollama on AI VOID</title><link>https://ai-blog.noorshomelab.dev/tags/ollama/</link><description>Recent content in Ollama on AI VOID</description><generator>Hugo</generator><language>en</language><lastBuildDate>Sun, 17 May 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://ai-blog.noorshomelab.dev/tags/ollama/index.xml" rel="self" type="application/rss+xml"/><item><title>Welcome to AIPack: Your Agentic Runtime for AI</title><link>https://ai-blog.noorshomelab.dev/aipack-guide-2026/welcome-to-aipack/</link><pubDate>Sun, 17 May 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/aipack-guide-2026/welcome-to-aipack/</guid><description>&lt;p&gt;Building sophisticated AI agents that can tackle real-world problems isn&amp;rsquo;t just about crafting clever prompts. It&amp;rsquo;s about orchestrating complex workflows, managing context, integrating diverse tools, and ensuring your agents are reliable and shareable. Without a robust system, these challenges quickly lead to unmanageable, brittle AI applications. This is precisely where AIPack steps in.&lt;/p&gt;
&lt;p&gt;This guide will take you on a journey from zero to mastery with AIPack, an open-source agentic runtime designed to simplify the entire lifecycle of AI agents. In this first chapter, you&amp;rsquo;ll learn how to install AIPack, understand its core architecture, and build your very first intelligent agent. By the end, you&amp;rsquo;ll have a foundational understanding of how to define, run, and interact with an AIPack agent, setting the stage for more advanced capabilities in your daily AI-assisted software engineering workflows.&lt;/p&gt;</description></item><item><title>Setting Up Your AIPack Development Environment</title><link>https://ai-blog.noorshomelab.dev/aipack-guide-2026/setup-aipack-environment/</link><pubDate>Sun, 17 May 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/aipack-guide-2026/setup-aipack-environment/</guid><description>&lt;p&gt;Embarking on the journey of building sophisticated AI agents requires a well-prepared workshop. This chapter will guide you through setting up your complete &lt;strong&gt;AIPack development environment&lt;/strong&gt;, turning your machine into a powerful hub for designing, testing, and deploying intelligent agents. We&amp;rsquo;ll cover everything from core dependencies to specialized tools, ensuring you have a smooth and efficient workflow.&lt;/p&gt;
&lt;p&gt;Why is a robust setup so crucial? Imagine trying to build a complex machine with missing tools or a disorganized workspace. It&amp;rsquo;s frustrating and inefficient. For AI agents, your development environment is that workshop. A properly configured setup prevents common pitfalls, streamlines debugging, and allows you to focus on the creative challenge of agent design rather than wrestling with your tools. By the end of this chapter, you&amp;rsquo;ll have a fully functional environment, ready for your first AIPack project.&lt;/p&gt;</description></item><item><title>Your First AI Pack: Understanding .aip Files and Basic Agents</title><link>https://ai-blog.noorshomelab.dev/aipack-guide-2026/first-ai-pack-aip-files/</link><pubDate>Sun, 17 May 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/aipack-guide-2026/first-ai-pack-aip-files/</guid><description>&lt;p&gt;Welcome to Chapter 3! If you&amp;rsquo;ve ever wanted to build your own intelligent agent and share it with others, you&amp;rsquo;re in the right place. In this chapter, we&amp;rsquo;re taking the crucial step from setting up our environment to creating our very first AI agent using AIPack.&lt;/p&gt;
&lt;p&gt;This chapter is your hands-on introduction to the core components of AIPack: the &lt;code&gt;.aip&lt;/code&gt; file format and the structure of basic multi-stage markdown agents. We&amp;rsquo;ll start with the simplest possible agent and gradually add more functionality, ensuring you understand each piece before moving on. By the end, you&amp;rsquo;ll not only have a working agent but also a solid mental model for how AIPack organizes and executes AI workflows.&lt;/p&gt;</description></item><item><title>Connecting to AI: Provider Integrations (Ollama, Cloud APIs)</title><link>https://ai-blog.noorshomelab.dev/aipack-guide-2026/provider-integrations/</link><pubDate>Sun, 17 May 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/aipack-guide-2026/provider-integrations/</guid><description>&lt;p&gt;AI agents, at their core, are problem-solvers that leverage the intelligence of Large Language Models (LLMs). To build truly powerful and versatile AI Packs, your agents need the ability to communicate with these LLMs, whether they&amp;rsquo;re running locally on your machine or accessible through cloud services. This chapter guides you through the essential process of integrating various AI model providers into your AIPack projects.&lt;/p&gt;
&lt;p&gt;Understanding and implementing provider integrations is a critical skill for any AI agent developer. Why does this matter so much? Because it offers immense flexibility and resilience. You can choose local models like Ollama for privacy, cost-effectiveness, and rapid offline iteration. Alternatively, you can leverage cloud APIs (like OpenAI or Anthropic) for their scalability, advanced capabilities, and access to cutting-edge research models. Mastering these integrations allows you to design agents that are performant, adaptable to different operational environments, and aligned with diverse budget constraints.&lt;/p&gt;</description></item><item><title>Chapter 8: Local AI Integration - Running Models with Ollama/Docker</title><link>https://ai-blog.noorshomelab.dev/a2ui-guide-2025/local-ai-ollama-docker/</link><pubDate>Tue, 23 Dec 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/a2ui-guide-2025/local-ai-ollama-docker/</guid><description>&lt;h2 id="chapter-8-local-ai-integration---running-models-with-ollamadocker"&gt;Chapter 8: Local AI Integration - Running Models with Ollama/Docker&lt;/h2&gt;
&lt;p&gt;Welcome back, future A2UI maestro! In our journey so far, we&amp;rsquo;ve explored the foundations of A2UI, understood how agents generate dynamic interfaces, and even built some basic components. Often, these agents rely on powerful Large Language Models (LLMs) to make decisions and generate content. While cloud-based LLMs are fantastic, there are compelling reasons to run these models locally: privacy, cost control, offline capabilities, and the sheer joy of having an AI brain on your own machine!&lt;/p&gt;</description></item><item><title>Real-World Project: AI-Assisted Python Debugging Agent</title><link>https://ai-blog.noorshomelab.dev/aipack-guide-2026/project-ai-python-debugging/</link><pubDate>Sun, 17 May 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/aipack-guide-2026/project-ai-python-debugging/</guid><description>&lt;p&gt;Debugging Python code, especially within complex applications, can feel like searching for a needle in a haystack—time-consuming and often frustrating. Imagine having an intelligent assistant that not only highlights errors but also suggests fixes, explains the root cause, and helps you verify the solution. This chapter guides you through building exactly that: an AI-powered Python debugging agent using AIPack.&lt;/p&gt;
&lt;p&gt;You&amp;rsquo;ll learn how to harness AIPack&amp;rsquo;s powerful multi-stage agent capabilities, integrate with the MCP (Multi-Agent Communication Protocol) server for real-time interaction with your Python environment, and craft intelligent prompts to create a truly helpful debugging companion. This project will solidify your understanding of AIPack&amp;rsquo;s core principles by applying them to a practical, real-world development challenge.&lt;/p&gt;</description></item><item><title>Local LLMs with any-llm (Ollama Integration)</title><link>https://ai-blog.noorshomelab.dev/any-llm-guide-2025/local-llms-ollama/</link><pubDate>Tue, 30 Dec 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/any-llm-guide-2025/local-llms-ollama/</guid><description>&lt;h2 id="introduction-bringing-llms-home"&gt;Introduction: Bringing LLMs Home&lt;/h2&gt;
&lt;p&gt;Welcome back, future AI architect! So far in our &lt;code&gt;any-llm&lt;/code&gt; journey, we&amp;rsquo;ve largely focused on interacting with powerful cloud-based LLMs like OpenAI, Anthropic, or Mistral. These services are incredible for their scale and performance, but what if you need more privacy, lower latency, or simply want to experiment without incurring API costs?&lt;/p&gt;
&lt;p&gt;This chapter is all about bringing the power of Large Language Models directly to your machine. We&amp;rsquo;ll dive into the exciting world of &lt;strong&gt;Local LLMs&lt;/strong&gt; and learn how to run them efficiently using a fantastic tool called &lt;strong&gt;Ollama&lt;/strong&gt;. Best of all, we&amp;rsquo;ll see how &lt;code&gt;any-llm&lt;/code&gt; seamlessly integrates with Ollama, allowing you to switch between local and cloud models with minimal code changes. Pretty neat, right?&lt;/p&gt;</description></item><item><title>Best Practices for Building and Sharing Production AI Packs</title><link>https://ai-blog.noorshomelab.dev/aipack-guide-2026/best-practices-production-packs/</link><pubDate>Sun, 17 May 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/aipack-guide-2026/best-practices-production-packs/</guid><description>&lt;h2 id="introduction-to-production-ready-ai-packs"&gt;Introduction to Production-Ready AI Packs&lt;/h2&gt;
&lt;p&gt;Moving from an experimental AI agent that works on your local machine to a robust, reliable, and shareable &amp;ldquo;AI Pack&amp;rdquo; ready for production workflows introduces a new set of challenges and considerations. This isn&amp;rsquo;t just about getting an agent to respond; it&amp;rsquo;s about ensuring it performs consistently, handles errors gracefully, is maintainable over time, and can be easily shared and deployed by others.&lt;/p&gt;
&lt;p&gt;In this chapter, we&amp;rsquo;ll dive deep into the best practices that transform your AIPack projects from prototypes into production-grade solutions. We&amp;rsquo;ll cover everything from architectural design patterns to efficient context management, robust error handling, and strategies for effective sharing. By the end, you&amp;rsquo;ll have a clear understanding of how to build AI Packs that stand up to the demands of real-world use cases.&lt;/p&gt;</description></item><item><title>AIPack Zero-to-Mastery Guide</title><link>https://ai-blog.noorshomelab.dev/aipack-guide-2026/</link><pubDate>Sun, 17 May 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/aipack-guide-2026/</guid><description>&lt;p&gt;Embark on a comprehensive journey to master AIPack, the cutting-edge platform for AI-assisted software engineering. This guide covers everything from initial setup and configuration to building, deploying, and sharing sophisticated AI Packs for real-world production workflows. Explore AIPack architecture, multi-stage agents, Lua logic, provider integrations, and advanced techniques for debugging, optimization, and agent composition.&lt;/p&gt;</description></item><item><title>AIPack: Building Production-Ready AI Agents</title><link>https://ai-blog.noorshomelab.dev/guides/aipack-zero-to-mastery-guide/</link><pubDate>Sun, 17 May 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/guides/aipack-zero-to-mastery-guide/</guid><description>&lt;p&gt;Building reliable and shareable AI agents for real-world production tasks can feel complex. How do you manage agent logic, integrate with various AI models, and ensure your agents can handle intricate, multi-step workflows, especially when dealing with large codebases? This guide introduces you to AIPack, an open-source agentic runtime designed to simplify this entire process.&lt;/p&gt;
&lt;h3 id="why-aipack-matters-for-your-projects"&gt;Why AIPack Matters for Your Projects&lt;/h3&gt;
&lt;p&gt;AIPack provides a structured way to define, execute, and distribute AI agents. It&amp;rsquo;s not just about running prompts; it&amp;rsquo;s about orchestrating sophisticated, multi-stage agent behaviors that can tackle complex problems like automated code generation, intelligent debugging, or even cloud infrastructure management. By using AIPack, you gain:&lt;/p&gt;</description></item><item><title>How to Integrate VS Code with Ollama for Local AI Assistance: Step-by-Step Guide</title><link>https://ai-blog.noorshomelab.dev/tutorials/integrate-vscode-ollama-local-ai/</link><pubDate>Thu, 09 Apr 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/tutorials/integrate-vscode-ollama-local-ai/</guid><description>&lt;h2 id="introduction"&gt;Introduction&lt;/h2&gt;
&lt;p&gt;This tutorial will guide you through setting up a powerful, private, and cost-free AI coding assistant directly within your Visual Studio Code environment. By integrating &lt;a href="https://ollama.com/"&gt;Ollama&lt;/a&gt; with the &lt;a href="https://continue.dev/"&gt;Continue VS Code extension&lt;/a&gt;, you&amp;rsquo;ll be able to run large language models (LLMs) locally on your machine. This setup allows for code generation, completion, debugging assistance, and refactoring without relying on external APIs, ensuring complete privacy for your code and eliminating API costs.&lt;/p&gt;</description></item><item><title>MCP - Model Context Protocol: A Guide for AI Agent Developers</title><link>https://ai-blog.noorshomelab.dev/guides/mcp-model-context-protocol-for-ai-agents/</link><pubDate>Mon, 25 Aug 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/guides/mcp-model-context-protocol-for-ai-agents/</guid><description>&lt;h1 id="mastering-mcp---model-context-protocol-a-guide-for-ai-agent-developers"&gt;Mastering MCP - Model Context Protocol: A Guide for AI Agent Developers&lt;/h1&gt;
&lt;p&gt;Welcome to the cutting edge of AI agent development! This document will guide you through the intricacies of the Model Context Protocol (MCP), a revolutionary open standard that allows AI agents to interact with external systems, tools, and data in a standardized, secure, and highly effective manner. By the end of this guide, you will be equipped to design, build, and deploy your own MCP servers and integrate them with popular AI tools like Ollama and development environments like Visual Studio Code.&lt;/p&gt;</description></item><item><title>Local LLMs: A Comprehensive Learning Path</title><link>https://ai-blog.noorshomelab.dev/guides/learn-ai-from-scratch/</link><pubDate>Sat, 23 Aug 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/guides/learn-ai-from-scratch/</guid><description>&lt;p&gt;Embark on an exciting journey to master data science, where you&amp;rsquo;ll gain the power to fine-tune, restructure, quantize, and retrain local LLMs like Ollama. This ambitious yet incredibly rewarding quest blends traditional data science, cutting-edge machine learning, and specialized deep learning for large language models.&lt;/p&gt;
&lt;h3 id="foundational-data-science-skills"&gt;Foundational Data Science Skills:&lt;/h3&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="../../ai/python-programming"&gt;Python Programming&lt;/a&gt;:&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;Core Python (data structures, control flow, functions, OOP).&lt;/li&gt;
&lt;li&gt;File I/O.&lt;/li&gt;
&lt;li&gt;Virtual environments and package management (&lt;code&gt;pip&lt;/code&gt;, &lt;code&gt;conda&lt;/code&gt;).&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="../../ai/data-manipulation-analysis"&gt;Data Manipulation and Analysis&lt;/a&gt;:&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;NumPy:&lt;/strong&gt; Efficient array operations, linear algebra.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Pandas:&lt;/strong&gt; Data loading, cleaning, transformation, and analysis with DataFrames.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Data Visualization:&lt;/strong&gt; Matplotlib, Seaborn (for understanding data distributions, model performance).&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="../../ai/machine-learning-fundamentals"&gt;Machine Learning Fundamentals (Traditional ML)&lt;/a&gt;:&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Scikit-learn:&lt;/strong&gt; Supervised learning (regression, classification), unsupervised learning (clustering), model evaluation metrics, cross-validation.&lt;/li&gt;
&lt;li&gt;Feature engineering.&lt;/li&gt;
&lt;li&gt;Understanding bias-variance tradeoff, overfitting, underfitting.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h3 id="deep-learning-and-llm-specific-skills"&gt;Deep Learning and LLM-Specific Skills:&lt;/h3&gt;
&lt;ol start="4"&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="../../ai/deep-learning-frameworks"&gt;Deep Learning Frameworks&lt;/a&gt;:&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;PyTorch (highly recommended) or TensorFlow:&lt;/strong&gt; Tensor operations, defining neural network architectures, training loops, optimizers, loss functions, GPU acceleration.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="../../ai/natural-language-processing-fundamentals"&gt;Natural Language Processing (NLP) Fundamentals&lt;/a&gt;:&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;Text preprocessing (tokenization, stemming, lemmatization).&lt;/li&gt;
&lt;li&gt;Word embeddings (Word2Vec, GloVe, FastText - conceptual understanding).&lt;/li&gt;
&lt;li&gt;Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTMs) - conceptual.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Attention Mechanisms and Transformers:&lt;/strong&gt; This is &lt;em&gt;critical&lt;/em&gt; for LLMs. Understanding how they work is fundamental.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="../../ai/llm-architectures"&gt;Large Language Model (LLM) Architectures&lt;/a&gt;:&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Decoder-only models (GPT-series):&lt;/strong&gt; Causal language modeling.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Encoder-decoder models (T5, BART):&lt;/strong&gt; Sequence-to-sequence tasks.&lt;/li&gt;
&lt;li&gt;Understanding model sizes (parameters: 7B, 13B, 70B etc.).&lt;/li&gt;
&lt;li&gt;Open-source LLM families (Llama, Mistral, Gemma, Qwen, Phi).&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="../../ai/llm-pre-training-fine-tuning"&gt;LLM Pre-training and Fine-tuning Concepts&lt;/a&gt;:&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Pre-training:&lt;/strong&gt; Conceptual understanding of how base models are trained on vast text data.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Fine-tuning:&lt;/strong&gt; Customizing LLMs for specific tasks or domains.
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Supervised Fine-tuning (SFT):&lt;/strong&gt; Training on labeled datasets (question-answer pairs, instruction-following).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Instruction Fine-tuning:&lt;/strong&gt; Aligning models to follow instructions.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Parameter-Efficient Fine-Tuning (PEFT):&lt;/strong&gt; LoRA, QLoRA (understanding how they work to reduce computational resources for fine-tuning).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Reinforcement Learning from Human Feedback (RLHF) / Direct Preference Optimization (DPO):&lt;/strong&gt; Aligning models with human preferences (conceptual understanding for advanced work).&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Data Preparation for Fine-tuning:&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;Data collection and curation.&lt;/li&gt;
&lt;li&gt;Data cleaning, labeling, and structuring (e.g., into chat templates like ChatML).&lt;/li&gt;
&lt;li&gt;Synthetic data generation.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="../../ai/llm-quantization-mastery"&gt;LLM Quantization: Making Models Lean for Local Deployment&lt;/a&gt;:&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;Reducing model size and memory footprint (e.g., 4-bit, 8-bit quantization) to run on local/edge devices.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="../../ai/llm-deployment-serving"&gt;LLM Deployment and Serving (Local)&lt;/a&gt;:&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Ollama:&lt;/strong&gt; How to use Ollama to download, serve, and manage local LLMs.&lt;/li&gt;
&lt;li&gt;Converting fine-tuned models to formats compatible with local inference (e.g., GGUF).&lt;/li&gt;
&lt;li&gt;Hardware considerations for local LLMs (GPU VRAM, RAM).&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="../../ai/agentic-ai-frameworks"&gt;Agentic AI Frameworks (for Application Building)&lt;/a&gt;:&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;LangChain / LangGraph:&lt;/strong&gt; Building intelligent agents, chaining LLM calls, integrating tools, managing memory, and constructing complex workflows.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;CrewAI:&lt;/strong&gt; For multi-agent systems and collaborative task execution.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;n8n:&lt;/strong&gt; For workflow automation and integration of LLMs with other services.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="../../ai/retrieval-augmented-generation"&gt;Retrieval-Augmented Generation (RAG)&lt;/a&gt;:&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;Understanding when to use RAG vs. fine-tuning.&lt;/li&gt;
&lt;li&gt;Components of a RAG system: Document loaders, text splitters, embedding models, vector databases (ChromaDB, Pinecone, Weaviate), retrievers.&lt;/li&gt;
&lt;li&gt;Integrating RAG with local LLMs (Ollama + LangChain/LlamaIndex).&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="../../ai/mlops-llmops"&gt;MLOps/LLMOps (Operationalizing LLMs)&lt;/a&gt;:&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;Experiment tracking (e.g., Weights &amp;amp; Biases for fine-tuning).&lt;/li&gt;
&lt;li&gt;Model versioning.&lt;/li&gt;
&lt;li&gt;Monitoring performance and cost.&lt;/li&gt;
&lt;li&gt;Debugging agent behavior (e.g., LangSmith).&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;hr&gt;</description></item><item><title>Local LLM Deployment: Mastering Ollama for Custom Fine-tuned Models</title><link>https://ai-blog.noorshomelab.dev/ai/llm-deployment-serving/</link><pubDate>Fri, 22 Aug 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/ai/llm-deployment-serving/</guid><description>&lt;h1 id="llm-deployment-and-serving-local-mastering-ollama-for-custom-models"&gt;LLM Deployment and Serving (Local): Mastering Ollama for Custom Models&lt;/h1&gt;
&lt;hr&gt;
&lt;h2 id="1-introduction-the-power-of-local-llms"&gt;1. Introduction: The Power of Local LLMs&lt;/h2&gt;
&lt;p&gt;Large Language Models (LLMs) have ushered in a new era of intelligent applications, from advanced chatbots to sophisticated code assistants. While powerful, many LLMs are often accessed via cloud-based APIs, leading to concerns about data privacy, recurring costs, and internet dependency. This document champions the increasingly vital practice of deploying and serving LLMs locally. It offers a comprehensive guide to understanding, implementing, and optimizing local LLM inference, with a particular emphasis on &lt;strong&gt;Ollama&lt;/strong&gt;, an innovative framework that simplifies this complex process for both pre-packaged and custom fine-tuned models.&lt;/p&gt;</description></item></channel></rss>