<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Computer Vision on AI VOID</title><link>https://ai-blog.noorshomelab.dev/tags/computer-vision/</link><description>Recent content in Computer Vision on AI VOID</description><generator>Hugo</generator><language>en</language><lastBuildDate>Fri, 20 Mar 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://ai-blog.noorshomelab.dev/tags/computer-vision/index.xml" rel="self" type="application/rss+xml"/><item><title>Chapter 1: Introduction to Face Biometrics and UniFace Concepts</title><link>https://ai-blog.noorshomelab.dev/uniface-biometrics-guide-2026/intro-face-biometrics/</link><pubDate>Wed, 11 Mar 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/uniface-biometrics-guide-2026/intro-face-biometrics/</guid><description>&lt;h2 id="welcome-to-the-world-of-face-biometrics-with-uniface"&gt;Welcome to the World of Face Biometrics with UniFace!&lt;/h2&gt;
&lt;p&gt;Hello, future face biometrics expert! Welcome to the very first chapter of your journey into mastering the UniFace toolkit. In this guide, we&amp;rsquo;re going to demystify advanced face biometrics, breaking down complex ideas into easy, actionable steps. You&amp;rsquo;ll learn not just &lt;em&gt;how&lt;/em&gt; to use tools, but &lt;em&gt;why&lt;/em&gt; they work the way they do, empowering you to build intelligent, robust facial recognition applications.&lt;/p&gt;</description></item><item><title>Chapter 1: What are Vector Embeddings? The Language of AI</title><link>https://ai-blog.noorshomelab.dev/usearch-scylladb-vector-search-guide-2026/01-what-are-vector-embeddings/</link><pubDate>Tue, 17 Feb 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/usearch-scylladb-vector-search-guide-2026/01-what-are-vector-embeddings/</guid><description>&lt;h3 id="introduction"&gt;Introduction&lt;/h3&gt;
&lt;p&gt;Welcome to the exciting world of USearch and ScyllaDB vector search! Before we dive into the powerful tools that enable lightning-fast similarity lookups, we need to understand the fundamental concept that makes it all possible: &lt;strong&gt;vector embeddings&lt;/strong&gt;. Think of vector embeddings as the secret language that allows Artificial Intelligence (AI) to truly understand and interact with the complex information around us.&lt;/p&gt;
&lt;p&gt;In this first chapter, we&amp;rsquo;ll demystify vector embeddings. You&amp;rsquo;ll learn what they are, why they&amp;rsquo;ve become indispensable for modern AI applications, and how they transform raw data—like text, images, or even audio—into a numerical format that computers can process meaningfully. We&amp;rsquo;ll explore the core ideas behind their creation and the properties that make them so powerful for tasks like recommendation systems, semantic search, and anomaly detection.&lt;/p&gt;</description></item><item><title>Chapter 7: Convolutional Neural Networks (CNNs) for Computer Vision</title><link>https://ai-blog.noorshomelab.dev/ai-ml-career-path-2026/convolutional-neural-networks/</link><pubDate>Sat, 17 Jan 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/ai-ml-career-path-2026/convolutional-neural-networks/</guid><description>&lt;h2 id="chapter-7-convolutional-neural-networks-cnns-for-computer-vision"&gt;Chapter 7: Convolutional Neural Networks (CNNs) for Computer Vision&lt;/h2&gt;
&lt;p&gt;Welcome back, future AI architect! In our journey, we&amp;rsquo;ve explored the basics of neural networks and understood how they can learn patterns from data. But what about images? Images are special: they have spatial relationships, and a simple dense neural network might struggle to capture these effectively.&lt;/p&gt;
&lt;p&gt;This chapter introduces you to &lt;strong&gt;Convolutional Neural Networks (CNNs)&lt;/strong&gt;, the powerhouse behind most modern computer vision applications. From recognizing faces on your phone to autonomous driving, CNNs are everywhere. You&amp;rsquo;ll learn the fundamental building blocks of CNNs, understand why they are so effective for image data, and get hands-on experience building and training your very own image classifier using TensorFlow and Keras.&lt;/p&gt;</description></item><item><title>Understanding Multimodal AI Systems</title><link>https://ai-blog.noorshomelab.dev/multimodal-ai-guide-2026/</link><pubDate>Fri, 20 Mar 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/multimodal-ai-guide-2026/</guid><description>&lt;p&gt;Welcome to this comprehensive guide on multimodal AI systems. Here, you will explore how these advanced systems integrate and process text, image, audio, and video inputs, covering their core architectures and data pipelines. Discover real-world applications, from intelligent voice assistants to sophisticated vision-based AI, and understand their practical impact.&lt;/p&gt;</description></item><item><title>Project 2: Interactive Image Captioning Tool</title><link>https://ai-blog.noorshomelab.dev/transformers-js-guide/project-interactive-image-captioning-tool/</link><pubDate>Sun, 26 Oct 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/transformers-js-guide/project-interactive-image-captioning-tool/</guid><description>&lt;h1 id="8-project-2-interactive-image-captioning-tool"&gt;8. Project 2: Interactive Image Captioning Tool&lt;/h1&gt;
&lt;p&gt;This project will challenge you to build an interactive web application that generates descriptive captions for uploaded images. This utilizes a &lt;strong&gt;multimodal&lt;/strong&gt; AI model, which can process both visual and textual information to understand and describe an image.&lt;/p&gt;
&lt;h2 id="81-project-objective-and-problem-statement"&gt;8.1. Project Objective and Problem Statement&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Objective:&lt;/strong&gt; Develop a client-side web application where users can upload an image, and the application uses a Transformers.js model to automatically generate a human-readable caption describing the image&amp;rsquo;s content.&lt;/p&gt;</description></item><item><title>Visual Intelligence: Computer Vision Tasks</title><link>https://ai-blog.noorshomelab.dev/transformers-js-guide/visual-intelligence-computer-vision-tasks/</link><pubDate>Sun, 26 Oct 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/transformers-js-guide/visual-intelligence-computer-vision-tasks/</guid><description>&lt;h1 id="4-visual-intelligence-computer-vision-tasks"&gt;4. Visual Intelligence: Computer Vision Tasks&lt;/h1&gt;
&lt;p&gt;Computer Vision (CV) enables computers to &amp;ldquo;see&amp;rdquo; and interpret visual information from images and videos. Transformers.js brings powerful CV models directly to the browser, allowing for client-side image processing, analysis, and understanding. This chapter explores common CV tasks.&lt;/p&gt;
&lt;h2 id="41-image-classification"&gt;4.1. Image Classification&lt;/h2&gt;
&lt;p&gt;Image classification involves assigning a label (or class) to an entire image, determining what the main subject of the image is.&lt;/p&gt;
&lt;h3 id="411-detailed-explanation"&gt;4.1.1. Detailed Explanation&lt;/h3&gt;
&lt;p&gt;An image classification pipeline takes an image (as a URL, &lt;code&gt;File&lt;/code&gt; object, or &lt;code&gt;HTMLImageElement&lt;/code&gt;) and outputs a list of predicted labels with confidence scores. Models are trained on vast datasets like ImageNet, learning to recognize patterns associated with thousands of different categories.&lt;/p&gt;</description></item></channel></rss>