<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Data Preparation on AI VOID</title><link>https://ai-blog.noorshomelab.dev/tags/data-preparation/</link><description>Recent content in Data Preparation on AI VOID</description><generator>Hugo</generator><language>en</language><lastBuildDate>Fri, 30 Jan 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://ai-blog.noorshomelab.dev/tags/data-preparation/index.xml" rel="self" type="application/rss+xml"/><item><title>Chapter 5: Data Preparation and Loading for Tunix</title><link>https://ai-blog.noorshomelab.dev/tunix-mastery-2026/05-data-preparation/</link><pubDate>Fri, 30 Jan 2026 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/tunix-mastery-2026/05-data-preparation/</guid><description>&lt;h2 id="chapter-5-data-preparation-and-loading-for-tunix"&gt;Chapter 5: Data Preparation and Loading for Tunix&lt;/h2&gt;
&lt;p&gt;Welcome back, future LLM master! In the previous chapters, we laid the groundwork by understanding Tunix&amp;rsquo;s architecture and setting up our development environment. Now, it&amp;rsquo;s time to talk about the fuel that powers any Large Language Model: data!&lt;/p&gt;
&lt;p&gt;This chapter is all about getting your data ready for Tunix. We&amp;rsquo;ll dive deep into the crucial steps of preparing your text-based datasets, understanding how to tokenize them, and setting up efficient data loading pipelines that play nicely with JAX and Tunix. Think of this as preparing a delicious meal – you need to carefully select, clean, and chop your ingredients before you can even think about cooking!&lt;/p&gt;</description></item><item><title>Data Manipulation and Analysis: NumPy, Pandas, and Visualization for AI</title><link>https://ai-blog.noorshomelab.dev/guides/data-manipulation-analysis-numpy-pandas/</link><pubDate>Fri, 22 Aug 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/guides/data-manipulation-analysis-numpy-pandas/</guid><description>&lt;h1 id="mastering-data-manipulation-and-analysis-numpy-pandas-and-visualization-for-ai"&gt;Mastering Data Manipulation and Analysis: NumPy, Pandas, and Visualization for AI&lt;/h1&gt;
&lt;h2 id="introduction"&gt;Introduction&lt;/h2&gt;
&lt;p&gt;In the ever-evolving landscape of artificial intelligence and machine learning, the ability to effectively manipulate, analyze, and visualize data is not just a skill but a cornerstone for success. From the foundational steps of cleaning raw datasets to the sophisticated preparation required for training large language models (LLMs) or understanding agent performance, a deep understanding of data tools is paramount.&lt;/p&gt;</description></item></channel></rss>