<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: moses omondi</title>
    <description>The latest articles on DEV Community by moses omondi (@moses_omondi_d411af81e579).</description>
    <link>https://dev.to/moses_omondi_d411af81e579</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F2117569%2F8d2987b4-94e5-4a90-b10b-d83e4058878c.png</url>
      <title>DEV Community: moses omondi</title>
      <link>https://dev.to/moses_omondi_d411af81e579</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/moses_omondi_d411af81e579"/>
    <language>en</language>
    <item>
      <title>Build &amp; Host Video Data Factory</title>
      <dc:creator>moses omondi</dc:creator>
      <pubDate>Tue, 15 Oct 2024 16:09:24 +0000</pubDate>
      <link>https://dev.to/moses_omondi_d411af81e579/build-host-video-data-factory-4bhk</link>
      <guid>https://dev.to/moses_omondi_d411af81e579/build-host-video-data-factory-4bhk</guid>
      <description>&lt;p&gt;&lt;a href="https://hadithi.studio/console/login/" rel="noopener noreferrer"&gt;Hadithi&lt;/a&gt; help developers easily build data factories so that they can generate , process and manage LLM video Datasets.&lt;br&gt;
We take care of data collection, ingestion, processing, annotation, validation, storage and integration of video datasets with Large Language Models as developers focus on fine-tuning or training AI applications using our LLM video datasets.&lt;/p&gt;

</description>
    </item>
    <item>
      <title>Data factory for generative video models</title>
      <dc:creator>moses omondi</dc:creator>
      <pubDate>Sat, 28 Sep 2024 01:37:02 +0000</pubDate>
      <link>https://dev.to/moses_omondi_d411af81e579/data-factory-for-generative-video-models-4fcf</link>
      <guid>https://dev.to/moses_omondi_d411af81e579/data-factory-for-generative-video-models-4fcf</guid>
      <description>&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxwobz92epveka2tnihs5.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxwobz92epveka2tnihs5.png" alt="Image description" width="800" height="461"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Hadithi automates video processing: it organizes and renames videos with timestamps, segments them into clips, detects scenes, removes audio if needed, filters out short videos, rescales and extracts frames, batches videos, validates image counts in folders, and creates videos from images at the correct frame rate.&lt;/p&gt;

&lt;p&gt;It is easy to use, open-source, and runs entirely on a CPU with minimal setup:&lt;/p&gt;

&lt;p&gt;Developers simply point the path to their dataset folder and, with the click of a single button, start extracting structured datasets—a task that is usually time consuming, very expensive, and requires expert skill.&lt;/p&gt;

&lt;p&gt;The source code is written in bash, which is lightweight and easy to understand.Developers can modify the source code to suit their needs. They can even use it to set up their own data foundry!&lt;/p&gt;

&lt;p&gt;Unlike most video processing tools, it doesn't require a GPU.Anyone with a moderate cpu and sufficient storage hardware can create thousands of videos.&lt;/p&gt;

&lt;p&gt;Only Bash, FFmpeg, and Exiftool are required to setup the system.Sorry, Windows and Mac OS users.,I developed the system on Ubuntu 18.04 but you can test it on your operating systems.&lt;/p&gt;

</description>
      <category>opensource</category>
      <category>datascience</category>
      <category>llm</category>
      <category>tooling</category>
    </item>
    <item>
      <title>Data factory for LLM video models</title>
      <dc:creator>moses omondi</dc:creator>
      <pubDate>Thu, 26 Sep 2024 18:07:58 +0000</pubDate>
      <link>https://dev.to/moses_omondi_d411af81e579/data-factory-for-llm-video-models-2om5</link>
      <guid>https://dev.to/moses_omondi_d411af81e579/data-factory-for-llm-video-models-2om5</guid>
      <description>&lt;p&gt;Hadithi is an open-source, bash-based command-line tool that enables AI and ML developers to easily convert Youtube, Torrent, and enterprise videos into high-quality datasets for fine-tuning large language models (LLMs).&lt;br&gt;
&lt;a href="https://hadithi.studio" rel="noopener noreferrer"&gt;access source code&lt;/a&gt;&lt;/p&gt;

</description>
      <category>video</category>
      <category>datasets</category>
      <category>llm</category>
      <category>models</category>
    </item>
  </channel>
</rss>
