<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Jimmy Guerrero</title>
    <description>The latest articles on DEV Community by Jimmy Guerrero (@jguerrero-voxel51).</description>
    <link>https://dev.to/jguerrero-voxel51</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1255335%2Fae95dea7-5209-47b4-b1d3-12ac47fa900c.png</url>
      <title>DEV Community: Jimmy Guerrero</title>
      <link>https://dev.to/jguerrero-voxel51</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/jguerrero-voxel51"/>
    <language>en</language>
    <item>
      <title>May 27 - Video Understanding Workshop</title>
      <dc:creator>Jimmy Guerrero</dc:creator>
      <pubDate>Fri, 22 May 2026 16:12:02 +0000</pubDate>
      <link>https://dev.to/voxel51/may-27-video-understanding-workshop-3f0e</link>
      <guid>https://dev.to/voxel51/may-27-video-understanding-workshop-3f0e</guid>
      <description>&lt;p&gt;Join us for a hands-on virtual session on May 27 exploring video-native multimodal AI and how to integrate cutting-edge video understanding models into your computer vision workflows.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://voxel51.com/events/getting-started-perceptron-ai-fiftyone-video-understanding-may-27-2026" rel="noopener noreferrer"&gt;&lt;strong&gt;Register for the Zoom&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Akshat Shrivastava from &lt;a href="https://www.perceptron.inc/" rel="noopener noreferrer"&gt;Perceptron&lt;/a&gt; will introduce their latest video-native multimodal model that matches frontier models at a fraction of inference cost, followed by Harpreet Sahota demonstrating how to get started with Perceptron AI inside &lt;a href="https://docs.voxel51.com/" rel="noopener noreferrer"&gt;FiftyOne&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fro65gkbfs4hhzgjmw5nu.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fro65gkbfs4hhzgjmw5nu.png" alt=" " width="800" height="301"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Work through annotation QA, large scale dataset curation, and model evaluation workflows with the Voxel51 team — customized to your use case, your tech stack, and your data. These hands-on workshops are delivered by FiftyOne experts, available through virtual and in-person formats.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://voxel51.com/workshops" rel="noopener noreferrer"&gt;&lt;strong&gt;Book a workshop!&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;See you online!&lt;/p&gt;

</description>
      <category>ai</category>
      <category>machinelearning</category>
      <category>computervision</category>
      <category>agents</category>
    </item>
    <item>
      <title>June 25 - AI, ML and Computer Vision Meetup</title>
      <dc:creator>Jimmy Guerrero</dc:creator>
      <pubDate>Wed, 20 May 2026 16:47:09 +0000</pubDate>
      <link>https://dev.to/voxel51/june-25-ai-ml-and-computer-vision-meetup-4l2k</link>
      <guid>https://dev.to/voxel51/june-25-ai-ml-and-computer-vision-meetup-4l2k</guid>
      <description>&lt;p&gt;&lt;strong&gt;Date, Time and Location&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Jun 25, 2026&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;9AM Pacific&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Online. &lt;a href="https://voxel51.com/events/ai-ml-and-computer-vision-meetup-june-25-2026" rel="noopener noreferrer"&gt;Register for the Zoom!&lt;/a&gt;&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Talks will include:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Large-Scale Scene Reconstruction via Local View Transformers&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Transformer-based models have advanced 3D scene reconstruction, but their quadratic attention limits scalability to large scenes. We introduce the Local View Transformer (LVT), which replaces global attention with locality-aware attention over neighboring views, conditioned on relative camera geometry. LVT decodes directly into 3D Gaussian splats with view-dependent color and opacity for high-fidelity rendering. Our approach enables scalable, single-pass reconstruction of large, high-resolution scenes.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;About the Speaker&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.linkedin.com/in/tooba-imtiaz/" rel="noopener noreferrer"&gt;Tooba Imtiaz&lt;/a&gt; is a PhD candidate in Electrical and Computer Engineering at Northeastern University, working in the Machine Learning Lab.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Lessons learned from running AI workloads in production&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;He’ll share his “tales from the engine room” - practical insights from operating AI systems at scale, including the challenges of abstraction layers, the realities of data movement and hardware constraints, and how systems thinking is essential for building high-performance, secure, and responsible AI infrastructure.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;About the Speaker&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.linkedin.com/in/-davehughes-/" rel="noopener noreferrer"&gt;Dave Hughes&lt;/a&gt; is CTO at Stelia. He was formerly CTO at Genesis Cloud, which pioneered what is now commonly known as 'neoclouds', and Principal Engineer/Interim Director of Engineering at Adjust GmbH where he built large-scale data warehousing and processing.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Enhancing Low-Field MRI with Deep Super-Resolution for Improved Nipah Virus Neuroimaging&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Advances in deep learning make very-low-field (VLF) MRI systems a viable alternative for in vivo neuroimaging. Zero-shot super-resolution, self-supervised learning, and generative AI were explored to improve the quality of low-field MRI images. We present a framework for the first deployment of a VLF scanner for imaging Nipah virus-inoculated nonhuman primates (NHPs) using a 0.05 T MRI system.&lt;/p&gt;

&lt;p&gt;First, a retrospective simulation study assessed the feasibility of imaging NiV infection at low field, followed by a prospective deployment (0.05 T) that enabled longitudinal imaging. The VLF-NiV imaging was characterized by low image quality and included multiple contrasts. A deep learning-based unpaired domain adaptation (CycleGAN) conditioned on acquisition parameters was used to harmonize contrast, and a simulation-based ResUNet model was used to reduce unwanted noise and preserve T2-weighted structural fidelity. We also highlight studies involving zero-shot super-resolution and denoising experiments that are advantageous for accessible neuroimaging.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;About the Speaker&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.linkedin.com/in/ajaysharma1996/" rel="noopener noreferrer"&gt;Ajay Sharma&lt;/a&gt; is a deep learning engineer with a broad background in biomedical image analysis. His research focuses on developing advanced deep learning methods for computer-aided disease detection and diagnosis.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;And Now for Something Completely Different with FiftyOne&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Often the best way to understand what a tool is truly capable of, is to use in ways it was never intended to be used. This session pushes FiftyOne past its computer vision roots through a series of demos showing how to push the boundaries with FiftyOne. A few practical, some playful, all built with open source code. You'll see how FiftyOne's core building blocks generalize far beyond labeled datasets, and leave with patterns and ideas you can take in your own direction.&lt;/p&gt;

&lt;p&gt;About the Speaker&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.linkedin.com/in/burhan-qa/" rel="noopener noreferrer"&gt;Burhan Qaddoumi&lt;/a&gt; is a ML DevRel Engineer at Voxel51 and perpetual "new guy" as a life long learner. Active in communities all across the web, eager to help, learn, and share with others that demonstrate initiative, interest, and drive.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>machinelearning</category>
      <category>computervision</category>
    </item>
    <item>
      <title>May 1 - Best of WACV 2026</title>
      <dc:creator>Jimmy Guerrero</dc:creator>
      <pubDate>Wed, 29 Apr 2026 16:07:22 +0000</pubDate>
      <link>https://dev.to/voxel51/may-1-best-of-wacv-2026-5e53</link>
      <guid>https://dev.to/voxel51/may-1-best-of-wacv-2026-5e53</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgdck6qpephq5o07od675.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgdck6qpephq5o07od675.png" alt=" " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Join us on May 1 for day two of the Best of WACV 2026 series of virtual events.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;a href="https://voxel51.com/events/best-of-wacv-2026-may-1-2026" rel="noopener noreferrer"&gt;Register for the Zoom&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Talks will include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Beyond Pixels: Type-Aware Contrastive Learning for Global Urban Similarity&lt;/strong&gt; - Idan Kligvasser at Google Research&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Perceptually Guided 3DGS Streaming and Rendering for Mixed Reality&lt;/strong&gt; - Sai Harsha Mupparaju at New York University&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;SAVIOR: Sample-efficient Adaptation of Vision-Language Models for OCR Representation&lt;/strong&gt; - Akshata Bhat at Hyperbots Inc.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;SynthForm: Towards a DLA-free E2E Form understanding model&lt;/strong&gt; - Andre Fu at Ecliptor&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>computervision</category>
      <category>machinelearning</category>
      <category>agents</category>
    </item>
    <item>
      <title>April 30 - Best of WACV 2026 (Day 1)</title>
      <dc:creator>Jimmy Guerrero</dc:creator>
      <pubDate>Tue, 28 Apr 2026 20:30:25 +0000</pubDate>
      <link>https://dev.to/voxel51/april-30-best-of-wacv-2026-day-1-l</link>
      <guid>https://dev.to/voxel51/april-30-best-of-wacv-2026-day-1-l</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3805ugxi9lsye43mj4vl.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3805ugxi9lsye43mj4vl.png" alt=" " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Join us on April 30 for day one of the Best of WACV 2026 series of virtual events.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;a href="https://voxel51.com/events/best-of-wacv-2026-april-30-2026" rel="noopener noreferrer"&gt;Register for the Zoom!&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Talks will include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Zero-Shot Coreset Selection via Iterative Subspace Sampling&lt;/strong&gt; - Brent Griffin at Voxel51&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;ENCORE: A Neural Collapse Perspective on Out-of-Distribution Detection in Deep Neural Networks&lt;/strong&gt; - A Q M Sazzad Sayyed at Northeastern University&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Synthesizing Compositional Videos from Text Description - Shanmuganathan Raman&lt;/strong&gt; at IIT Gandhinagar&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;The Perceptual Observatory Characterizing Robustness and Grounding in MLLMs&lt;/strong&gt; - Fenil Bardoliya at Arizona State University&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>machinelearning</category>
      <category>computervision</category>
      <category>datascience</category>
    </item>
    <item>
      <title>Thursday: April 9 - Visual AI Agents Workshop</title>
      <dc:creator>Jimmy Guerrero</dc:creator>
      <pubDate>Tue, 07 Apr 2026 18:23:18 +0000</pubDate>
      <link>https://dev.to/voxel51/thursday-april-9-visual-ai-agents-workshop-5aff</link>
      <guid>https://dev.to/voxel51/thursday-april-9-visual-ai-agents-workshop-5aff</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fuwvdui5f2l1qn3u7tzd1.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fuwvdui5f2l1qn3u7tzd1.png" alt=" " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Join us on April 9 at 9 AM Pacific for the &lt;strong&gt;Visual Agents: What it Takes to Build an Agent that can Navigate GUIs like Humans&lt;/strong&gt; virtual workshop.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://voxel51.com/events/visual-agents-what-it-takes-to-build-an-agent-that-can-navigate-guis-like-humans-april-9-2026" rel="noopener noreferrer"&gt;&lt;strong&gt;Register for the Zoom&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This hands-on workshop provides a comprehensive introduction to building and evaluating visual agents for GUI automation using modern tools and techniques. Participants will learn how to leverage FiftyOne, an open-source toolkit for dataset curation and computer vision workflows, to build production-ready GUI agent systems.&lt;/p&gt;

&lt;p&gt;What You'll Learn:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Dataset Creation &amp;amp; Management:&lt;/strong&gt; How to structure, annotate, and load GUI interaction datasets using the COCO4GUI standardized format&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Data Exploration &amp;amp; Analysis:&lt;/strong&gt; Using FiftyOne's interactive interface to visualize datasets, analyze action distributions, and understand annotation patterns&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Multimodal Embeddings:&lt;/strong&gt; Computing embeddings for screenshots and UI element patches to enable similarity search and retrieval&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Model Inference:&lt;/strong&gt; Running state-of-the-art models like Microsoft's GUI-Actor to predict interaction points from natural language instructions&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Performance Evaluation:&lt;/strong&gt; Measuring model accuracy using standard metrics and normalized click distance to assess localization precision&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Failure Analysis:&lt;/strong&gt; Investigating model failures through attention maps, error pattern analysis, and systematic debugging workflows&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Data-Driven Improvement:&lt;/strong&gt; Tagging samples based on error types (attention misalignment vs. localization errors) to prioritize fine-tuning efforts&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Synthetic Data Generation:&lt;/strong&gt; Using FiftyOne plugins to augment training data with synthetic task descriptions and variations&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>machinelearning</category>
      <category>computervision</category>
      <category>datascience</category>
    </item>
    <item>
      <title>Wednesday : April 8 - Computer Vision Workflows Workshop</title>
      <dc:creator>Jimmy Guerrero</dc:creator>
      <pubDate>Mon, 06 Apr 2026 17:48:20 +0000</pubDate>
      <link>https://dev.to/voxel51/wednesday-april-8-computer-vision-workflows-workshop-39n6</link>
      <guid>https://dev.to/voxel51/wednesday-april-8-computer-vision-workflows-workshop-39n6</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3sz5mpnuxnwv18k8vfj5.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3sz5mpnuxnwv18k8vfj5.png" alt=" " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Join us on April 8 at 10 AM Pacific for a free, 60-minute, virtual hands-on workshop to learn how to curate, visualize, and evaluate models using the open source &lt;a href="https://docs.voxel51.com/" rel="noopener noreferrer"&gt;FiftyOne app&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://voxel51.com/events/getting-started-with-fiftyone-april-8-2026" rel="noopener noreferrer"&gt;&lt;strong&gt;Register for the Zoom&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;What you'll learn:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Structure unstructured data.&lt;/strong&gt; Map data and metadata into a queryable schema for images, videos, and point clouds.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Query datasets with the FiftyOne SDK&lt;/strong&gt;. Create complex views based on model predictions, labels, and custom tags. Use the FiftyOne to filter data based on logical conditions and confidence scores.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Visualize high dimensional embeddings.&lt;/strong&gt; Project features into lower dimensions to find clusters of similar samples. Identify data gaps and outliers using FiftyOne Brain.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Automate data curation.&lt;/strong&gt; Implement algorithmic measures to select diverse subsets for training. Reduce labeling costs by prioritizing high entropy samples.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Debug model performance.&lt;/strong&gt; Run evaluation routines to generate confusion matrices and precision recall curves. Visualize false positives and false negatives directly in the App to understand model failures.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Customize FiftyOne.&lt;/strong&gt; Build custom dashboards and interactive panels. Create specialized views for domain specific tasks.&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>agents</category>
      <category>mcp</category>
      <category>computervision</category>
    </item>
    <item>
      <title>April 9 - Visual AI Agents Workshop</title>
      <dc:creator>Jimmy Guerrero</dc:creator>
      <pubDate>Fri, 03 Apr 2026 21:23:06 +0000</pubDate>
      <link>https://dev.to/voxel51/april-9-visual-ai-agents-workshop-482m</link>
      <guid>https://dev.to/voxel51/april-9-visual-ai-agents-workshop-482m</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fuwvdui5f2l1qn3u7tzd1.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fuwvdui5f2l1qn3u7tzd1.png" alt=" " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Join us on April 9 at 9 AM Pacific for the &lt;strong&gt;Visual Agents: What it Takes to Build an Agent that can Navigate GUIs like Humans&lt;/strong&gt; virtual workshop.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://voxel51.com/events/visual-agents-what-it-takes-to-build-an-agent-that-can-navigate-guis-like-humans-april-9-2026" rel="noopener noreferrer"&gt;&lt;strong&gt;Register for the Zoom&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This hands-on workshop provides a comprehensive introduction to building and evaluating visual agents for GUI automation using modern tools and techniques. Participants will learn how to leverage FiftyOne, an open-source toolkit for dataset curation and computer vision workflows, to build production-ready GUI agent systems.&lt;/p&gt;

&lt;p&gt;What You'll Learn:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Dataset Creation &amp;amp; Management:&lt;/strong&gt; How to structure, annotate, and load GUI interaction datasets using the COCO4GUI standardized format&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Data Exploration &amp;amp; Analysis:&lt;/strong&gt; Using FiftyOne's interactive interface to visualize datasets, analyze action distributions, and understand annotation patterns&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Multimodal Embeddings:&lt;/strong&gt; Computing embeddings for screenshots and UI element patches to enable similarity search and retrieval&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Model Inference:&lt;/strong&gt; Running state-of-the-art models like Microsoft's GUI-Actor to predict interaction points from natural language instructions&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Performance Evaluation:&lt;/strong&gt; Measuring model accuracy using standard metrics and normalized click distance to assess localization precision&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Failure Analysis:&lt;/strong&gt; Investigating model failures through attention maps, error pattern analysis, and systematic debugging workflows&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Data-Driven Improvement:&lt;/strong&gt; Tagging samples based on error types (attention misalignment vs. localization errors) to prioritize fine-tuning efforts&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Synthetic Data Generation:&lt;/strong&gt; Using FiftyOne plugins to augment training data with synthetic task descriptions and variations&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>machinelearning</category>
      <category>computervision</category>
      <category>datascience</category>
    </item>
    <item>
      <title>April 8 - Getting Started with Computer Vision Workflows Workshop</title>
      <dc:creator>Jimmy Guerrero</dc:creator>
      <pubDate>Thu, 02 Apr 2026 15:32:50 +0000</pubDate>
      <link>https://dev.to/voxel51/april-8-getting-started-with-computer-vision-workflows-workshop-1mf8</link>
      <guid>https://dev.to/voxel51/april-8-getting-started-with-computer-vision-workflows-workshop-1mf8</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3sz5mpnuxnwv18k8vfj5.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3sz5mpnuxnwv18k8vfj5.png" alt=" " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Join us on April 8 at 10 AM Pacific for a free, 60-minute, virtual hands-on workshop to learn how to curate, visualize, and evaluate models using the open source &lt;a href="https://docs.voxel51.com/" rel="noopener noreferrer"&gt;FiftyOne app&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://voxel51.com/events/getting-started-with-fiftyone-april-8-2026" rel="noopener noreferrer"&gt;&lt;strong&gt;Register for the Zoom&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;What you'll learn:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Structure unstructured data.&lt;/strong&gt; Map data and metadata into a queryable schema for images, videos, and point clouds.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Query datasets with the FiftyOne SDK&lt;/strong&gt;. Create complex views based on model predictions, labels, and custom tags. Use the FiftyOne to filter data based on logical conditions and confidence scores.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Visualize high dimensional embeddings.&lt;/strong&gt; Project features into lower dimensions to find clusters of similar samples. Identify data gaps and outliers using FiftyOne Brain.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Automate data curation.&lt;/strong&gt; Implement algorithmic measures to select diverse subsets for training. Reduce labeling costs by prioritizing high entropy samples.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Debug model performance.&lt;/strong&gt; Run evaluation routines to generate confusion matrices and precision recall curves. Visualize false positives and false negatives directly in the App to understand model failures.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Customize FiftyOne.&lt;/strong&gt; Build custom dashboards and interactive panels. Create specialized views for domain specific tasks.&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>agents</category>
      <category>machinelearning</category>
      <category>computervision</category>
    </item>
    <item>
      <title>Thursday: April 2 - AI, ML and Computer Vision Meetup</title>
      <dc:creator>Jimmy Guerrero</dc:creator>
      <pubDate>Wed, 01 Apr 2026 18:06:06 +0000</pubDate>
      <link>https://dev.to/voxel51/thursday-april-2-ai-ml-and-computer-vision-meetup-3p06</link>
      <guid>https://dev.to/voxel51/thursday-april-2-ai-ml-and-computer-vision-meetup-3p06</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F13z7vsfxskklxi6zis2w.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F13z7vsfxskklxi6zis2w.png" alt=" " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Join us on April 2 at 9 AM Pacific for the monthly AI, ML and Computer Vision Meetup!&lt;/p&gt;

&lt;p&gt;&lt;a href="https://voxel51.com/events/ai-ml-and-computer-vision-meetup-april-2-2026" rel="noopener noreferrer"&gt;&lt;strong&gt;Register for the Zoom&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Talks will include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Visual AI at the Edge: Beyond the Model&lt;/strong&gt; - David Moser at Orella Vision&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Sanitizing Evaluation Datasets: From Detection to Correction&lt;/strong&gt; - Nick Lotz at Voxel51&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Async Agents in Production: Failure Modes and Fixes&lt;/strong&gt; - Meryem Arik at Doubleword&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Building enterprise agentic systems that scale&lt;/strong&gt; - Aman Sardana at Cisco&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>computervision</category>
      <category>machinelearning</category>
      <category>opensource</category>
    </item>
    <item>
      <title>April 2 - AI, ML and Computer Vision Meetup</title>
      <dc:creator>Jimmy Guerrero</dc:creator>
      <pubDate>Fri, 27 Mar 2026 17:05:14 +0000</pubDate>
      <link>https://dev.to/voxel51/april-2-ai-ml-and-computer-vision-meetup-33na</link>
      <guid>https://dev.to/voxel51/april-2-ai-ml-and-computer-vision-meetup-33na</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8kqot8l2akh1zqcy1kuu.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8kqot8l2akh1zqcy1kuu.png" alt=" " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Join us on April 2 at 9 AM Pacific for the monthly AI, ML and Computer Vision Meetup!&lt;/p&gt;

&lt;p&gt;&lt;a href="https://voxel51.com/events/ai-ml-and-computer-vision-meetup-april-2-2026" rel="noopener noreferrer"&gt;&lt;strong&gt;Register for the Zoom&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Talks will include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Visual AI at the Edge: Beyond the Model&lt;/strong&gt; - David Moser at Orella Vision&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Sanitizing Evaluation Datasets: From Detection to Correction&lt;/strong&gt; - Nick Lotz at Voxel51&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Async Agents in Production: Failure Modes and Fixes&lt;/strong&gt; - Meryem Arik at Doubleword&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Building enterprise agentic systems that scale&lt;/strong&gt; - Aman Sardana at Cisco&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>computervision</category>
      <category>machinelearning</category>
      <category>opensource</category>
    </item>
    <item>
      <title>March 26 - Advances in AI at Northeastern Virtual Meetup</title>
      <dc:creator>Jimmy Guerrero</dc:creator>
      <pubDate>Tue, 24 Mar 2026 18:46:10 +0000</pubDate>
      <link>https://dev.to/voxel51/march-26-advances-in-ai-at-northeastern-virtual-meetup-32gb</link>
      <guid>https://dev.to/voxel51/march-26-advances-in-ai-at-northeastern-virtual-meetup-32gb</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fay0caeg842pbark5g50n.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fay0caeg842pbark5g50n.png" alt=" " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Join us on March 26 at 9 AM Pacific for the Advances in AI at Northeastern University virtual Meetup! &lt;/p&gt;

&lt;p&gt;&lt;a href="https://voxel51.com/events/advances-in-ai-at-northeastern-university-march-26-2026" rel="noopener noreferrer"&gt;&lt;strong&gt;Register for the Zoom&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This special event will feature some of the latest research happening at NEU.&lt;/p&gt;

&lt;p&gt;Talks will include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Physical AI Research (PAIR) Center: Foundational Pairing of Digital Intelligence &amp;amp; Physical World Deployments&lt;/strong&gt; - Edmund Yeh at Northeastern University&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Grounding Visual AI Models in Real-World Physics&lt;/strong&gt; - Sarah Ostadabbas and Xiangyu Bai at Northeastern University&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;WorldFormer: Diffusion Transformer World Models with Mixture-of-Experts for Embodied Physical Intelligence&lt;/strong&gt; - Yanzhi Wang at Northeastern University&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Scalable and Efficient Deep Learning: From Understanding to Generation&lt;/strong&gt; - Yitian Zhang at Northeastern University&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>computervision</category>
      <category>machinelearning</category>
      <category>agents</category>
    </item>
    <item>
      <title>This Thursday: March 19 - Women in AI Meetup</title>
      <dc:creator>Jimmy Guerrero</dc:creator>
      <pubDate>Tue, 17 Mar 2026 19:20:00 +0000</pubDate>
      <link>https://dev.to/voxel51/this-thursday-march-19-women-in-ai-meetup-217e</link>
      <guid>https://dev.to/voxel51/this-thursday-march-19-women-in-ai-meetup-217e</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Felwagq9q9ttdane2zuea.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Felwagq9q9ttdane2zuea.png" alt=" " width="800" height="420"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Join us on March 19 at 9 AM Pacific for the Women in AI virtual event.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://voxel51.com/events/women-in-ai-meetup-march-19-2026" rel="noopener noreferrer"&gt;&lt;strong&gt;Register for the Zoom&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Talks will include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Towards Reliable Clinical AI: Evaluating Factuality, Robustness, and Real-World Performance of Large Language Models&lt;/strong&gt; - Monica Munnangi at Northeastern University&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Language Diffusion Models&lt;/strong&gt; - Jayita Bhattacharyya at Big4 Consulting&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Neural BRDFs: Learning Compact Representations for Material Appearance&lt;/strong&gt; - Manushree Gangwar at Voxel51&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Supercharging AI agents with evaluations&lt;/strong&gt; - Priya Venkat at Intuit&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>computervision</category>
      <category>machinelearning</category>
      <category>datascience</category>
    </item>
  </channel>
</rss>
