<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Austin Murray</title>
    <description>The latest articles on DEV Community by Austin Murray (@austinjm121).</description>
    <link>https://dev.to/austinjm121</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3922836%2F78dea9a8-1092-4364-b718-0535534ee648.jpeg</url>
      <title>DEV Community: Austin Murray</title>
      <link>https://dev.to/austinjm121</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/austinjm121"/>
    <language>en</language>
    <item>
      <title>How I Built and Shipped a Production-Ready AI Recommendation System (Nomova.ai)</title>
      <dc:creator>Austin Murray</dc:creator>
      <pubDate>Sun, 10 May 2026 05:10:44 +0000</pubDate>
      <link>https://dev.to/austinjm121/how-i-built-and-shipped-a-production-ready-ai-recommendation-system-nomovaai-3pcp</link>
      <guid>https://dev.to/austinjm121/how-i-built-and-shipped-a-production-ready-ai-recommendation-system-nomovaai-3pcp</guid>
      <description>&lt;p&gt;Overview&lt;/p&gt;

&lt;p&gt;Nomova.ai is an AI/ML-powered vacation planning platform designed to generate personalized travel recommendations using predictive modeling and cloud infrastructure.&lt;/p&gt;

&lt;p&gt;The goal of the project was to explore how modern AI systems can be built end-to-end—from model development to production deployment—while keeping the system scalable, maintainable, and practical in a real-world environment.&lt;/p&gt;

&lt;p&gt;Problem&lt;/p&gt;

&lt;p&gt;Travel planning is typically fragmented across multiple platforms. Users jump between search engines, booking sites, and review platforms, then manually combine information into a decision.&lt;/p&gt;

&lt;p&gt;This creates friction in three key ways:&lt;/p&gt;

&lt;p&gt;High cognitive load&lt;br&gt;
Inconsistent recommendations across sources&lt;br&gt;
Lack of personalization&lt;/p&gt;

&lt;p&gt;Nomova.ai was built to address this by consolidating the experience into a single, AI-driven recommendation system.&lt;/p&gt;

&lt;p&gt;System Design&lt;/p&gt;

&lt;p&gt;The system was designed as a cloud-native SaaS architecture with a clear separation between data processing, machine learning, and serving infrastructure.&lt;/p&gt;

&lt;p&gt;Machine Learning Layer&lt;/p&gt;

&lt;p&gt;The core predictive functionality was built using PyTorch, with models deployed through Vertex AI.&lt;/p&gt;

&lt;p&gt;This layer handles:&lt;/p&gt;

&lt;p&gt;Learning user preference patterns&lt;br&gt;
Ranking travel destinations and recommendations&lt;br&gt;
Supporting iterative model experimentation and tuning&lt;/p&gt;

&lt;p&gt;The model design focuses on combining user signals with contextual inputs to produce ranked outputs.&lt;/p&gt;

&lt;p&gt;Feature Processing Layer&lt;/p&gt;

&lt;p&gt;Raw user interaction data is transformed into structured signals before being passed into the model.&lt;/p&gt;

&lt;p&gt;Key feature groups include:&lt;/p&gt;

&lt;p&gt;Behavioral interaction history&lt;br&gt;
Destination affinity signals&lt;br&gt;
Contextual constraints such as budget, duration, and preferences&lt;/p&gt;

&lt;p&gt;This separation ensures the model remains decoupled from raw input complexity and improves maintainability.&lt;/p&gt;

&lt;p&gt;Cloud Deployment&lt;/p&gt;

&lt;p&gt;The system is deployed using Vertex AI for model serving and infrastructure management.&lt;/p&gt;

&lt;p&gt;This setup enables:&lt;/p&gt;

&lt;p&gt;Scalable inference endpoints&lt;br&gt;
Managed deployment workflows&lt;br&gt;
Simplified model lifecycle management&lt;/p&gt;

&lt;p&gt;This allowed the system to move from experimental development into a production-ready environment without restructuring core logic.&lt;/p&gt;

&lt;p&gt;Engineering Decisions&lt;br&gt;
Separation of Concerns&lt;/p&gt;

&lt;p&gt;The ML layer, feature pipeline, and serving infrastructure were intentionally separated to allow independent development and iteration.&lt;/p&gt;

&lt;p&gt;Cloud-Native Architecture&lt;/p&gt;

&lt;p&gt;Using Vertex AI reduced operational overhead and allowed the system to scale inference workloads without manual infrastructure management.&lt;/p&gt;

&lt;p&gt;Iterative Model Development&lt;/p&gt;

&lt;p&gt;The system was designed to support experimentation, allowing models to be updated and evaluated without disrupting production workflows.&lt;/p&gt;

&lt;p&gt;Challenges&lt;br&gt;
Cold Start Problem&lt;/p&gt;

&lt;p&gt;New users lack historical interaction data, requiring fallback logic to generate meaningful initial recommendations.&lt;/p&gt;

&lt;p&gt;Model Generalization&lt;/p&gt;

&lt;p&gt;Balancing personalization with general travel relevance required careful feature selection and tuning.&lt;/p&gt;

&lt;p&gt;Production Transition&lt;/p&gt;

&lt;p&gt;Moving from local development to a deployed system required restructuring inference flows and ensuring consistency between environments.&lt;/p&gt;

&lt;p&gt;Outcome&lt;/p&gt;

&lt;p&gt;Nomova.ai was successfully delivered as a production-ready system and handed off following completion of its core machine learning and infrastructure components.&lt;/p&gt;

&lt;p&gt;The project demonstrates experience in:&lt;/p&gt;

&lt;p&gt;Designing scalable ML systems&lt;br&gt;
Building cloud-native architectures&lt;br&gt;
Moving models from development to production&lt;br&gt;
Structuring systems for long-term maintainability&lt;br&gt;
Stack&lt;br&gt;
PyTorch&lt;br&gt;
Google Vertex AI&lt;br&gt;
Google Cloud Platform&lt;br&gt;
Python-based ML pipelines&lt;/p&gt;

</description>
      <category>ai</category>
      <category>architecture</category>
      <category>machinelearning</category>
      <category>showdev</category>
    </item>
  </channel>
</rss>
