<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Галина Садовская</title>
    <description>The latest articles on DEV Community by Галина Садовская (@__678da4d0).</description>
    <link>https://dev.to/__678da4d0</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3678686%2F9fddf1ce-670b-4af5-98d5-65623ca8382c.png</url>
      <title>DEV Community: Галина Садовская</title>
      <link>https://dev.to/__678da4d0</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/__678da4d0"/>
    <language>en</language>
    <item>
      <title>AI Look-Alike Search for OF Creators — Need Advice on Better Face Models</title>
      <dc:creator>Галина Садовская</dc:creator>
      <pubDate>Thu, 25 Dec 2025 19:31:23 +0000</pubDate>
      <link>https://dev.to/__678da4d0/ai-look-alike-search-for-of-creators-need-advice-on-better-face-models-58dc</link>
      <guid>https://dev.to/__678da4d0/ai-look-alike-search-for-of-creators-need-advice-on-better-face-models-58dc</guid>
      <description>&lt;h1&gt;
  
  
  AI Look-Alike Search for OF Creators — Need Advice on Better Face Models
&lt;/h1&gt;

&lt;p&gt;I’m currently building an &lt;strong&gt;AI-based face similarity (look-alike) search for OF models&lt;/strong&gt; as part of a real-world side project.&lt;/p&gt;

&lt;p&gt;The dataset contains &lt;strong&gt;100,000+ public OF model images&lt;/strong&gt;, and the goal is to help users discover &lt;strong&gt;visually similar OF models&lt;/strong&gt; based on facial features rather than usernames or text-based search.&lt;/p&gt;

&lt;p&gt;This is &lt;strong&gt;not identity verification&lt;/strong&gt; — the focus is purely on &lt;strong&gt;visual similarity&lt;/strong&gt;.&lt;/p&gt;




&lt;h2&gt;
  
  
  What I’m Building (Quick Overview)
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Users upload an image (reference photo / celebrity image)&lt;/li&gt;
&lt;li&gt;The system finds &lt;strong&gt;OF models with similar facial characteristics&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Results are ranked using &lt;strong&gt;face embeddings + vector similarity search&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Everything currently runs on &lt;strong&gt;CPU&lt;/strong&gt;, but I’m considering a move to &lt;strong&gt;GPU&lt;/strong&gt; for scale and experimentation&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  What I’m Building (More Detail)
&lt;/h2&gt;

&lt;p&gt;The system allows users to upload an image and receive a list of &lt;strong&gt;OF models with similar facial characteristics&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;The intent is to support &lt;strong&gt;visual discovery&lt;/strong&gt;, where perceived similarity matters more than exact identity matching.&lt;/p&gt;




&lt;h2&gt;
  
  
  Key Constraints
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Similarity over identity&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
The system ranks faces by perceived similarity (look-alike matching), not by strict identity verification.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Low tolerance for false positives&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
Returning visually different faces as “similar” is more harmful than missing a potential match.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Real-world images&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
The dataset consists of non-studio images with varying lighting, poses, resolutions, and overall quality.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Scalability&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
The solution needs to scale beyond &lt;strong&gt;100k+ images&lt;/strong&gt; without significant drops in accuracy or performance.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Current Pipeline (CPU-Based)
&lt;/h2&gt;

&lt;p&gt;At the moment, the entire pipeline runs on &lt;strong&gt;CPU only&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;The setup looks like this:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Face detection and alignment&lt;/li&gt;
&lt;li&gt;Feature extraction using a pre-trained face model&lt;/li&gt;
&lt;li&gt;Storing embeddings in a vector index&lt;/li&gt;
&lt;li&gt;Nearest-neighbor search using cosine similarity&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;At this scale, the system works reasonably well, but both &lt;strong&gt;accuracy and performance&lt;/strong&gt; are starting to become limiting factors.&lt;/p&gt;




&lt;h2&gt;
  
  
  Current Model Setup (InsightFace)
&lt;/h2&gt;

&lt;p&gt;Face embeddings are currently generated using &lt;strong&gt;InsightFace&lt;/strong&gt;, specifically the &lt;strong&gt;&lt;code&gt;buffalo_l&lt;/code&gt;&lt;/strong&gt; model bundle.&lt;/p&gt;

&lt;p&gt;The pipeline includes:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Face detection and alignment via InsightFace&lt;/li&gt;
&lt;li&gt;Feature extraction using the &lt;code&gt;buffalo_l&lt;/code&gt; model&lt;/li&gt;
&lt;li&gt;Embeddings stored for similarity search&lt;/li&gt;
&lt;li&gt;Cosine similarity for ranking similar faces&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This provides a solid baseline, but for &lt;strong&gt;look-alike matching&lt;/strong&gt;, small inaccuracies are very noticeable.&lt;/p&gt;




&lt;h2&gt;
  
  
  Where the System Struggles
&lt;/h2&gt;

&lt;p&gt;As the dataset grows, several issues become more apparent:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Visually similar faces sometimes rank lower than expected&lt;/li&gt;
&lt;li&gt;Different individuals with shared facial traits can appear as false positives&lt;/li&gt;
&lt;li&gt;Lighting, pose, and image quality introduce noise&lt;/li&gt;
&lt;li&gt;CPU inference becomes a bottleneck during re-indexing and experimentation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Because this is a &lt;strong&gt;look-alike use case&lt;/strong&gt;, even small errors can significantly affect perceived quality.&lt;/p&gt;




&lt;h2&gt;
  
  
  CPU vs GPU — Is the Move Worth It?
&lt;/h2&gt;

&lt;p&gt;I’m planning to migrate the pipeline to &lt;strong&gt;GPU-based inference&lt;/strong&gt;, but I want to make sure the model choice justifies the move.&lt;/p&gt;

&lt;p&gt;Some of the questions I’m evaluating:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Which face models provide the best results for &lt;strong&gt;visual similarity&lt;/strong&gt;, not identity recognition?&lt;/li&gt;
&lt;li&gt;Does GPU inference unlock &lt;strong&gt;meaningfully better accuracy&lt;/strong&gt;, or is it mainly a speed improvement?&lt;/li&gt;
&lt;li&gt;Are there models that are simply not practical to run on CPU at this scale?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If I’m going to reprocess &lt;strong&gt;100k+ OF model images&lt;/strong&gt;, I want to do it with the right model.&lt;/p&gt;




&lt;h2&gt;
  
  
  What I’m Looking for in a Better Face Model
&lt;/h2&gt;

&lt;p&gt;I’m particularly interested in models that:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Produce &lt;strong&gt;high-quality embeddings&lt;/strong&gt; for similarity search&lt;/li&gt;
&lt;li&gt;Perform well on non-ideal, real-world images&lt;/li&gt;
&lt;li&gt;Scale efficiently beyond 100k images&lt;/li&gt;
&lt;li&gt;Benefit from GPU acceleration&lt;/li&gt;
&lt;li&gt;Can be fine-tuned (or perform well out of the box) for look-alike matching&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;I’m open to both &lt;strong&gt;open-source&lt;/strong&gt; and &lt;strong&gt;commercial&lt;/strong&gt; solutions.&lt;/p&gt;




&lt;h2&gt;
  
  
  Real-World Context
&lt;/h2&gt;

&lt;p&gt;This work is part of a discovery platform where users can upload an image and find visually similar &lt;strong&gt;OF models&lt;/strong&gt; using AI-based face similarity.&lt;/p&gt;

&lt;p&gt;The project is called &lt;strong&gt;Explore.Fans&lt;/strong&gt;, and face similarity search is one of its core components.&lt;/p&gt;

&lt;p&gt;👉 &lt;a href="https://explore.fans" rel="noopener noreferrer"&gt;https://explore.fans&lt;/a&gt;&lt;br&gt;&lt;br&gt;
(Shared only for technical context.)&lt;/p&gt;




&lt;h2&gt;
  
  
  Questions for the Community
&lt;/h2&gt;

&lt;p&gt;If you’ve worked with &lt;strong&gt;face similarity or face recognition models&lt;/strong&gt; at scale, I’d really appreciate your input:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Which models gave you the best results for look-alike similarity?&lt;/li&gt;
&lt;li&gt;Did GPU inference improve accuracy, or mostly performance?&lt;/li&gt;
&lt;li&gt;Any experience fine-tuning models for similarity-based ranking?&lt;/li&gt;
&lt;li&gt;Anything you’d avoid based on real-world experience?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Thanks in advance — happy to share more details if helpful.&lt;br&gt;
Have a wonderfull holiday!&lt;/p&gt;




&lt;h2&gt;
  
  
  References
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;InsightFace (DeepInsight): &lt;a href="https://github.com/deepinsight/insightface" rel="noopener noreferrer"&gt;https://github.com/deepinsight/insightface&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;




</description>
      <category>ai</category>
      <category>deeplearning</category>
      <category>help</category>
    </item>
  </channel>
</rss>
