<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Frank</title>
    <description>The latest articles on DEV Community by Frank (@biaus_).</description>
    <link>https://dev.to/biaus_</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F406551%2Fee6a4e37-aebb-4d33-8c9a-276f8d51ceda.png</url>
      <title>DEV Community: Frank</title>
      <link>https://dev.to/biaus_</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/biaus_"/>
    <language>en</language>
    <item>
      <title>Is it possible to build a marketplace app in Python?</title>
      <dc:creator>Frank</dc:creator>
      <pubDate>Mon, 20 Jul 2020 16:03:26 +0000</pubDate>
      <link>https://dev.to/biaus_/is-it-possible-to-build-a-marketplace-app-in-python-2cd6</link>
      <guid>https://dev.to/biaus_/is-it-possible-to-build-a-marketplace-app-in-python-2cd6</guid>
      <description>&lt;p&gt;What other languages and tools would you use?&lt;/p&gt;

&lt;p&gt;I believe js would be easier, also you would need to know CSS, HTML, etc. &lt;/p&gt;

</description>
      <category>help</category>
      <category>python</category>
      <category>discuss</category>
      <category>javascript</category>
    </item>
    <item>
      <title>What was the Python library that blew your mind? </title>
      <dc:creator>Frank</dc:creator>
      <pubDate>Sat, 18 Jul 2020 05:02:18 +0000</pubDate>
      <link>https://dev.to/biaus_/what-was-the-python-library-that-blew-your-mind-4om9</link>
      <guid>https://dev.to/biaus_/what-was-the-python-library-that-blew-your-mind-4om9</guid>
      <description>&lt;p&gt;I mean That library that you would run to show it to a friend (if you had any) &lt;/p&gt;

</description>
      <category>python</category>
    </item>
    <item>
      <title>Need help python text matching</title>
      <dc:creator>Frank</dc:creator>
      <pubDate>Thu, 09 Jul 2020 23:25:05 +0000</pubDate>
      <link>https://dev.to/biaus_/need-help-python-text-matching-7l0</link>
      <guid>https://dev.to/biaus_/need-help-python-text-matching-7l0</guid>
      <description>&lt;p&gt;Hi DEV community!&lt;/p&gt;

&lt;p&gt;I have a project idea that I would like to dive into but I need your help defining some major points.&lt;/p&gt;

&lt;p&gt;The project is about matching strings to the most similar string in a different dataset(tables) in Python.&lt;/p&gt;

&lt;p&gt;For example:&lt;br&gt;
Table 1&lt;br&gt;
&lt;strong&gt;name&lt;/strong&gt;/&lt;strong&gt;city, country&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Mark Smith&lt;/em&gt;/&lt;em&gt;New York City, USA&lt;/em&gt;&lt;br&gt;
&lt;em&gt;Mirko Smirk&lt;/em&gt;/&lt;em&gt;New York City, USA&lt;/em&gt;&lt;br&gt;
&lt;em&gt;John E. Doe&lt;/em&gt;/&lt;em&gt;Paris, USA&lt;/em&gt;&lt;br&gt;
&lt;em&gt;Jane Doe&lt;/em&gt;/&lt;em&gt;Paris, France&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Table 2&lt;br&gt;
&lt;strong&gt;name&lt;/strong&gt;/&lt;strong&gt;city, country&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Mirko S.&lt;/em&gt;/&lt;em&gt;NYC, US&lt;/em&gt;&lt;br&gt;
&lt;em&gt;Mark S.&lt;/em&gt;/&lt;em&gt;NYC, US&lt;/em&gt;&lt;br&gt;
&lt;em&gt;Jane D&lt;/em&gt;/&lt;em&gt;Paris, France&lt;/em&gt;&lt;br&gt;
&lt;em&gt;J. Doe&lt;/em&gt;/&lt;em&gt;Paris, US&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The idea is to match each person with itself on the other table.&lt;br&gt;
For example row 1 of the first table would correctly match with row 2 of the second table.&lt;/p&gt;

&lt;p&gt;As you can see there are no perfect matches and sometimes it may match the first column but not the second. The second column should have more weight in the decision.&lt;/p&gt;

&lt;p&gt;I first thought of an approach based on &lt;strong&gt;Levenshtein Distance&lt;/strong&gt; to calculate the differences between sequences using FuzzyWuzzy package. But I encountered many issues like the following:&lt;br&gt;
&lt;strong&gt;Table 1&lt;/strong&gt;: &lt;br&gt;
&lt;em&gt;Franklin Delano Roosevelt&lt;/em&gt;&lt;br&gt;
&lt;em&gt;Franklin Da Turtle&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Table 2&lt;/strong&gt;: &lt;br&gt;
&lt;em&gt;FDR&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Levenshtein Distance would say that &lt;em&gt;Franklin Da Turtle&lt;/em&gt; is a better match than &lt;em&gt;Franklin Delano Roosevelt&lt;/em&gt; for &lt;em&gt;FDR&lt;/em&gt; since it has fewer characters to compare.&lt;/p&gt;

&lt;p&gt;In real-life data, I will have many columns to validate the accuracy of the decision but I'm stuck anyway.&lt;/p&gt;

&lt;p&gt;Is there a better approach? I first thought of some vector usage of NLP but isn't my area of expertise.&lt;/p&gt;

&lt;p&gt;Any ideas will be well received. &lt;/p&gt;

&lt;p&gt;Leave your comment if you know something better than FuzzyWuzzy approach.&lt;/p&gt;

</description>
      <category>help</category>
      <category>python</category>
    </item>
  </channel>
</rss>
