<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Jess Edmund Fan</title>
    <description>The latest articles on DEV Community by Jess Edmund Fan (@jef1056).</description>
    <link>https://dev.to/jef1056</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F391384%2F11bcb146-515e-405e-87c8-dd0630178561.png</url>
      <title>DEV Community: Jess Edmund Fan</title>
      <link>https://dev.to/jef1056</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/jef1056"/>
    <language>en</language>
    <item>
      <title>Jade AI</title>
      <dc:creator>Jess Edmund Fan</dc:creator>
      <pubDate>Thu, 21 May 2020 01:42:29 +0000</pubDate>
      <link>https://dev.to/jef1056/jade-ai-549i</link>
      <guid>https://dev.to/jef1056/jade-ai-549i</guid>
      <description>&lt;h3&gt;
  
  
  What is Jade AI?
&lt;/h3&gt;

&lt;p&gt;Jade's a Contextual chatbot, based on modern NLP techniques. The primary driver of the most recent version of Jade is based on OpenAI's GPT-2 (see &lt;a href="https://openai.com/blog/gpt-2-1-5b-release"&gt;this post&lt;/a&gt;). GPT-2 uses a tranformer model to effectively and efficiently generate tokenized sequences.&lt;/p&gt;

&lt;h3&gt;
  
  
  Demo &amp;amp; Stats
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://top.gg/bot/410253782828449802"&gt;&lt;br&gt;
  &lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--tsFBW80y--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://top.gg/api/widget/410253782828449802.svg" alt="Jade"&gt;&lt;br&gt;
&lt;/a&gt;&lt;br&gt;
You can add the bot to your server, and run &lt;code&gt;JDT [message]&lt;/code&gt; (currently under a different prefix than normal due to testing) or &lt;code&gt;JD [message]&lt;/code&gt; (for her previous, v5, version based on K-means &amp;amp; beam search)&lt;/p&gt;
&lt;h3&gt;
  
  
  Pics or it didn't happen.
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--mZNY4pzX--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/2fl00iyl1238egz8vlyx.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--mZNY4pzX--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/2fl00iyl1238egz8vlyx.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h3&gt;
  
  
  Link to Code
&lt;/h3&gt;

&lt;p&gt;Note: the codebase is currently incomplete!!! The trained model and training script will be uploaded soon.&lt;br&gt;
&lt;/p&gt;
&lt;div class="ltag-github-readme-tag"&gt;
  &lt;div class="readme-overview"&gt;
    &lt;h2&gt;
      &lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--vWogaON8--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://practicaldev-herokuapp-com.freetls.fastly.net/assets/github-logo-28d89282e0daa1e2496205e2f218a44c755b0dd6536bbadf5ed5a44a7ca54716.svg" alt="GitHub logo"&gt;
      &lt;a href="https://github.com/JEF1056"&gt;
        JEF1056
      &lt;/a&gt; / &lt;a href="https://github.com/JEF1056/Jv6"&gt;
        Jv6
      &lt;/a&gt;
    &lt;/h2&gt;
    &lt;h3&gt;
      Jade V6 - Based on GPT
    &lt;/h3&gt;
  &lt;/div&gt;
&lt;/div&gt;


&lt;h3&gt;
  
  
  A Bit of History
&lt;/h3&gt;

&lt;p&gt;I started the Jade project in my freshman year in high school, knowing absolutely nothing about NLP (natural language processing) and having done very little research on chatbots or artificial intelligence algorithms. &lt;br&gt;
The very first version was really simple: it was a text file with a ton of questions and answers, and I used python to try and match the input sentence to the sentence with the most similar words in the text file, then returned the next line.&lt;br&gt;
Over the years, I've explored LSTM, Seq2Seq, and K-means (with beam search) algorithms to improve the accuracy of the Jade chatbot.&lt;/p&gt;

&lt;h3&gt;
  
  
  Today
&lt;/h3&gt;

&lt;p&gt;Over the quarrantine period, Jade v6 was completed, acheiving a suprising 88% accuracy on new data (taken from discord). It is able to run on low-end hardware, (1 vCPU, 1.25gb of memory), and is able to maintain context/learning between discord servers without mixing up conversations.&lt;br&gt;
The program runs on Python, uses the discord.py (1.3.3) library for discord interactions, pytorch for machine learning, and GPT-2 as the framework for the neural network.&lt;br&gt;
Trained on my own &lt;a href="https://www.kaggle.com/jef1056/anime-subtitles"&gt;open-source dataset&lt;/a&gt;, Jade is out there and ready to converse over a lot of interesting things... &lt;br&gt;&lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--PAHNTHCC--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/8t1kpnmnr9idsz0yr3mh.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--PAHNTHCC--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/8t1kpnmnr9idsz0yr3mh.png" alt="ono" width="340"&gt;&lt;/a&gt;&lt;/p&gt;

</description>
      <category>octograd2020</category>
      <category>machinelearning</category>
      <category>devgrad2020</category>
      <category>showdev</category>
    </item>
  </channel>
</rss>
