<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: amanda hernandez</title>
    <description>The latest articles on DEV Community by amanda hernandez (@amandahernandez).</description>
    <link>https://dev.to/amandahernandez</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F169441%2F2b6f1568-4795-4074-9372-7f9716bef446.jpeg</url>
      <title>DEV Community: amanda hernandez</title>
      <link>https://dev.to/amandahernandez</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/amandahernandez"/>
    <language>en</language>
    <item>
      <title>Deepgram x DEV Hackathon - Treehouse - Translating Audio Files</title>
      <dc:creator>amanda hernandez</dc:creator>
      <pubDate>Mon, 11 Apr 2022 23:19:23 +0000</pubDate>
      <link>https://dev.to/amandahernandez/deepgram-x-dev-hackathon-treehouse-translating-audio-files--g20</link>
      <guid>https://dev.to/amandahernandez/deepgram-x-dev-hackathon-treehouse-translating-audio-files--g20</guid>
      <description>&lt;h3&gt;
  
  
  Overview of My Submission
&lt;/h3&gt;

&lt;p&gt;Given speech in one language, this UI/API generates a translation as either text or audio. &lt;/p&gt;

&lt;p&gt;This project focused on building out the API by stitching together the Deepgram API (STT, speech to text), the Google Translate API, and the WellSaid Labs API (TTS, text to speech). &lt;/p&gt;
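
&lt;p&gt;The three-stage flow described above (speech to text, translation, text to speech) could be sketched as follows. This is a minimal JavaScript sketch, not the project's actual code: the names translateAudio, stages, targetLanguage, and withAudio are hypothetical, and each stage is passed in as a function so that no specific Deepgram, Google, or WellSaid Labs call is assumed. &lt;/p&gt;

```javascript
// Hypothetical sketch: every stage is injected, so no vendor SDK call is assumed.
// In a real app the three functions would wrap Deepgram (STT),
// Google Translate, and WellSaid Labs (TTS) respectively.
async function translateAudio(audio, stages, options = {}) {
  // English as the default target is taken from the write-up; the option name is an assumption.
  const targetLanguage = options.targetLanguage ?? "en";
  // Stands in for a feature flag that turns audio generation on or off.
  const withAudio = options.withAudio ?? false;

  const transcript = await stages.transcribe(audio);
  const translation = await stages.translate(transcript, targetLanguage);
  const speech = withAudio ? await stages.synthesize(translation) : null;

  return { transcript, translation, speech };
}

// Stub stages for a dry run; they return canned values instead of calling any service.
const stubStages = {
  transcribe: async () => "hola mundo",
  translate: async (text, lang) => (lang === "en" ? "hello world" : text),
  synthesize: async (text) => Buffer.from(text, "utf8"),
};
```

&lt;p&gt;Passing the stages in keeps the chain testable with stubs, and the withAudio option lets the text translation run on its own when audio generation is switched off. &lt;/p&gt;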

&lt;p&gt;Applications built on this API can help users understand audio in another language that does not already come with captions or a translation. &lt;/p&gt;

&lt;p&gt;Future Considerations: &lt;br&gt;
UI&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The goal of this project was to focus on the API rather than the UI. With more time, I'd prefer to swap out this Next.js UI in favor of a Google Chrome extension that listens to whatever audio is playing in a browser tab and initiates the translation. &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;API&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Google Translate can translate into many languages other than English, but for now the target language defaults to English until we can access WellSaid Labs voice actors in other languages. &lt;/li&gt;
&lt;li&gt;Detecting voice style in the audio would be another awesome improvement, so that we could select a voice actor that "matches" the voices in the input audio. For now we default to a single WellSaid voice actor when we could take advantage of the 50+ voices that are available. &lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Submission Category:
&lt;/h3&gt;

&lt;p&gt;Accessibility Advocates&lt;/p&gt;

&lt;h3&gt;
  
  
  Link to Code on GitHub
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://github.com/jumpmanda/treehouse"&gt;https://github.com/jumpmanda/treehouse&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Additional Resources / Info
&lt;/h3&gt;

&lt;p&gt;An Example &lt;br&gt;
(Please note that I have a feature flag disabled so the example below will show the translation part of this app, but not the audio generation part.) &lt;/p&gt;

&lt;p&gt;In this screenshot, we have uploaded an audio clip (taken from an interview with Selena Quintanilla) that is originally in Spanish. Below the button, we display the text that was transcribed with Deepgram and translated with Google. &lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--TyDkDJi6--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/wdqvp0zk4s7lbl005ury.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--TyDkDJi6--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/wdqvp0zk4s7lbl005ury.png" alt="Treehouse UI with steps on how to use service" width="739" height="673"&gt;&lt;/a&gt;&lt;br&gt;
Here is the output from the server logs: &lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--rewFbh5J--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/dwueidw5zi622aj6b1kq.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--rewFbh5J--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/dwueidw5zi622aj6b1kq.png" alt="Output logs from API" width="800" height="574"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Sample Audio with &lt;a href="https://wellsaidlabs.com/"&gt;WellSaid Labs&lt;/a&gt; Synthetic Voice (Alana B.) hosted here:&lt;/p&gt;

&lt;p&gt;&lt;a href="http://www.sndup.net/y5qj"&gt;http://www.sndup.net/y5qj&lt;/a&gt;&lt;/p&gt;

</description>
      <category>hackwithdg</category>
      <category>nextjs</category>
      <category>node</category>
      <category>tts</category>
    </item>
    <item>
      <title>What is your "waiting to load" noise/response?</title>
      <dc:creator>amanda hernandez</dc:creator>
      <pubDate>Fri, 18 Sep 2020 18:04:11 +0000</pubDate>
      <link>https://dev.to/amandahernandez/what-is-your-waiting-to-load-noise-response-gd7</link>
      <guid>https://dev.to/amandahernandez/what-is-your-waiting-to-load-noise-response-gd7</guid>
      <description>&lt;p&gt;Of course this only happens when we're demo-ing. We're walking through our demo and at any point something is taking a while to load, maybe more than we expected. So we fill in the silence with a "doo doo do..." or a "just give it a sec..". &lt;/p&gt;

&lt;p&gt;What's your go-to response during this short (long?) period of time? Or how else do you navigate unexpected lag time/error during a demo?&lt;/p&gt;

</description>
      <category>discuss</category>
      <category>softskills</category>
    </item>
  </channel>
</rss>
