<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Audrey</title>
    <description>The latest articles on DEV Community by Audrey (@audrey2001).</description>
    <link>https://dev.to/audrey2001</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F731660%2Ff436ab65-33d5-41d9-8a7d-61de3c2d7bd1.jpeg</url>
      <title>DEV Community: Audrey</title>
      <link>https://dev.to/audrey2001</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/audrey2001"/>
    <language>en</language>
    <item>
      <title>Transcribe Now - : Unlocking Cross-Cultural Language Understanding</title>
      <dc:creator>Audrey</dc:creator>
      <pubDate>Fri, 30 Jun 2023 20:43:35 +0000</pubDate>
      <link>https://dev.to/audrey2001/transcribe-now-unlocking-cross-cultural-language-understanding-59dk</link>
      <guid>https://dev.to/audrey2001/transcribe-now-unlocking-cross-cultural-language-understanding-59dk</guid>
      <description>&lt;h2&gt;
  
  
  Overview
&lt;/h2&gt;

&lt;p&gt;In a world of diverse cultures, travelers face the challenge of cross-cultural language understanding. Language barriers can leave them feeling disconnected, relying on non-verbal cues to navigate conversations. Curiosity and the desire to bridge the gap intensify, yearning to unravel rich traditions and stories. This universal struggle reminds us that language holds the key to profound understanding, empathy, and meaningful connections. Embrace the beauty of cultural diversity as we break these barriers together with Transcribe Now Application.&lt;/p&gt;

&lt;h2&gt;
  
  
  Problem Statement
&lt;/h2&gt;

&lt;p&gt;In today's globalized world, cross-cultural communication poses a significant challenge for travelers and individuals seeking to connect with people from different backgrounds. Language barriers hinder effective interaction, leading to missed opportunities for meaningful conversations, cultural understanding, and building connections.&lt;/p&gt;

&lt;h2&gt;
  
  
  Proposed Solution
&lt;/h2&gt;

&lt;p&gt;Transcribe Now is a simple web application powered by advanced speech-to-text technology and machine learning algorithms,travelers can embark on their journeys with confidence, knowing they have a powerful tool at their fingertips. &lt;br&gt;
They can easily communicate, ask questions, and engage in meaningful discussions, regardless of language differences. The app empowers users to unlock the full potential of cross-cultural experiences, fostering deeper connections, and facilitating genuine cultural exchange.&lt;/p&gt;

&lt;p&gt;The highlight- Transcribe Now makes it easier than ever! Enjoy seamless communication without the need for authentication. Experience the freedom of instant cross-cultural understanding at your fingertips.&lt;/p&gt;

&lt;h2&gt;
  
  
  Tech Stack
&lt;/h2&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;NodeJs, React, Transformer tools
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;




&lt;h2&gt;
  
  
  Our Approach
&lt;/h2&gt;

&lt;p&gt;Transcribe Now has two approach&lt;/p&gt;

&lt;p&gt;&lt;u&gt;&lt;strong&gt;The Frontend&lt;/strong&gt;&lt;/u&gt;&lt;br&gt;
This is our user interface where the User get to see the homepage and on clicking "Get Started button" they are redirected to another page where they can record an audio in get the transcripted words displayed in real-time. It also has a start and stop button to control the recordings. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--aTqUHZR7--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/as60174t7ze8u58lhaem.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--aTqUHZR7--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/as60174t7ze8u58lhaem.png" alt="the Frontend view" width="800" height="409"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;stage 1: collect recordings&lt;/strong&gt;&lt;br&gt;
The app uses browser MediaAPI to allow recordings  while in the browser, By clicking the start button, the recordings start automatically and the user can have control of stopping it to.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;stage 2: Generate chunks of audio file&lt;/strong&gt;&lt;br&gt;
Splitting it into chunks of audio file so it could be passed as a "blobUrl" to the backend for transcription.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;stage 3: Pass to the Frontend&lt;/strong&gt;&lt;br&gt;
Created a function that accepts the generated chunks of audio file as an argument and passing it as a formdata, Then making a POST request to the specified url for transcription.&lt;/p&gt;

&lt;p&gt;&lt;u&gt;&lt;strong&gt;The Backend&lt;/strong&gt;&lt;/u&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;stage 1: Endpoint creation&lt;/strong&gt;&lt;br&gt;
created an endpoint using Express.js server that handles a POST request and sends a request to an endpoint&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;stage 2 : Connect the Data from the frontend&lt;/strong&gt;&lt;br&gt;
Grab the url from the frontend "AudioBlob" through the API request body and converts to an acceptable format for transcription.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;stage 3: Transcription&lt;/strong&gt;&lt;br&gt;
Transcribes the recordings using Whisper Transformer model  and sends the transcripted words to the Frontend.&lt;/p&gt;

&lt;h2&gt;
  
  
  Outcomes and Learnings During the Process
&lt;/h2&gt;

&lt;p&gt;It was not an easy Journey to be honest, Identifying the team members strengths and weakness and working on a project was one of our biggest challenge. We had to try out a couple of tech stack and AWS services to get the one to suit our interest.&lt;br&gt;
Initially we tried using a Python environment for our backend by maximizing AWS lambda and API gateway and AWS Amplify to help us use the transformer tool but the file size was too large and then we tried Amazon Elastic File System(EFS) but we still could not get the app working.&lt;/p&gt;

&lt;p&gt;Reading through the docs again after taking a lot of time trying to debug our code, we decided to go through with Nodejs because that was something we were more familiar with to build the Project.&lt;/p&gt;

&lt;h2&gt;
  
  
  Lessons
&lt;/h2&gt;

&lt;p&gt;For my team, these are the couple of things we learnt while working on the project&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Reading docs- we definitely read lots of them&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Teamwork &lt;br&gt;
showed up when we all tried to bring our strengths and skills to the project&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Brainstorming&lt;br&gt;
We did a lot of that when we tried getting a project idea, why we should with it and how well it suited the project Instructions&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;I am sure every member of the team must have learnt a couple of things from the hackathon experience&lt;/p&gt;

&lt;h2&gt;
  
  
  Way Forward
&lt;/h2&gt;

&lt;p&gt;We hope to modify and refactor our codebase,To help people easily navigate through the codebase. &lt;/p&gt;

&lt;p&gt;For the application, we have our users interest at heart and we definitely hope to integrate more language translation on it.&lt;/p&gt;

</description>
      <category>aihackathon</category>
      <category>learnings</category>
      <category>educationaltool</category>
      <category>travel</category>
    </item>
  </channel>
</rss>
