<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Karina Kato</title>
    <description>The latest articles on DEV Community by Karina Kato (@karinakato).</description>
    <link>https://dev.to/karinakato</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F837379%2Fccd766dc-c1d1-4e18-b967-3b48317f8671.jpeg</url>
      <title>DEV Community: Karina Kato</title>
      <link>https://dev.to/karinakato</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/karinakato"/>
    <language>en</language>
    <item>
      <title>Imagens Geradas por Inteligência Artificial</title>
      <dc:creator>Karina Kato</dc:creator>
      <pubDate>Sat, 10 Sep 2022 22:44:56 +0000</pubDate>
      <link>https://dev.to/karinakato/imagens-geradas-por-inteligencia-artificial-4olf</link>
      <guid>https://dev.to/karinakato/imagens-geradas-por-inteligencia-artificial-4olf</guid>
      <description>&lt;p&gt;É impressionante o poder que a tecnologia atingiu!&lt;/p&gt;

&lt;p&gt;Essas imagens a seguir são geradas por inteligência artificial. Dado um texto com a descrição, ela gera automaticamente a imagem. 🤖&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--e4HUJYSb--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/rtm6vhrd2d2ab078my0f.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--e4HUJYSb--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/rtm6vhrd2d2ab078my0f.png" alt="Imagem gerada por IA - Midjourney" width="880" height="570"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;O entendimento do texto não é uma tarefa fácil. Já pensou que quando você descreve algo pode ser difícil entender como as palavras se relacionam e o que elas significam? 📖&lt;/p&gt;

&lt;p&gt;A parte de geração de imagens é outra tarefa extremamente complexa. Nós mesmos podemos ter dificuldades de pensar em como uma descrição se tornaria uma imagem. 🤔&lt;/p&gt;

&lt;p&gt;Há algumas redes neurais generativas de imagem que estão fazendo bastante sucesso hoje e talvez seja interessante você conhecê-las para entender a que nível a tecnologia já evoluiu. Dentre as mais famosas estão Dall.e 2 criada pela Open AI e o Midjourney. Ambas tiveram resultados surpreendentes! 😱&lt;/p&gt;

&lt;p&gt;Gerei as imagens a seguir usando o Midjourney. Um ponto interessante é que é possível passar parâmetros para configurar a imagem resultante gerada. Você não precisa entender de inteligência artificial para usar o produto.&lt;/p&gt;

&lt;p&gt;Na primeira imagem gerei um robô fofo com óculos segurando o ícone do LinkedIn e fundo roxo. Perceba como foi uma descrição específica. Também tentei passar textos mais abstratos como “perseguindo sonhos” ou “o inverno está chegando”. Por fim, testei coisas um pouquinho mais difíceis de serem imaginadas como “um diamante arco-íris em formato de estrela” ou “um vitral de unicórnio “. 🦄&lt;/p&gt;

&lt;p&gt;𝐂𝐨𝐦𝐩𝐚𝐫𝐚𝐭𝐢𝐯𝐨 𝐝𝐞 𝐢𝐦𝐚𝐠𝐞𝐧𝐬 𝐠𝐞𝐫𝐚𝐝𝐚𝐬 𝐩𝐨𝐫 𝐢𝐧𝐭𝐞𝐥𝐢𝐠ê𝐧𝐜𝐢𝐚 𝐚𝐫𝐭𝐢𝐟𝐢𝐜𝐢𝐚𝐥&lt;/p&gt;

&lt;p&gt;Também trouxe um comparativo das imagens geradas pelo 𝐌𝐢𝐝𝐣𝐨𝐮𝐫𝐧𝐞𝐲 e 𝐃𝐀𝐋𝐋.𝐄 2. É interessante ver como os resultados gerados são diferentes e dependentes dos dados que foram alimentados durante o treino da rede.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--MsG1vEoW--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/wxa4wncyfi3vktf2j70e.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--MsG1vEoW--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/wxa4wncyfi3vktf2j70e.png" alt="Black and white shih tzu" width="880" height="572"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--2HKwjM42--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/1j4axh8t6yk13gveyl7y.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--2HKwjM42--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/1j4axh8t6yk13gveyl7y.png" alt="Cutest fire pokemon" width="880" height="572"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--hK_Z9Bwh--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/3v0ls5mpszmuiaitg7gs.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--hK_Z9Bwh--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/3v0ls5mpszmuiaitg7gs.png" alt="Stained glass unicorn" width="880" height="572"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s---tSU_MUZ--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/1ngsixt3esd2qj13tuot.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s---tSU_MUZ--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/1ngsixt3esd2qj13tuot.png" alt="Cute robot" width="880" height="572"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--aYawxwVP--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/b933lyf008bg0ws9ao56.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--aYawxwVP--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/b933lyf008bg0ws9ao56.png" alt="Rainbow diamond star shaped" width="880" height="572"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--uoJVuJsV--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/ij8f6cmh6bov0w9gia36.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--uoJVuJsV--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/ij8f6cmh6bov0w9gia36.png" alt="Chasing dreams" width="880" height="572"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Particularmente gostei muito das imagens criadas pelo Midjourney para temas mais abstratos com arte digital e paisagens. Também adorei o robô fofo de óculos segurando o ícone do LinkedIn com um fundo roxo.&lt;/p&gt;

&lt;p&gt;Já no DALL.E 2, achei incrível o nível de detalhes que a rede neural consegue pegar a partir do texto. É possível passar frases bem complexas. O Shih Tzu branco e preto sorrindo com um chapéu vermelho em estilo de pintura a óleo é sensacional! Também amei o Pokémon de fogo mais fofo gerado. Senti que a rede capturou muito a essência do que é um Pokémon, pois parece uma mistura de um Plusle com a cauda de um Cyndaquil.&lt;/p&gt;

&lt;p&gt;Quais resultados você gostou mais? 😉&lt;/p&gt;

&lt;p&gt;┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄┄&lt;/p&gt;

&lt;p&gt;Me acompanhe no LinkedIn se quiser receber mais posts sobre inteligência artificial, aprendizado de máquina, ciência de dados e carreira. &lt;a href="https://www.linkedin.com/in/karinakato"&gt;https://www.linkedin.com/in/karinakato&lt;/a&gt;&lt;/p&gt;

</description>
      <category>machinelearning</category>
      <category>datascience</category>
      <category>ai</category>
      <category>inteligenciaartificial</category>
    </item>
    <item>
      <title>Create automatic blog posts from videos</title>
      <dc:creator>Karina Kato</dc:creator>
      <pubDate>Sun, 03 Apr 2022 20:36:19 +0000</pubDate>
      <link>https://dev.to/karinakato/create-automatic-blog-posts-from-videos-1c6i</link>
      <guid>https://dev.to/karinakato/create-automatic-blog-posts-from-videos-1c6i</guid>
      <description>&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fipiioaxl3y8sb73oc1i0.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fipiioaxl3y8sb73oc1i0.png" alt="Blog Post Project"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Overview of My Submission
&lt;/h3&gt;

&lt;p&gt;This tutorial will teach you how to create blog posts automatically from videos. If you are a content creator, you may find this very useful to speed up the process of writing new posts using just your own video tutorials.&lt;/p&gt;

&lt;h3&gt;
  
  
  Submission Category:
&lt;/h3&gt;

&lt;p&gt;Analytics Ambassadors&lt;/p&gt;

&lt;h3&gt;
  
  
  Link to Code on GitHub
&lt;/h3&gt;


&lt;div class="ltag-github-readme-tag"&gt;
  &lt;div class="readme-overview"&gt;
    &lt;h2&gt;
      &lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev.to%2Fassets%2Fgithub-logo-5a155e1f9a670af7944dd5e12375bc76ed542ea80224905ecaf878b9157cdefc.svg" alt="GitHub logo"&gt;
      &lt;a href="https://github.com/tiemi" rel="noopener noreferrer"&gt;
        tiemi
      &lt;/a&gt; / &lt;a href="https://github.com/tiemi/automatic-blog-post" rel="noopener noreferrer"&gt;
        automatic-blog-post
      &lt;/a&gt;
    &lt;/h2&gt;
    &lt;h3&gt;
      
    &lt;/h3&gt;
  &lt;/div&gt;
  &lt;div class="ltag-github-body"&gt;
    
&lt;div id="readme" class="md"&gt;
&lt;div class="markdown-heading"&gt;
&lt;h1 class="heading-element"&gt;Automatic Blog Post&lt;/h1&gt;
&lt;/div&gt;
&lt;p&gt;&lt;a href="https://www.python.org/" rel="nofollow noopener noreferrer"&gt;&lt;img src="https://camo.githubusercontent.com/13c939c276cd527b98013e81c17f5188744f156954c4bf86488e07d81de41275/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f6275696c74253230776974682d507974686f6e332d677265656e2e737667" alt="built with Python3"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Automatic blog post is a Python project created to generate automatically blog posts from videos.&lt;/p&gt;
&lt;p&gt;&lt;a rel="noopener noreferrer" href="https://github.com/tiemi/automatic-blog-postimages/architecture.png"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fgithub.com%2Ftiemi%2Fautomatic-blog-postimages%2Farchitecture.png" alt="drawing" width="700"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;The final blog post consists of 5 main components: &lt;strong&gt;title, summary, image, text&lt;/strong&gt; and &lt;strong&gt;keywords&lt;/strong&gt;. The image above represents how this architecture works. As you can see, this is a machine learning project with natural language processing.&lt;/p&gt;
&lt;p&gt;First, we need to process the video to extract the audio. Using the DeepGram API we can do the speech to text. Later, we split the text into paragraphs. For that, we are analyzing the pause between the words to find if the sentences belong to the same paragraph or if it’s a new one.&lt;/p&gt;
&lt;p&gt;Then, we use some pre-trained machine learning models to create the text keywords and summary. We also get the video thumbnail and name, which will be our blog post image and title, respectively.&lt;/p&gt;
&lt;div class="markdown-heading"&gt;
&lt;h2 class="heading-element"&gt;How can I use it?&lt;/h2&gt;
&lt;/div&gt;
&lt;p&gt;The…&lt;/p&gt;
&lt;/div&gt;
  &lt;/div&gt;
  &lt;div class="gh-btn-container"&gt;&lt;a class="gh-btn" href="https://github.com/tiemi/automatic-blog-post" rel="noopener noreferrer"&gt;View on GitHub&lt;/a&gt;&lt;/div&gt;
&lt;/div&gt;


&lt;h3&gt;
  
  
  Additional Resources / Info
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fvb62z9eye2liv2mcz4s0.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fvb62z9eye2liv2mcz4s0.png" alt="Architecture"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  My Deepgram Use-Case
&lt;/h3&gt;

&lt;p&gt;The final blog post consists of 5 main components: &lt;strong&gt;title, summary, image, text&lt;/strong&gt; and &lt;strong&gt;keywords&lt;/strong&gt;. The image above represents how this architecture works. As you can see, this is a machine learning project with natural language processing.&lt;/p&gt;

&lt;p&gt;First, we need to process the video to extract the audio. Using the DeepGram API we can do the speech to text. Later, we split the text into paragraphs. For that, we are analyzing the pause between the words to find if the sentences belong to the same paragraph or if it’s a new one.&lt;/p&gt;

&lt;p&gt;Then, we use some pre-trained machine learning models to create the text keywords and summary. We also get the video thumbnail and name, which will be our blog post image and title, respectively.&lt;/p&gt;

&lt;h3&gt;
  
  
  Getting Started
&lt;/h3&gt;

&lt;p&gt;The first step to run the project is to create a &lt;a href="https://console.deepgram.com/signup" rel="noopener noreferrer"&gt;DeepGram account&lt;/a&gt;. After that, you can generate an API key. On &lt;a href="https://console.deepgram.com/project/" rel="noopener noreferrer"&gt;this page&lt;/a&gt;, just click on the button &lt;strong&gt;Create a New API Key&lt;/strong&gt;. You'll have to choose a name for the key, set permission and set the expiration date.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffwql9j3lkors104lcu25.PNG" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffwql9j3lkors104lcu25.PNG" alt="Image description"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;After you click &lt;strong&gt;Create Key&lt;/strong&gt; button, a new key will be created and it's important that you keep it safe.&lt;/p&gt;

&lt;p&gt;To run the code, you’ll need Python installed. My suggestion is to use &lt;a href="https://www.anaconda.com/" rel="noopener noreferrer"&gt;Anaconda&lt;/a&gt;, which is an open-source Python distribution platform.&lt;/p&gt;

&lt;p&gt;I also recommend that you create a new environment specifically for this project. Check the official tutorial to learn how to download Conda and set up a virtual environment: &lt;a href="https://docs.conda.io/projects/conda/en/latest/user-guide/getting-started.html" rel="noopener noreferrer"&gt;Conda Getting Started&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;To install the Python packages, just &lt;a href="https://github.com/tiemi/automatic-blog-post" rel="noopener noreferrer"&gt;clone the project&lt;/a&gt; and run the following command:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight console"&gt;&lt;code&gt;&lt;span class="go"&gt;pip install -r requirements.txt
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Then, simply run the command:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight console"&gt;&lt;code&gt;&lt;span class="gp"&gt;python generate_blog_post.py --deepgram &amp;lt;api_key&amp;gt;&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="nt"&gt;--video&lt;/span&gt; &amp;lt;youtube_url&amp;gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;em&gt;Note: This is a demonstration tutorial, if you plan to use this in production, I recommend that you use a Key Vault to store the DeepGram API Key.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;After a couple of minutes, you'll see the blog post markdown file in your output directory!&lt;/p&gt;

&lt;h3&gt;
  
  
  Dive into Details
&lt;/h3&gt;

&lt;p&gt;The code:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Downloads the audio from the specified video URL&lt;/li&gt;
&lt;li&gt;Downloads the image from the thumbnail&lt;/li&gt;
&lt;li&gt;Crops the black border of the image&lt;/li&gt;
&lt;li&gt;Saves the processed image&lt;/li&gt;
&lt;li&gt;Generates the text from the audio using DeepGram API&lt;/li&gt;
&lt;li&gt;Process the text to fix the punctuation&lt;/li&gt;
&lt;li&gt;Splits the text into new paragraphs using the median pause between words and the punctuation as a heuristic&lt;/li&gt;
&lt;li&gt;Stores the text&lt;/li&gt;
&lt;li&gt;Gets keywords from the text using KeyBERT&lt;/li&gt;
&lt;li&gt;Gets summary using pretrained &lt;a href="https://huggingface.co/sshleifer/distilbart-cnn-12-6" rel="noopener noreferrer"&gt;Hugging Face DistilBART&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Creates a markdown template&lt;/li&gt;
&lt;li&gt;Fill the markdown template with title, image, summary, keywords and text&lt;/li&gt;
&lt;li&gt;Saves the markdown in the output directory&lt;/li&gt;
&lt;/ol&gt;

&lt;h3&gt;
  
  
  Post-processing
&lt;/h3&gt;

&lt;p&gt;If you need some text post-processing, you can use the edit_blog_post module. This can be useful if your video has some domain-specific words. For example, the abbreviation "ASR" (automatic speech recognition) was captured as "As r". So, in the process_dictionary.json file which you will need to put your key-value pair "As r": "ASR".&lt;br&gt;
The process_dictionary.json looks like this:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;{ &lt;br&gt;
    "As r": "ASR",&lt;br&gt;
    "Ai": "AI",&lt;br&gt;
    "Apis": "APIs"&lt;br&gt;
}&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;Then, you can use the following command to automatically replace those words. Don't worry, because it will reuse the previous processing. So, it will not download the audio or try to transcript it again.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight console"&gt;&lt;code&gt;&lt;span class="gp"&gt;python edit_blog_post.py --video &amp;lt;youtube_url&amp;gt;&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Well, now your post is ready and saved in the same path! &lt;/p&gt;

&lt;h3&gt;
  
  
  Conclusion
&lt;/h3&gt;

&lt;p&gt;Now that everything is settled. Let’s test it. For demonstration purposes, we will be using the DeepGram video tutorial on Youtube. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.youtube.com/watch?v=kibx5BR6trA" rel="noopener noreferrer"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fimg.youtube.com%2Fvi%2Fkibx5BR6trA%2F0.jpg" alt="What is DeepGram?"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The final blog post:&lt;/p&gt;




&lt;h1&gt;
  
  
  What is Deepgram?
&lt;/h1&gt;

&lt;p&gt;#aispeechplatform #voicedata #accuraterealtimetranscription&lt;/p&gt;

&lt;p&gt;Most automatic speech recognition services or ASR are built on technology that's over fifty years old . The old tech is fine for short call and response audio, but it just doesn't work for conversational audio . We built an end to end deep learning neural network that delivers actually usable transcriptions that get even better over time at lightning speed .&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ft7epppz0ouv9wyqt16ls.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ft7epppz0ouv9wyqt16ls.png" alt="What is DeepGram"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Voice enabled experiences are a big deal in business these days. For a good experience, you need a foundation of real time accurate transcription. But most automatic speech recognition services or ASR are built on technology that's over fifty years old.&lt;/p&gt;

&lt;p&gt;Yep, even these guys. The old tech is fine for short call and response audio. Alexa, what's the weather today? But it just doesn't work for conversational audio. I'm having a problem with my super user service. This is what's happening. In order to make real use out of voice data, transcription needs to be accurate, fast, cost effective, scalable. With the old tech, you can maybe get one or two of these requirements. But no matter how hard they try, they just can't adapt it to give you all four. So we reinvented it. Hi, we're Deepgram. We built an end to end deep learning neural network. A what? Sorry, an AI speech platform that delivers actually usable transcriptions that get even better over time at lightning speed. Without how hardware costs or high transcription costs. And because we're better faster and cheaper, guess what. We're also more scalable.&lt;/p&gt;

&lt;p&gt;So if you've avoided building that great voice feature because you lack the right APIs. Good news, it's time to get to work. Triple espresso, please, what could you do with accurate real time transcription? Oh, just off the top of our heads, You could create conversational ai. Virtual assistance voice analytics agent enable, compliance improvement better customer experience. Just to name a few. We're here to help you go big. If you don't know where to start, don't worry. In addition to providing the best technology stack, Deepgram makes an excellent partner. We know the world is constantly changing. And your audio and transcription needs will two. Will be with you every step of the way from labeling your data to training custom AI models to deployment on prem or in the cloud. To ensure your transcription foundation is powering the experience that actually delight your customers.&lt;/p&gt;

&lt;p&gt;So if you're ready to stop chugging along on less than ideal solutions and start building the great voice product, we're ready to help make it happen Deepgram.&lt;/p&gt;




&lt;p&gt;Thank you for reading this. Feel free to leave a comment. I hope it was helpful. Let's share some knowledge! See you in the next post! :)&lt;/p&gt;

</description>
      <category>hackwithdg</category>
      <category>machinelearning</category>
      <category>nlp</category>
      <category>python</category>
    </item>
  </channel>
</rss>
