Siddharth Singh Baghel

Posted on Sep 15, 2022 • Edited on Nov 12, 2022

Hello Python (Voice Assistant) - For Windows Users

Siddharth Singh Baghel · 2022-09-15T10:24:29Z

App's Overview In this tutorial, we will be going through a voice assistant build with python for beginners. Prerequisites for reading this article are basic knowledge of python and python package importing. Table Of Content Project Setup Imports & package installation Features and more GitHub Repo Writer's Support ❤️ Project Setup I have used the VS code editor for this project. PyCharm is also equally recommended, it depends upon your personal choice which among the two to choose. But other than these two I won't strongly recommend any editor. Create a file my_assistant.py . Imports & Packages Installation We will be using various python packages for creating our smart assistant. Some of them need to be installed while some are pre-installed packages. The imports visible in the above image are the brain cells of our assistant 😁. pyttsx3 - The most important package is pyttsx3 . it is a text-to-speech conversion library in Python. Unlike alternative libraries, it works offline and is compatible with both Python 2 and 3. ## to download the package, write it in the terminal pip install pyttsx3 ## to import write this in my_assistant.py import pyttsx3 Enter fullscreen mode Exit fullscreen mode speech_recognition - It allows computers to understand human language. Speech recognition is a machine's ability to listen to spoken words and identify them. ## to download the package, write it in the terminal pip install SpeechRecognition ## to import write this in my_assistant.py import speech_recognition as sr Enter fullscreen mode Exit fullscreen mode wikipedia - Wikipedia is a Python library that makes it easy to access and parse data from Wikipedia. Search Wikipedia, get article summaries, get data like links and images from a page, and more. ## to download the package, write it in the terminal pip install wikipedia ## to import write this in my_assistant.py import wikipedia as wk Enter fullscreen mode Exit fullscreen mode datetime - For accessing date and time. ## to import write this in my_assistant.py import datetime as dt Enter fullscreen mode Exit fullscreen mode webbrowser - For accessing the browser. ## to import write this in my_assistant.py import webbrowser as wb Enter fullscreen mode Exit fullscreen mode os - For accessing operating system's operations. ## to import write this in my_assistant.py import os Enter fullscreen mode Exit fullscreen mode Features and More This can be extended immensely as per the developer's interest. I have provided basic steps with a few features in this tutorial. But you can fork the GitHub Repo for various features. Feature - 1 (Listening) We will make our assistant listen us firstly 😁 . takeCommand() function is ears for our assistant. def takeCommand(): r = sr.Recognizer() # sr-> speech_recognition with sr.Microphone() as source: r.adjust_for_ambient_noise(source) # for cancelling the backgroud noise r.pause_threshold = 1 print("Listening...") maudio = r.listen(source) print("Listened") try: print("Recognizing...") query = r.recognize_google(maudio, language = 'en-in') # this step will recogize the text you spoke and store it into var query except Exception as e: query = "Say that again please!" return query Enter fullscreen mode Exit fullscreen mode Now, you can use the query returned from takeCommand() for various tasks. Feature - 2 (speaking) It's time for our assistant to respond, because conversation must be two sided. 😊 speakBaby() function will let our assistant speak. def speakBaby(audio): engine = pyttsx3.init('sapi5') # learn more about sapi5 in pyttsx3 documentation voices = engine.getProperty('voices') # for getting all the voices (male and female) engine.setProperty('voice', voices[1].id) # set a voice with your choice engine.say(audio) engine.runAndWait() Enter fullscreen mode Exit fullscreen mode Feature - 3 (Wishing) wishMe() function allows our assistant to wish us according to the time of the day. def wishMe(): hour = int(dt.datetime.now().hour) if hour in range(0,12): speakBaby("Good morning Sid .. How may I help you !") elif hour in range(12,17): speakBaby("Good Afternoon Sid .. How may I help you !") else: speakBaby("Good Evening Sid .. How may I help you !") Enter fullscreen mode Exit fullscreen mode We used speakBaby() for our assistant to speak out time. Note : These are the basic features for a working assistant and for more features checkout the GitHub repo attached below. Siddharth-sing / Python_Talks Voice-Assistant-Python <img src="https://camo.githubusercontent.com/1acec6bc44a5c4bc5f0b0fd5894f79683f95be00b9fdffea50be6ebd259fa71a/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f6465762e746f2d3041304130413f7374796c653d666f722d7468652d6261646765266c6f676f3d6465762e746f266c6f676f436f6c6f723d7768697465" height="200px" width="500px" data-canonical-src="https://img.shields.io/badge/dev.to-0A0A0A?style=for-the-badge&logo=dev.to&logoColor=white" style="max-width: 100%;"> App's Overview In this tutorial, we will be going through a voice assistant build with python for beginners. Prerequisites for reading this article are basic knowledge of python and python package importing. Table Of Content Project Setup Imports & package installation Features and more GitHub Repo Writer's Support ❤️ Project Setup I have used the VS code editor for this project. PyCharm is also equally recommended, it depends upon your personal choice which among the two to choose. But other than these two I won't strongly recommend any editor. Create a file my_assistant.py . Imports & Packages Installation We will be using various python packages for creating our smart assistant. Some of them need to be installed while some are pre-installed packages. <img src="https://camo.githubusercontent.com/261a7a67ac804617220773c5147d7e892a1876af7091caeffaf93c427e69126b/68747470733a2f2f6465762d746f2d75706c6f6164732e73332e616d617a6f6e6177732e636f6d2f75706c6f6164732f61727469636c65732f73623537677377773239643731703633347972642e706e67" alt="Image description" data-canonical-src="https://dev-to-uploads.s3.amazonaws.com/uploads/articles/sb57gsww29d71p634yrd.png" style="max-width: 100%;"> The imports visible in the above image are the brain cells of our assistant 😁. pyttsx3 - The most important package is pyttsx3 it is a … View on GitHub Writer's Support If you find the article useful show some ❤️ by staring some of my repositories and following me on dev.to and github.

#python #tutorial #beginners #devjournal

App's Overview

In this tutorial, we will be going through a voice assistant build with python for beginners. Prerequisites for reading this article are basic knowledge of python and python package importing.

Project Setup

I have used the VS code editor for this project. PyCharm is also equally recommended, it depends upon your personal choice which among the two to choose. But other than these two I won't strongly recommend any editor.
Create a file my_assistant.py.

Imports & Packages Installation

We will be using various python packages for creating our smart assistant. Some of them need to be installed while some are pre-installed packages.

The imports visible in the above image are the brain cells of our assistant 😁.

pyttsx3 - The most important package is pyttsx3.
it is a text-to-speech conversion library in Python. Unlike alternative libraries, it works offline and is compatible with both Python 2 and 3.

## to download the package, write it in the terminal

pip install pyttsx3

## to import write this in my_assistant.py

import pyttsx3

speech_recognition - It allows computers to understand human language. Speech recognition is a machine's ability to listen to spoken words and identify them.

## to download the package, write it in the terminal

pip install SpeechRecognition

## to import write this in my_assistant.py

import speech_recognition as sr

wikipedia - Wikipedia is a Python library that makes it easy to access and parse data from Wikipedia. Search Wikipedia, get article summaries, get data like links and images from a page, and more.

## to download the package, write it in the terminal

pip install wikipedia

## to import write this in my_assistant.py

import wikipedia as wk

datetime - For accessing date and time.

## to import write this in my_assistant.py

import datetime as dt

webbrowser - For accessing the browser.

## to import write this in my_assistant.py

import webbrowser as wb

os - For accessing operating system's operations.

## to import write this in my_assistant.py

import os

Features and More

This can be extended immensely as per the developer's interest. I have provided basic steps with a few features in this tutorial. But you can fork the GitHub Repo for various features.

Feature - 1 (Listening)

We will make our assistant listen us firstly 😁 .
takeCommand() function is ears for our assistant.

def takeCommand():
    r = sr.Recognizer()  # sr-> speech_recognition
    with sr.Microphone() as source:
        r.adjust_for_ambient_noise(source)  # for cancelling the backgroud noise 
        r.pause_threshold = 1
        print("Listening...")
        maudio = r.listen(source)

    print("Listened")   
    try: 
        print("Recognizing...")
        query = r.recognize_google(maudio, language = 'en-in')  # this step will recogize the text you spoke and store it into var query

    except Exception as e:
        query = "Say that again please!"

    return query

Now, you can use the query returned from takeCommand() for various tasks.

Feature - 2 (speaking)

It's time for our assistant to respond, because conversation must be two sided. 😊
speakBaby() function will let our assistant speak.

def speakBaby(audio):
    engine = pyttsx3.init('sapi5')  # learn more about sapi5 in pyttsx3 documentation 
    voices = engine.getProperty('voices') # for getting all the voices (male and female)
    engine.setProperty('voice', voices[1].id) # set a voice with your choice
    engine.say(audio)
    engine.runAndWait()

Feature - 3 (Wishing)

wishMe() function allows our assistant to wish us according to the time of the day.

def wishMe():
    hour = int(dt.datetime.now().hour)
    if hour in range(0,12):
        speakBaby("Good morning Sid .. How may I help you !")
    elif hour in range(12,17):
        speakBaby("Good Afternoon Sid .. How may I help you !")
    else:
        speakBaby("Good Evening Sid .. How may I help you !")

We used speakBaby() for our assistant to speak out time.

Note : These are the basic features for a working assistant and for more features checkout the GitHub repo attached below.

Siddharth-sing / Python_Talks

Voice-Assistant-Python

App's Overview

In this tutorial, we will be going through a voice assistant build with python for beginners. Prerequisites for reading this article are basic knowledge of python and python package importing.

Table Of Content

Project Setup

I have used the VS code editor for this project. PyCharm is also equally recommended, it depends upon your personal choice which among the two to choose. But other than these two I won't strongly recommend any editor.
Create a file my_assistant.py.

Imports & Packages Installation

We will be using various python packages for creating our smart assistant. Some of them need to be installed while some are pre-installed packages.

The imports visible in the above image are the brain cells of our assistant 😁.

pyttsx3 - The most important package is pyttsx3 it is a …

View on GitHub

Writer's Support

If you find the article useful show some ❤️ by staring some of my repositories and following me on dev.to and github.

Top comments (2)

Akin C. • Sep 22 '22

Hello Siddharth Singh,

thank you for your article.
I like how it's structured.
I also like the high level explanation of the different packages.

I was trying to get your code to work on Google Colab just for fun and I know you specifically recommend only pycharm and VS code.
I only mention this because it seems that your code regarding this documentation only works on Windows OS:
pypi.org/project/comtypes/

I stumbled across this when the following error popped up:
ImportError: cannot import name 'COMError' from '_ctypes'
(It seems that one of the packages needs the package comtypes, so I installed it)

If this is true, I recommend that your article mentions that it's only for Windows.
It might help provide more clarity.

To reproduce the error in Google Colab (for the curious):

Download packages

!pip install pyttsx3
!pip install SpeechRecognition
!pip install wikipedia

Import packages

import pyttsx3
import speech_recognition as sr
import wikipedia as wk
import datetime as dt
import webbrowser as wb
import os

Run the code from article and call

speakBaby("Good morning Sid .. How may I help you !")

An error will pop up that package comtypes is needed
Install comtypes before the imports
!pip install comtypes
Import comtypes after package installation
The following error should pop up
ImportError: cannot import name 'COMError' from '_ctypes'

Siddharth Singh Baghel • Oct 31 '22

@incrementis,

Thanks for reviewing the article carefully and I am glad you find it useful. Also I apologize for the inconvenience. I will mark that the article is meant for the windows users only.