If you’ve ever wanted to change the pitch of a song without altering its speed, this blog post is for you. Pitch-shifting is a common task for musicians, DJs, and audio engineers. In this tutorial, we will explore how to down-pitch a song using Python and the pydub library and apply this process to multiple songs in a folder automatically.
Why Pitch Shift?
In music, pitch-shifting means changing the pitch of a song (raising or lowering it) without speeding it up or slowing it down. This can be useful for:
- Matching the key of a song to another track 
- Transposing songs for instruments tuned to a different key 
- Creating remixes or mashups 
Tools You Will Need
We will use the Python library pydub to manipulate audio files. You can install it using pip:
pip install pydub
Additionally, pydub requires ffmpeg to handle audio files like MP3. You can install ffmpeg via the terminal:
sudo apt install ffmpeg
Step-by-Step Guide to Pitch Shifting
Now let’s dive into the Python script that automates pitch-shifting for multiple songs in a folder. The script loops through the files in a songs folder, down-pitches them by a half-step (semitone = -1), and saves the new files to an output folder.
The Code
import os
from pydub import AudioSegment
# Function to shift pitch down
def pitch_shift(audio, semitones):
    # Adjust sample rate to shift pitch
    new_sample_rate = int(audio.frame_rate * (2.0 ** (semitones / 12.0)))
    return audio._spawn(audio.raw_data, overrides={'frame_rate': new_sample_rate}).set_frame_rate(audio.frame_rate)
# Input and output folders
input_folder = './songs'
output_folder = './output'
# Ensure the output folder exists
os.makedirs(output_folder, exist_ok=True)
# Loop through all files in the songs folder
for filename in os.listdir(input_folder):
    # Check if the file is an audio file (e.g., mp3 or wav)
    if filename.endswith(".mp3") or filename.endswith(".wav"):
        # Construct the full file path
        input_path = os.path.join(input_folder, filename)
        output_path = os.path.join(output_folder, filename)
        # Load the audio file
        audio = AudioSegment.from_file(input_path)
        # Shift pitch down by a half-step (semitone = -1)
        shifted_audio = pitch_shift(audio, -1)
        # Export the pitch-shifted audio to the output folder
        shifted_audio.export(output_path, format="mp3")
        print(f"Processed and saved: {output_path}")
Explanation
- Importing Libraries: 
 We import- osto work with file directories and- AudioSegmentfrom- pydubto manipulate audio files.
- Pitch-Shift Function: 
 The- pitch_shiftfunction adjusts the sample rate of the audio. When we change the sample rate, the pitch changes. In this case, we calculate the new sample rate to shift the pitch down by one semitone using the formula:
 
 - new_sample_rate = int(audio.frame_rate * (2.0 ** (semitones / 12.0)))
 
- Input and Output Folders: 
 We define the folders where we will read the audio files and save the pitch-shifted versions. If the output folder doesn't exist, it will be created.
- 
Loop Through Songs: 
 Usingos.listdir(), we loop through each file in thesongsfolder. The script checks if the file is an audio file (.mp3or.wav) before processing it. For each file:- It loads the audio.
- The pitch_shift function is applied, lowering the pitch by a half-step.
- The pitch-shifted audio is exported to the output folder.
 
- Export and Feedback: 
 Once the processing is done, the pitch-shifted song is saved in the- outputfolder, and a confirmation message is printed.
Running the Script
Make sure you have your audio files in the songs folder and then run the script:
python -m pitch_down
The pitch-shifted files will be saved in the output folder.
Customization
You can easily modify this script to:
- Pitch the audio up by passing a positive value (e.g., - pitch_shift(audio, 1)for a half-step up).
- Process different file formats by adding other extensions like - .oggor- .flacto the conditional check.
- Shift by a different number of semitones by adjusting the - semitonesargument.
Conclusion
This script is a simple yet powerful way to pitch-shift multiple audio files using Python. With pydub and ffmpeg, you can manipulate audio files in bulk, making tasks like pitch correction or audio preparation easier for musicians, producers, or anyone working with audio.
Feel free to experiment with this script and see how you can adapt it to your needs. Happy coding!
 
 
              
 
    
Top comments (4)
Wow this is really cool!
Thanks!!
Can you explain the new sample formula a bit more in detail, like the magic numbers?
What this does:
This formula changes the pitch of an audio file by adjusting its sample rate based on the number of semitones you want to shift up or down.
Key Terms:
audio.frame_rate:This is the original sample rate of the audio file. It defines how many samples per second the audio was originally recorded at.semitones:A semitone is the smallest musical interval (the distance between two adjacent keys on a piano). Positive values shift the pitch up, and negative values shift it down.2.0 ** (semitones / 12.0):This is the mathematical formula that determines the pitch shift. The number12represents the number of semitones in an octave. So:If semitones= 12, you shift the pitch up by 1 full octave.If semitones= -12, you shift the pitch down by 1 full octave.If semitones= 1, you shift the pitch up by 1 semitone (like moving one key up on a piano).new_sample_rate:This is the sample rate of the audio after the pitch shift.How it works:
Finally,
int()is used to ensure the result is a whole number, as the sample rate must be an integer.Example:
If
audio.frame_rate = 44100Hz (a common sample rate) and you want to shift up by 1 semitone:python