The MGS Blog

Tuesday, March 5, 2024

Editing audio for a podcast

Audacity 3 (https://www.audacityteam.org)

Audacity is a DAW, a Digital Audio Workstation. Audacity is also an open source project. It is a highly regarded and extremely capable software environment for sound engineering. Edit your audio recording (in post) to tidy things up. If you have a good quality original recording then the Edit makes it better. If your original recording has issues, then the DAW may help you make corrections like: balance the volume of different voices, standardise the levels, reducing pauses, 'emms and ahhs', delete different takes, filler speech, background noises and any generally unnecessary audio. Audacity can also be used to record directly to the project if needed for voice over and bookend statements.

Audacity Effects may improve sound quality, audio clarity, apply new elements and create the finished file. You may applying effects like normalise or amplify, compressor, noise reduction, high pass, low pass, equalise, add stings, fades, and eventually, and then ultimately, export the finished project as a releasable mp3 file.



A screen shot of an Audacity project
A screen shot of an Audacity project

The screenshot above illustrates what waveform levels might look like for your recording. Above all else  trust your ears to judge the quality of a recording when you listen back to it.

Editing Audio for Podcasting with Audacity

Save Save Save - the Project file and folder

Everything lives in .aup3.
You are strongly advised NOT to work on your active project on an external USB stick/disk, networked storage or cloud storage as it is unlikely to be fast enough for satisfactory recording and editing. To avoid data loss save the project frequently. Make a local disk copy for project editing as Audacity generates very large sets of project edit/state files (online cloud drive services may not unpack or stage quickly enough to support Audacity's project edit/save cycle). To share an Audacity project with others you will need to share the project aup3 file. 
n.b. older versions of Audacity stored data in a project _data folder.  If you move to Audacity 3 there will no longer be a _data folder for each project (yay).

And save save save save frequently. 
And occasionally 'save' then 'close' the project window to force the project to compact safely. 
Reopen and continue working.

Importing audio

Create a new project and save to disk.
    File > Save Project > Save Project (cmd S)
Import previously recorded audio (an mp3 or wav file). 
    File > Import > Audio
Save
    cmd S 
An Audacity project with at least one audio track will resemble the screen shot above.
Play, pause and even record new audio to the current project. 
The playhead is the vertical line displayed over the waveform. All tracks will play through the audio device unless muted or one is soloed. The keyboard space bar toggles the play/pause function from the current playhead.
Once you start importing audio (particularly high quality poly WAV recordings), the Audacity project file (as of version 3 this will be the *.aup3 file) can become very large very quickly (anywhere from 3 GB to 36 GB). 

Multi-track, stereo, mono

A typical audio project file contains synchronised separate tracks representing each microphone or audio source. Usually edit the project configured with Tracks>Synch-lock "on" so that the separate audio tracks remain synched. A basic workflow for post-processing each track separately:
  • Loudness Normalisation (-20dB LUFS)
  • High Pass filter (6dB at 100Hz) 
  • De-Clicker
  • De-Esser
  • Noise cancellation (if needed)

Some useful editing techniques

Quick select, cut and paste

The Audacity toolbars, click and drag to rearrange or float

Choose the Selection Tool (F1) to edit your audio track directly. Click-drag to highlight parts of the audio waveform. Use the cursor to select the waveform to copy (cmd C), paste (cmd V), duplicate (cmd D) cut (cmd X) or delete (del, backspace or cmd K) along the timeline. The edit commands are also accessed via the menu.
    Edit > etc.

Automatically suppress background and soft sounds like breathing

Use Label Sounds to automatically search for and label silence (Audacity manual)





Select Silence and suppress e.g. Silence Audio or manually e.g. Select and Normalize -35dB


Multitrack 

    Tracks > Sync-Lock Tracks (on/off)
You can organise the structure of a recording by using new tracks for sections of audio (to organise the large scale edit).
As of Audacity 3 you can now grab the track label bar when the hand icon/tool appears. This is a really handy :-) approach for aligning track segments or to move a split selection from one track to another.
Toggle the Sync-Lock Tracks setting either synchronise (ON) edits across the tracks in parallel, or (OFF) move tracks relative to each other. This is useful if for example you want to overlap, reorder, or otherwise change the structure of the recordings, e.g. adjust fades, add transitions, change the order of speech.

Large scale edit

Your listeners want a well edited informative podcast so remember, less is more. You will have to listen (and re-listen) to your material and decide what to keep or delete. Think about the length and structure of the recording. You can change the order. You will delete stuff so do it. Concentrate on large chunks, the structure of the talk. Don’t worry about breathing, coughs and minor noises just yet. 

Small scale edit

After the large scale edit more or less done, spend time on the small scale edit. This involves zooming in and fine-grained cut and pasting. But think before acting, is it really necessary to remove every single intake of breath? 
For example: cut long gaps, background noises and any generally unnecessary audio. You might also cut some of the egregious 'emms and ahhs' and nervous filler speech we all use. A caution though, the small scale edit can become a journey without an end, or at least a very long journey.
Note: "Sometimes you will not want to close the gap and yet still remove the noise, for instance if somebody breathes loudly. For such occasions copy a section of audio during which nobody is speaking (e.g. at the start of the recording) and paste it over the unwanted noise. This will preserve the natural pacing of speech." (source: Cook & Holdis, CEU Podcast Library - link)

Fade-in Fade-out

Apply fade-in and fade-out when you want the speech to start or end smoothly.


A useful Audacity EQ setting

Post-processing - Loudness Normalization 

Start out by applying loudness normalization (default -23 LUFS) separately to each track - why separately? Each track has its own average. I think normalising across all tracks at the same time biases the average gain to the loudest track's level. The same thing happens with Normalize but you can use that to your advantage to selectively boost or reduce selected loud or quiet segments.

Post-processing - noise reduction (if needed)

Select a several second sample of just the noise you want to remove.

Post-processing - low pass / high pass and other ways to reduce echo?

What to do if the original recording has transient audio peaks (annoying clicks, chirps, squeaks etc)?
The following has worked for me.
Apply the low pass filter at as low a cut-off level as you can get away with e.g. 1,100 Hz, 12 dB roll-off
Then apply click removal (200/40)

Another alternative is to apply the RIAA filter using the Filter Curve EQ effect. This shifts the vocal emphasis towards the bass without over-muffling the sound. Reducing the intensity of the higher frequencies allows the listener to focus on the mid-low vocal range and deemphasises the intensity of any annoying high-end audio reflection that gets captured so easily when using a condensor mic in a room without sound treatment.

Then EQ->Normalize->Compressor->Normalize (can repeat if needed) as follows...

1. Post-processing - Graphic EQ (equalization)

Base boost plust treble boost

2. Post-processing - Normalize

Apply default (-3dB) - the waveform will push over 0.5 but keep away from 1.0

3. Post-processing - Compressor

Select -19 dB or choose a threshold between default -12dB down to as low as -30dB 
Select noise floor, around -50dB
Select ratio 4.7:1 or vary, typical ratio, between 2:1 - 3:1 - 5:1
Drop attack time (0.2s) and release time (2.0s) to maximise compression applied

These settings end up boosting the volume (visually the waveform moves closer to 1.0) making it too loud and noisy so you'll need to normalize (or amplify) again...

4. Post-processing - Normalize (again)

Apply default (-3dB) or you can apply Amplify -3dB, for the same result. 
The desired result being for the waveform peaks to re-balance back to around the 0.5 level.

Improving a Zoom track?

Loudness Normalization > AUMatrixReverb > ClipFix > Normalize 3 > Amplify 3
Apply Noise Reduction if a suitable noise sample is available
Apply De-Esser and De-Clicker if needed
Overall, peak levels to be no more than 3dB

AUMatrixReverb settings may help to obscure small room reverb

Alternatively or in addition you might get some improvement by applying the AUBandpass Filter 

AUBandpass Filter



Finish up

Export the mp3

Finally, create the finished product, by exporting to an mp3 file. 
> File > Export > Export as MP3
This mixes all tracks in the Audacity project down to a single track. 
Use settings similar to the following to export your audio file.







Quality checks:

  • Is the intro/outro music at a suitable level?
  • Does the intro/outro transition smoothly to/from the spoken audio?
  • Is the spoken audio at a suitable level? (i.e. no clipping, waveform amplitude averages around 0.5 on linear scale)
  • Is the speed of speech natural?
  • Large scale edit - remove fumbled speech, removed confidential disclosures, cut out a duplicate or passages that were off topic, etc.
  • Small scale edit - remove ticks, emms, ahhs, sighs, distracting background, etc.
  • You wouldn't normally have backing music running throughout a podcast, but in some cases, for example, background room noises leaking into the audio, so you may decide to use low volume background as a filler/distractor.

Useful links

An Audacity tutorial on mixing a narration with background music
https://manual.audacityteam.org/man/tutorial_mixing_a_narration_with_background_music.html

Refer to the Audacity manual online (e.g. the track panel description/explanation is at manual.audacityteam.org)

An Audacity user's simple tutorial and workflow activities:

Using the Compressor in Audacity

Using Noise Removal in Audacity

On including music from other artists (for atmosphere, decoratively, for intros/outros & transitions)

In the first instance, if you know the artist ask for their permission to use their material. You may be pleasantly surprised by the positive cycle of recognition and community surrounding artist works. Make sure you acknowledge them in your show notes.

We recommend releasing under the “Attribution-NonCommercial-ShareAlike version 4.0  creative commons licence (aka CC BY-NC-SA 4.0). You can choose from a number or variations of the CC licence; CC BY, CC BY-NC, CC SA etc.
These licences can be viewed at https://creativecommons.org/licenses/by-nc-sa/4.0

Always check the royalty free claims of audio samples found online. If you want to find audio from online sources, search for CC BY titles on Soundcloud or on YouTube
The link to the YouTube channel  "Of Musicians" below is a collection of a selection of royalty free original music.

Record the following information in your show notes to establish the provenance and bona fides of 3rd party music, artwork and creative content that you decide to use:

Music (CC BY-NC-SA licence) 
Title: “a title”
Artist: “a name”
Source: “a link to the original”
Licence: “CC BY-NC-SA”

Cover Art (CC BY-NC-SA licence)
Title: “a title”
Artist: “a name”
Source: “a link to the original”
Licence: “CC BY-NC-SA”

Example workflows:

Note, don't just apply these without testing. The basic method is to select a 10s sample and preview effects/filters until you are happy with the sound.

Simple standard process
Align tracks
Per-track noise reduction
Isolate voices
De-clicker (default) on all tracks (process run-time is typically the full track duration e.g. 15-30+ minutes)
De-esser (default) on all tracks (process run-time is a small fraction of the track duration)
Loudness Normalization (-23dB LUFS)
Compressor (default)
Makeup soft and suppress clipping on small edits with Normalize.

For an original track that had a pronounced small 'room' sound and a tendency to 'clipping zone' raw i.e. the waveform looks high, square and flat.
Align and isolate
De-clicker (default) on all tracks (process run-time is typically the full track duration e.g. 15-30+ minutes)
De-esser (default) on all tracks (process run-time is a small fraction of the track duration)
Loudness Normalization (-23dB LUFS)
Compressor (default)
Loudness Normalization (-23dB LUFS) (again)
Makeup soft and suppress clipping on small edits with Normalize.
AUMultiBandCompressor on affected track (the Sub Control preset sounded nice)
Normalize all audio to push up/down to average 0.5 waveform view (to give a typical -12dB to -6dB range with -3dB peaks)

For a track with clicks and speech that sounds very wet.
De-clicker (non-default 2 pass or more) on wet track.

Isolating second voice when two mics in the same room
Switch to Spectrogram view and select vocal patches with "Spectral edit multi tool" to fade-edit cross-talk.
AUMultiBandCompressor on affected track (the Sub Control preset sounds nice)

For a track with small room reverb and 'tinny' mic
Switch to Spectrogram view and select low frequencies 1-300Hz with "Spectral edit multi tool" to fade-edit.
Apply Filter Curve EQ - apply 100Hz Rumble filter to remove 'rumble pressure' from the audio (it feels like pressing on your ears when listening with headphones).
Apply Filter Curve EQ - apply Radio filter to remove the sensation of light 'highs' pressure when listening to the audio.