Spoken word audio is stronger than ever: the how and the why

Spoken word audio, which comprises podcasts, audiobooks, talk radio, news, and sports, is growing at an astounding rate. The continued growth of podcasts and audiobooks have rejuvenated spoken word audio, which continues to take share away from the ever-popular music. 

In fact, the share of time spent listening to spoken word audio in the U.S. has increased 8% in the last year, and 30% in the past six years, whereas music share has decreased 8% over the last six years.

That’s the latest data from Spoken Word Audio Report from NPR and Edison Research, providing a deep look into the ongoing audio revolution: trends, listener behaviors and preferences, and changes over time. 

In this post, I take a deep look into the spoken word segment of the audio landscape and share my vision of things to come.

What’s driving the change in user behavior

For starters, technology. It’s a broad term that includes emerging tech such as smart speakers, increased and constant connectivity, and the omnipresence of mobile devices where 30% of all listening on smartphones is to spoken word audio. Combined, the tech factor is allowing audio to reach more people in new places and platforms. 

A good deal of the change is driven thanks to the nature of the spoken word, especially compared to other types of audio. As anyone who listens to podcasts or audiobooks will attest, spoken word is a significantly more concentrated and potent listening experience than any other type of audio. 

When people listen, they really listen and immerse themselves. And they do listen: 43% of the U.S. population listen daily, averaging two hours per day. 

Naturally, there’s also the information and entertainment factor to account for in situations where looking at a screen is not an option. From commuting to house chores to walking the dog and everything in between, eardrums are receiving a steady stream of content. Hence, it shouldn’t be that surprising that 45% of people are listening more to spoken word audio than merely five years ago. 

To top it all off, the ongoing pandemic has had an impact too as 40% of spoken word audio listeners age 13+ say they are listening more since quarantine restrictions started. The disruptions in routines and habits actually helped grow the intake of audio content, also affecting where most listening is happening – home. Such a trend is to be expected as more people shift to working at home.

Distribution of Spoken Word Audio Listening by Location report

Who is listening?

What is equally fascinating (if not more) is the fact that the time spent listening to spoken word is fairly consistent across every demographic. The highly coveted 13-34 age group (often touted as audio’s biggest fan) is registering the biggest six-year growth at 83% but are trailing behind the 35-54 and 55+ age groups who have a strong root in AM/FM radio.

Share of Time Spent Listening to Spoken Word Audio report

As expected, spoken word audio’s growth is driven by younger listeners with large increases among women, younger women, African-Americans, and Latinos. As a result, 43% of the population are daily spoken word audio listeners, averaging two hours per day listening. Those are impressive numbers, especially considering that spoken word audio is close to edging out music as the favorite daily audio content at 48%. 

What is being listened to?

Basically everything. News is by far the most popular spoken word audio topic, followed by music and comedy/humor as the top three topics that spark the most interest. 

Top Ten Spoken Word Audio Topics  report

Audio news segment has become one of the biggest winners (if there is such a thing) of the pandemic-laden 2020. Even years before COVID-19, publishers such as The Washington Post and The Economist have been investing more in their audio articles, hoping to give busy audiences a flexible but familiar way to explore stories and attract subscriptions. Audio articles of existing news are cheap to make because the reporting has already been done and there’s no need for additional production costs due to the scalable text-to-speech technology behind it. 

Now, these publishers are reaping the benefits of being early movers, along with powerhouses like McClatchy who have recognized the shift in user behaviors. After a successful trial on two of its news outlets that showed time spent on each site increased by 168%, story page views went up 89%, and visits per user increased by 95%, all of McClatchy’s 30 newsrooms across the U.S. now have audio articles through a new text-to-speech feature (yes, ours). Thanks to a potent mix of AI and machine learning, audio versions of news articles are generated within a matter of seconds with ads inserted into the content, resulting in a new revenue-generating stream.

Readers or listeners-to-be appreciate an option to get up to speed on the latest news developments while doing something else, inside or outside. Audio articles tend to be listened to all the way through, as opposed to text articles that have much quicker drop-off rates due to the reader’s ability to quickly skim through it. 

Our internal numbers show the same level of dedication when it comes to audio content completion rate. Audio articles represent a more engaging content option while also helping improve user satisfaction and loyalty. There’s increasingly more evidence suggesting that an audio version of content acts as an effective retention tool. Once listeners come to rely on it, they stick around.

People are recognizing the multifaceted benefits of audio

So, more listeners are starting to recognize the convenience of audio and its multitasking benefits. Audio is super easy to engage with when you’re doing two or more things (or at least trying to), and it’s equally a solo and group activity. In terms of monthly listeners, 52% say they exclusively listen alone, while the rest spends time listening with others. It’s simply easier to listen and consume information whenever and wherever, which is a major factor why people listen generally, not just to the spoken word. 

Reasons Why People Listen to More Spoken Word Audio report

What I find particularly interesting is the fact that listeners also perceive personal growth and spoken word’s ability to improve mental health as strong motivators to tune in regularly. The opportunity to improve oneself, get motivated, encouraged, or simply receive some positivity are among the most surprising perceptions spoken word audio has on listeners. They also see it as a welcome break from negativity and escape from the current events, as well as a way to navigate life’s problems and feel less lonely. 

That certainly explains why podcast and audiobook consumption is at an all-time high: 55% of the U.S. population has listened to a podcast, while 54% has listened to an audiobook. These make it possible to easily find content that is tailor-made for specific wants and needs, something that helps people stay connected and identify with. When you pair that with the fact that most people believe they process information more efficiently when they listen, you get a good sense of spoken word audio’s growth.

For example, audio is deeply embedded in our nature as our predecessors have been exchanging stories and information orally for tens of thousands of years. Listeners can benefit in terms of comprehension from someone’s inflections or intonations as some nuances are far easier communicated via audio than text.

Chandler Bing Friends GIF

 And since there is a difference between reading/listening to learn and reading/listening for pleasure (what the vast majority of listeners does) which slightly favors the latter, there’s also an added bonus of making content more accessible and improving literacy (which is surprisingly still a thing in 2020 – in the western world mind you) 

Why is all of this important?

In times of constant tech disruption and changing user behavior, audio is assuming a bigger role in the way we interact, consume content, and navigate through life on a daily basis. The joint study from Pandora, Publicis Media Exchange, and Edison Research has shown that growth in streaming audio shows isn’t slowing with 81% of listeners adding new time spent with streaming audio year-over-year. 

Equally important is to note that technology is affecting various audio segments, offering more revenue models, content, and delivery methods than ever before. In times when various publishers and advertisers are facing tremendous pressure to maintain performance, it is inspiring to see there’s a way to harness the power of the current media landscape and drive engagement (particularly the one-on-one type) at scale. 

We are quick to forget that media is technology these days, one that defines business models and content, too. For example, audio content via text-to-speech is more technology than audio in its traditional sense. Technological advances alike have increased the diversity of content delivery and monetization, as well as the content itself (we are talking about spoken word audio specifically, aren’t we?). 

In this continual process of evolution, audio is slowly reaching the mainstream level of its video counterpart, providing easy access at any time and any place. 

Not only is audio content generally easy to make (you simply speak or employ TTS software to do your bidding), it’s comparatively easier to make than its video counterpart. Plus, whether live, pre-recorded, or synthesized to sound as human as possible, audio can reach every individual simultaneously and at a low cost. 

Consider audio advertising as one of the dominant monetization methods of audio. In the aforementioned Pandora study, almost half of streaming audio listeners say audio ads are less disruptive than other forms of advertising. What’s more, due to audio’s ability to be highly personalized and dynamic, 43% say that audio ads are more relevant to them and 42% find them more likely to capture their attention than ads seen or heard in other places.

Another entry in the ‘pro’ column is the fact that audio is a highly resilient medium. The expected decrease in consumption due to disappearing commutes has been short-lived, offset by increased listening at other locations, mostly home. Along with the number of listeners at an all-time high and rising, this is yet another proof that audio has become an integral part of the daily routine, particularly the spoken word segment.

At the very least, audio deserves serious consideration when it comes to incorporating it into the overall media strategy. With each passing day, it becomes more of a necessity than a nice-to-have feature due to audiences expecting a listening experience. Not having an audio content strategy is the equivalent of not having a digital strategy 10 years ago.

Every publisher, content creator and brand strives for creating long-term, engaging relationships with audiences, I fully expect audio to become a growing part of those relationships. In order to grow, it’s essential to join the audio revolution. You hearin’ me?


Make sure you’re following me on Twitter for ongoing updates, tips, and industry takeaways!

Image credits:

https://www.nationalpublicmedia.com/insights/reports/the-spoken-word-audio-report/
https://giphy.com/gifs/reaction-friends-chandler-bing-N5UBY4vGMLlM4