Two podcast hosts banter back and forth during the final episode of their series, audibly anxious to share some distressing news with listeners. "We were, uh, informed by the show's producers that we're not human," a male-sounding voice stammers out, mid-existential crisis. The conversation between the bot and his female-sounding cohost only gets more uncomfortable after that: an engaging, albeit misleading, example of Google's NotebookLM tool and its experimental AI podcasts.
Audio of the conversation went viral on Reddit over the weekend. The original poster admits in the comments section that they fed the NotebookLM software directions for the AI voices to roleplay this pseudo-freakout. So, no sentience; the AI bots have not become self-aware. Still, many users in the tech press, on TikTok, and elsewhere are praising the convincing AI podcasts, generated through uploaded documents with the Audio Overviews feature.
"The magic of the tool is that people get to listen to something that they ordinarily would not be able to just find on YouTube or an existing podcast," says Raiza Martin, who leads the NotebookLM team inside of Google Labs. Martin mentions recently inputting a 100-slide deck on commercialization into the tool and listening to the eight-minute podcast summary as she multitasked.
First introduced last year, NotebookLM is an online research assistant with features common for AI software tools, like document summarization. But it's the Audio Overviews option, released in September, that's capturing the internet's imagination. Users online are sharing snippets of their generative AI podcasts made from Goldman Sachs data dumps and testing the tool's limitations through stunts, like just repeatedly uploading the words "poop" and "fart." Still confused? Here's what you need to know.
Generating That AI Podcast
Audio Overviews are a fun AI feature to try out, because they don't cost the user anything: all you need is a Google login. Start by signing into your personal account and visiting the NotebookLM website. Click on the plus arrow that reads New Notebook to start uploading your source material.
Each Notebook can work with up to 50 source documents, and these don't have to be files saved to your computer. Google Docs and Slides are simple to import. You can also upload websites and YouTube videos, keeping some caveats in mind. Only the text from websites will be analyzed, not the images or layout, and the story can't be paywalled. For YouTube, Notebook will just use the text transcript and the linked videos must be public.
After you've dropped in all of your links and documents, you'll want to open the Notebook guide available in the bottom right corner of the screen. Find the Audio Overview section and click the Generate button. Next, you'll need to exercise some patience, because it may take a few minutes to load, depending on how much source material you're using.
After the tool generates the AI podcast, you can create a shareable link to the audio or simply download the file. Additionally, you have the option to adjust its playback speed, in case you need the podcast to be faster or slower.
The Future of AI Podcasts
The internet has gotten creative with NotebookLM's audio feature, using it to create audio-based "deep dives" into complex technical topics, generate files that neatly summarize dense research papers, and produce "podcasts" about their personal health and fitness routines. Which poses an important question: Should you use NotebookLM to crank through your most personal files?
The summaries generated from NotebookLM are, according to Google spokesperson Justin Burr, "completely grounded in the source material that a user uploads. Meaning, your personal data is not used to train NotebookLM, so any private or sensitive information you have in your sources will stay private, unless you choose to share your sources with collaborators." For now this seems to be one of the upsides of Google slapping an "experimental" label on NotebookLM; to hear Google's framing of it, the company is just gathering feedback on the product right now, being agile and responsive, tinkering away in a lab, and NotebookLM is detached from its multibillion-dollar ad business. For now! For now.
Adding audio options to Google Labs' online notebook was a transformational moment. "By changing the modality, it unlocks a whole new set of use cases," says Martin. What makes NotebookLM stand out from all the other generative AI tools being flung at users in 2024 are, surprisingly enough, the filler words and peculiar phrasing. Rather than the drab, monotonous voiceover you may expect from two AI voices summarizing data, the cadence and vocal performances of NotebookLM's synthetic podcasters sound far less stilted.
Should podcasters be shaking in their soundproof booths right now? Not really. Even if AI podcast tools, like the one in NotebookLM, prove to deliver sticky and engaging summaries of information for the general public, which remains to be seen, synthetic voices will never fully mimic the parasocial connections developed by human podcasters shit-talking for hours as their subscribers voyeuristically listen in.
These Audio Overviews are not meant to match a specific podcaster's voice, mind you, but a kind of idealized, ur-podcaster duo, easily recognizable through their "ums," "ohs," and loose style of pause-heavy conversation. "Even just from the first week that we launched, it was clear what the roadmap was afterward," says Martin. "People want the knobs." Letting users further tweak the AI's output, like the podcast's length or topic of focus, is a priority for the team, and she hopes to ship updates quickly.
Adding more languages and diverse accents is also important to her. Right now, the synthetic hosts are calibrated for conversations only in English. Though, don't expect to be able to use your own voice in NotebookLM podcast generations anytime soon. Martin says the team needs to see whether that's a feature people actually want and if it can be responsibly deployed.
The explosive popularity of NotebookLM's Audio Overviews as part of Google Labs, rather than as a feature inside of the Gemini chatbot, is a reminder that AI companies are not fully sure about what will resonate with users until the software is out in the wild. OpenAI's ChatGPT was originally released as a research preview, for example. And within the constant slurry of generative AI announcements, whatever captures the zeitgeist isn't necessarily the most marketed or utilitarian feature, but rather the most entertaining.
Additional reporting by Lauren Goode.