How to scrape youtube playlist using python

Ever wanted to extract information from YouTube playlists? I recently needed to grab details like video titles and descriptions from a YouTube channel, and thought I’d share my approach. This simple tutorial will walk you through scraping a YouTube playlist using Python. Let’s dive in!

First, let’s import the essential packages:

from bs4 import BeautifulSoup as bs
import requests

Next, we’ll make a GET request to fetch the page content:

# Target the specific playlist URL
link='https://www.youtube.com/playlist?list=PLRVhX1weTUd_hs_mcpfwcOq18tSdL0y60'
# Use requests to fetch the webpage content
r=requests.get(link) 
page=r.text # Convert to text
soup=bs(page,'html.parser') # Parse with BeautifulSoup
#print(soup.prettify()) # Uncomment to see the HTML structure

To extract video titles:

title_soup=soup.find('meta',property='og:title')
vid_title=title_soup['content'] if title_soup else 'not found'
print(vid_title)

For video descriptions:

disrip_soup=soup.find('meta',property='og:description')
vid_discription=disrip_soup['content'] if disrip_soup else 'not found'
print(vid_discription)

Trying to get upload dates (which doesn’t work for playlists):

date_soup=soup.find('meta',itemprop='uploadDate') # Doesn't work for playlists
upload_date=date_soup['content'] if date_soup else 'not found'
print(upload_date)
#->not found

Web scraping doesn’t have to be intimidating! YouTube’s structured format makes it relatively straightforward to extract the information you need. You can expand on this foundation to extract other metadata using different property tags. Happy scraping!