Python and RequestsHTML Web Scraping Dynamic Content from JavaScript applications
YOUR LINK HERE:
http://youtube.com/watch?v=-PmNcIX9En4
In this video, we'll learn how to scrape content that is NOT present in initial page loads, but instead is loaded dynamically by JavaScript. • This is a common problem with scraping the modern web: the initial response contains minimal HTML and a SPA-based JavaScript app (React, Vue, Angular, etc). The data that we want to scrape on the page is therefore not present, but is rendered later via API calls from the SPA application. • We will look at how we can use requests-html to solve this issue in Python when scraping such sites. We'll also look at using this with BeautifulSoup in order to find data on the page. • This video makes use of the following sample website (a React application): • https://react-amazon-bestsellers-book... • ð ððĩðŪð―ððēðŋð: • 00:00 Intro • 02:15 Sending GET request using Python requests library • 04:00 Finding objects with BeautifulSoup • 05:15 Installing requests-html • 06:38 Executing JavaScript on page using requests-html • âïļ ððð ðšðē ðŪ ð°ðžðģðģðēðē: • To support the channel and encourage new videos, please consider buying me a coffee here: • https://ko-fi.com/bugbytes • ðĶðžð°ðķðŪðđ ð ðēðąðķðŪ: • ð Blog: https://bugbytes.io/posts/ • ðū Github: https://github.com/bugbytes-io/ • ðĶ Twitter: / bugbytesio • ð ðððŋððĩðēðŋ ðŋðēðŪðąðķðŧðī ðŪðŧðą ðķðŧðģðžðŋðšðŪððķðžðŧ: • requests-html: https://pypi.org/project/requests-html/ • BeautifulSoup: https://beautiful-soup-4.readthedocs.... • requests: https://pypi.org/project/requests/ • Sample website: https://react-amazon-bestsellers-book... • #python #webscraping #datascience
#############################
