Beautiful Soup Tutorial – Web Scraping in Python

///Beautiful Soup Tutorial – Web Scraping in Python

Beautiful Soup Tutorial – Web Scraping in Python

FavoriteLoadingAdd to favorites

The Beautiful Soup module is used for web scraping in Python. Learn how to use the Beautiful Soup and Requests modules in this tutorial. After watching, you will be able to start scraping the web on your own.

💻Code:

Tutorial from Vincent Russo of Lucid Programming. Check out his YouTube channel:

🐦Vincent on Twitter: @captainhamptons

Learn to code for free and get a developer job:

Read hundreds of articles on programming:

And subscribe for new videos on technology every day:

source

By |2019-09-24T22:59:12+00:00September 24th, 2019|Python Video Tutorials|48 Comments

48 Comments

  1. LucidProgramming September 24, 2019 at 10:59 pm - Reply

    Hi everyone, LucidProgramming here, the creator of this video series. If you enjoyed the video, I'm really thrilled to hear that. If you like this type of content, head on over to my channel and consider subscribing for staying up-to-date with similar types of videos. If you have any requests, recommendations, I take these seriously to heart and am constantly trying to improve the content of my channel. Thanks again, and happy coding!

  2. that guy September 24, 2019 at 10:59 pm - Reply

    pip does not work for command prompt idk why

  3. Sachin Negi September 24, 2019 at 10:59 pm - Reply

    clean and straightforward

  4. Sakib September 24, 2019 at 10:59 pm - Reply

    what kind of project is this? i cant create Beatifulsoup class in a py file

  5. Yacine Freifer September 24, 2019 at 10:59 pm - Reply

    when typing import requests i get "import requests is not recognized as a cmdlet "

  6. Fatima Zahra September 24, 2019 at 10:59 pm - Reply

    plz how can i stock this data in database phpmyadmin???

  7. nick brownrigg September 24, 2019 at 10:59 pm - Reply

    took me 3 hours just to figure out i need to add the Lib/site-packages in visual studios just to import requests. what a drag. idk how i even ended up on this page

  8. Arush James September 24, 2019 at 10:59 pm - Reply

    I am your 1000th like

  9. Akshay Sapra September 24, 2019 at 10:59 pm - Reply

    In the white house example, can you find the 'a' tag directly without first finding the 'h2' tag?

  10. Rhishabh Deshpande September 24, 2019 at 10:59 pm - Reply

    A very neat and cogent description of the Beautiful Soup documentation. Thanks for this video, makes learning much more easier! 🙂

  11. robin vermillion September 24, 2019 at 10:59 pm - Reply

    answering my own question . pip install lxml

  12. robin vermillion September 24, 2019 at 10:59 pm - Reply

    File "untitled", line 40, in <module>

    soup = BeautifulSoup(src, 'lxml')

    File "C:UsersRobAppDataLocalProgramsPythonPython37-32libsite-packagesbs4_init__.py", line 196, in __init_

    % ",".join(features))

    bs4.FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library?
    do you know what might be causing this traceback error?

  13. Mustafa Khan September 24, 2019 at 10:59 pm - Reply

    informative, nice and one of the best videos on web scrapping thank you

  14. Cosmetolog D September 24, 2019 at 10:59 pm - Reply

    Thanks for video, really enjoyed!
    P.s. subscribed

  15. артур магомедов September 24, 2019 at 10:59 pm - Reply

    beautiful

  16. Shivam Gupta September 24, 2019 at 10:59 pm - Reply

    when running this code :
    h2 = soup.find_all('h2')

    I m not able to scrap any of the lists for "h2" class while I' m able scrap for "a".
    kindly reply

  17. Karen Wong September 24, 2019 at 10:59 pm - Reply

    very easy to follow! clear explanation! thx for the tutorial!

  18. Jean DAVID September 24, 2019 at 10:59 pm - Reply

    i wonder how I really did get the scrape.html into my pc to test the python scripts !!!!!

  19. Lohe221 September 24, 2019 at 10:59 pm - Reply
  20. Nour SIDAOUI September 24, 2019 at 10:59 pm - Reply

    This is the fourth web scraping tutorial and its by far the most beginner friendly one.
    Perfectly clear and well simplified!
    You've done a great job man!
    Peace

  21. Peter Steele September 24, 2019 at 10:59 pm - Reply

    I really enjoyed the video, however I have some feedback regarding what was said. Around the 5-6 min mark you are talking about creating a variable "soup" to store the object information in. "Soup" is an object being instantiated from the BeautifulSoup class itself and is not a variable. Also you mention that "find_all" is a method to the BeautifulSoup object above which is a class not an object. You do correctly reference "soup" as an object after the fact but I only wanted to point this out as it may be confusing for some viewers. I find its important to use the terminology correctly to help those learning to understand what is going on, and when we mix things up when making these videos, it can often lead to confusion. I wont pick apart the whole video, but just wanted to point out a couple of the early things I noticed. To me its like calling a method a function even though they are similar they are not the same. Please don't get upset over me pointing this out. I just want to help make the terminology consistent with what is going on so you can make more accurate videos for those learning off of these in the future. Thanks again for the tutorial!

  22. Andrew Klauber September 24, 2019 at 10:59 pm - Reply

    CAPTIAN HAMPTON YOU SEXY BEAST

  23. VRUSH RAUT September 24, 2019 at 10:59 pm - Reply

    My name is Vrushabh Raut ! I am basically working as a freelancer ! I need a help to make a video on reviews of data scraping
    software ! Basically i need full 5 minute review video that shows how to use the software ! If you are interested kindly reply on my
    email given below ! It is beneficial for me to promote the site by the medium of you ! Kindly reply me sir !

  24. Andrea DWS September 24, 2019 at 10:59 pm - Reply

    I need to work through a huge 10k html page and i need to select a lot of tables with different classe and text. i want to kill myself

  25. Rhino plasty September 24, 2019 at 10:59 pm - Reply

    when i do the first step to ope google.com it says ModuleNotFoundError: No module named 'requests'
    thank you for any help

  26. Jesse Gonźalez September 24, 2019 at 10:59 pm - Reply

    Hello I first started this tutorial last month or so and my terminal came up with

    "soup = BeautifulSoup(src, "lxml")
    File "/Library/Python/2.7/site-packages/bs4/_init__.py", line 152, in __init_
    % ",".join(features))
    bs4.FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library?"

    Re-attempted, copied the code from your git repo, nada.

    I've been wanting to create some projects, and I especially like the idea of webscrapping.

    Today I re-attempted by starting over, and the error above came up. Crap, now what.

    I googled it and found this a Stack Overflow article with the same issue: https://stackoverflow.com/questions/24398302/bs4-featurenotfound-couldnt-find-a-tree-builder-with-the-features-you-requeste

    Fix: sudo pip install lxml
    *close all terminals and re-navigate to the file.

    Viola!

    The power of Google, I'm so juiced! Thanks for the amazing video!

  27. Boka Bosiljcic September 24, 2019 at 10:59 pm - Reply

    If I comment "This is beautiful ", would I be stating the obvious? 🙂 Great work, keep it up. Will subscribe to your channel.

  28. Jordy Mcnab September 24, 2019 at 10:59 pm - Reply

    Nice and simple. Would advise maybe throwing the return type of the different methods i.e result.status_code will return an int 200 not string 200. This would be good training for beginners to always account for the data type returned when handling the variable.

  29. BaTtErY_aCiD_ September 24, 2019 at 10:59 pm - Reply

    could you please make a tutorial for
    scikit learn

  30. Noah Iboa September 24, 2019 at 10:59 pm - Reply

    Is there any reason that the instructor refers to things like .attrs and .text as functions? Aren't they attributes in object oriented programming terms (as opposed to a function/method)? I ask because he specifically "corrected" himself at 9:35 from attribute to function.

  31. Dwane Ho September 24, 2019 at 10:59 pm - Reply

    Hi can ask what kind of potato get system are you using for this video? It looks like Linux, but it looks not like Ubuntu, and also can you explain why you are using this? Thanks!

  32. luu vinh tuong September 24, 2019 at 10:59 pm - Reply

    amazing tutorial which is very helpful

  33. Christine Felts September 24, 2019 at 10:59 pm - Reply

    2:08 6:58 find links content <a class = to a>

  34. Prashant Arora September 24, 2019 at 10:59 pm - Reply

    Hey please make a typescript tutorial video bcz there is no any good resource for it except official documentation

  35. Gabriel Paez September 24, 2019 at 10:59 pm - Reply

    excellent, thanks a lot!!

  36. Ray Charles September 24, 2019 at 10:59 pm - Reply

    Great course thank you

  37. Rohit Verma September 24, 2019 at 10:59 pm - Reply

    Can you plz upload a video on Kotlin and Android Development

  38. Amilcar C. da Silva September 24, 2019 at 10:59 pm - Reply

    Amazing tutorial video. Fantastic! Well explained….thanks a lot tutor….I really like this tutorial…I have learnt about this web scraping in my class of data mining….I was not really clear about this point…..but this video tutorial with the coding in python makes me better understand …..once again huge thanks to you…..You render great service to many.

  39. Kerry Weston September 24, 2019 at 10:59 pm - Reply

    Instead of using the White House website, can you use Amazon? I'd like to see how to collect all the product data from Amazon and store it in a database. This would be useful.

  40. ria roussou September 24, 2019 at 10:59 pm - Reply

    I like programming 😀

  41. raffaele rimorso September 24, 2019 at 10:59 pm - Reply

    good tutorials

  42. nom nom September 24, 2019 at 10:59 pm - Reply

    on Instagram @t0nystark007…….If you like meme's please have a look at our page and follow us for more… We're posting daily and share our page with your friends… It means a lot to us .. thank you……

  43. jhkkjhk September 24, 2019 at 10:59 pm - Reply

    This is how you get banned by Google =D

  44. Host Promo September 24, 2019 at 10:59 pm - Reply

    Vim is the master editor!

  45. Silently Sceptical September 24, 2019 at 10:59 pm - Reply

    Where do I plug in the Dreamweaver sonny?!

  46. Where's the soup at though?

  47. Sihab Tutorials September 24, 2019 at 10:59 pm - Reply

    which software you use for write code?

  48. Gigel Chiazna September 24, 2019 at 10:59 pm - Reply

    wht not regex al urls in page source?

Leave A Comment

*