Querying <div class="name"> in Python

Question

I am trying to follow the guide posted here: https://medium.freecodecamp.org/how-to-scrape-websites-with-python-and-beautifulsoup-5946935d93fe

I am at this point, where I am supposed to get the name of presumably the stock.

Take out the div of name and get its value

name_box = soup.find(‘h1’, attrs={‘class’: ‘name’})

I suspect I will also have trouble when querying the price. Do I have to replace 'price' with 'priceText__1853e8a5' as found in the html?

get the index price

price_box = soup.find(‘div’, attrs={‘class’:’price’})

Thanks, this would be a massive help.

Andrej Kesely Andrej Kesely · Accepted Answer · 2018-08-02T05:16:14

If you replace price with priceText__1853e8a5 you will get your result, but I suspect that the class name changes dynamically/is dynamically generated (note the number at the end). So to get your result you need something more robust.

You can target tags in BeautifulSoups with CSS selectors (with select()/select_one() methods. This example will target all <span> tags with class attribute that begins with priceText (^= operator - more info about CSS selectors here).

from bs4 import BeautifulSoup
import requests

r = requests.get('https://www.bloomberg.com/quote/SPX:IND')
soup = BeautifulSoup(r.text, 'lxml')

print(soup.select_one('span[class^="priceText"]').text)

This prints:

2,813.36

Querying <div class="name"> in Python

Take out the div of name and get its value

get the index price

2 Answers