
soup.find() returning 'None', resulting in AttributeError: 'NoneType' object has no attribute 'get_text' #20

Open
CoderStylus opened this issue Apr 30, 2024 · 2 comments

Comments

@CoderStylus

CoderStylus commented Apr 30, 2024

Traceback (most recent call last):
  File "/workspaces/pcpartpickertest/index.py", line 7, in <module>
    parts = pcpp.part_search("i7")
  File "/home/codespace/.python/current/lib/python3.10/site-packages/pypartpicker/scraper.py", line 232, in part_search
    soup = self.__make_soup(f"{search_link}&page={i + 1}")
  File "/home/codespace/.python/current/lib/python3.10/site-packages/pypartpicker/scraper.py", line 95, in __make_soup
    if "Verification" in soup.find(class_="pageTitle").get_text():
AttributeError: 'NoneType' object has no attribute 'get_text'

The error is returned by this example code:

from pypartpicker import Scraper

pcpp = Scraper()
parts = pcpp.part_search("i7")


for part in parts:
    print(part.name)

first_product_url = parts[0].url
product = pcpp.fetch_product(first_product_url)
print(product.specs)

The last frame of the traceback tells us that the error occurs at scraper.py, line 95, in __make_soup.

The key is in the statement:

if "Verification" in soup.find(class_="pageTitle").get_text():

The call to soup.find is expected to return a tag object on which get_text can be called to retrieve the element's text. But when soup.find does not match anything, it returns None instead, so there is no object to call get_text on. That is what produces the error message: AttributeError: 'NoneType' object has no attribute 'get_text'

This may be a problem with the scraper.py code or with BeautifulSoup itself.
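A defensive version of that check would avoid the crash by guarding against None before calling get_text(). The sketch below uses small stand-in classes (not BeautifulSoup itself, which is assumed here) to mimic find() returning None when nothing matches:

```python
class FakeTag:
    """Stand-in for a BeautifulSoup Tag."""
    def __init__(self, text):
        self._text = text

    def get_text(self):
        return self._text

class FakeSoup:
    """Stand-in for a BeautifulSoup document."""
    def __init__(self, tag=None):
        self._tag = tag

    def find(self, class_=None):
        # Like BeautifulSoup, find() returns None when nothing matches
        return self._tag

def is_verification_page(soup):
    title = soup.find(class_="pageTitle")
    # Guard against None before calling get_text()
    return title is not None and "Verification" in title.get_text()

print(is_verification_page(FakeSoup(FakeTag("Verification Required"))))  # True
print(is_verification_page(FakeSoup(None)))                              # False
```

With this pattern, a missing pageTitle element simply yields False instead of raising AttributeError.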

@thefakequake
Owner

This issue is likely occurring due to a Cloudflare bot verification check when the library makes the request to pcpartpicker.
This can be solved by using custom HTTP headers, the same as the ones your browser uses, in order to bypass the check.

When creating the instance of the Scraper class, pass in a headers dictionary containing the same HTTP headers your browser sends.
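A minimal sketch of that suggestion, assuming the Scraper constructor accepts a headers keyword as described above; copy the actual values from your browser's network inspector:

```python
# Browser-like headers; replace the placeholder values with the ones
# your own browser sends (visible in the network tab of its dev tools).
browser_headers = {
    "User-Agent": "Mozilla/5.0 ...",        # your browser's UA string
    "Accept": "text/html,application/xhtml+xml",
    "Accept-Language": "en-US,en;q=0.9",
}

# Then, per the maintainer's note (keyword name assumed from the
# comment above; requires pypartpicker to be installed):
# from pypartpicker import Scraper
# pcpp = Scraper(headers=browser_headers)
# parts = pcpp.part_search("i7")
print(sorted(browser_headers))
```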

@thefakequake
Owner

I will think about adding a new CloudflareCheck error to the library to make it clearer when this happens, as the current error is confusing.
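A hypothetical sketch of what such an error could look like; the class name and where the check lives are assumptions, not current pypartpicker API:

```python
class CloudflareCheckError(Exception):
    """Raised when pcpartpicker serves a Cloudflare verification page."""

def ensure_not_cloudflare(page_title):
    # page_title is the text of the "pageTitle" element,
    # or None if the element was not found in the response.
    if page_title is not None and "Verification" in page_title:
        raise CloudflareCheckError(
            "Cloudflare verification page detected; "
            "try passing browser-like HTTP headers to Scraper."
        )

ensure_not_cloudflare("Product Search")  # a normal page passes silently
```

Raising a named exception like this would make the failure mode self-describing instead of surfacing as an AttributeError deep inside __make_soup.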
