-
Notifications
You must be signed in to change notification settings - Fork 183
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
"NoneType" in title_el.get("href") when scraping kleinanzeigen #515
Comments
Hi @alvarnydev, I've not seen that before, no. It seems to pick out the title elements (at least on my crawls) without complaining. Looks like the search for the |
Hey thanks for the comment. I haven't really looked into it much further because the docker compose config just restarts and works fine from there, until it eventually crashes again, in perpetuum. When I have the time I'll look into it more |
Unfourtunately got the same problem. |
Here's how I parse Kleinanzeigen. Maybe it helps in providing a fix: expose_ids = soup.find_all("article", class_="aditem")
for x, expose in enumerate(expose_ids):
title = expose.find(class_="ellipsis") |
Hi you lovely people!
I currently run into an issue when scraping kleinanzeigen because the bot seems to have trouble getting the link from the current listing it parses over, sometimes. It works for a while and eventually breaks. Looks like this for me:
I looked through the existing and past issues and didn't find anything similar. Nevertheless, have you guys maybe seen this before? I run the Docker image of flathunter using docker compose on an Ubuntu 22 machine with the following config:
The text was updated successfully, but these errors were encountered: