Scraper rules don't work with theguardian.com #2688

advert665 · 2024-06-11T09:35:37Z

The Guardian publishes summaries in thier rss feeds, so I want to use the scraper rules to load the full content from the corresponding webpage. However, when I use a selector that corresponds to the desired content on the webpage it won't load.

For instance, using div#maincontent or p.dcr-iy9ec7, fails to change the resulting article in miniflux for the following feed, even though they select elements in the linked pages: https://www.theguardian.com/theguardian/mainsection/topstories/rss

Similarly, using picture to extract the cartoons from https://www.theguardian.com/profile/martinrowson/rss (with or without the add_dynamic_image rule), fails to load anything in miniflux.

Other RSS apps like Lire are able to load the full articles so it's not a Guardian issue specifically. Am I doing something wrong or is this a Miniflux limitation? Thanks!

The text was updated successfully, but these errors were encountered:

advert665 added feed problems triage needed labels Jun 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scraper rules don't work with theguardian.com #2688

Scraper rules don't work with theguardian.com #2688

advert665 commented Jun 11, 2024 •

edited

Loading

Scraper rules don't work with theguardian.com #2688

Scraper rules don't work with theguardian.com #2688

Comments

advert665 commented Jun 11, 2024 • edited Loading

advert665 commented Jun 11, 2024 •

edited

Loading