Skip to content

Parse multiple local html files instead of keywords #76

@pankajjha-cd

Description

@pankajjha-cd

Hi,

Thanks a lot for this great package. This is an awesome package and really worth all the starts it gets.

I was wondering how can I feed multiple local html files to the scraper instead of keywords.

So basically I just want to use the parser module of the package to scrape html files that are already downloaded in the local file system.

I see there's a scrape_from_file config option but it takes a single file and still needs a keyword array(even though the scrapper doesn't use the keyword).

may be something like scrape_file_loop instead of keyword scrapping loop where I can assign an array of files or URLs that are ready to parse.

Is there any way I can achieve this?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions