Unicode handling of --include and --exclude #145
Comments
This looks correct. The best way to know is by testing it, and I would love to see the result of such a test. If you can build this directory tree, just serve it with a web server and try running suckit against localhost.
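One way to set up such a test is sketched below. It is only an illustration: the file names in `SAMPLE_FILES` are hypothetical (the actual tree from the issue is not reproduced here), and `build_tree` is a helper invented for this sketch. Every generated page links to every file so that a crawler starting at the root can reach them all.

```python
import tempfile
from pathlib import Path
from urllib.parse import quote

# Hypothetical layout, NOT the tree from the issue.
SAMPLE_FILES = [
    "index.html",
    "Books/English/index.html",
    "Books/Français/index.html",
    "static/style.css",
]


def build_tree(root, files=SAMPLE_FILES):
    """Create each file under root and return the list of created paths.

    Every file gets the same set of percent-encoded links to all files,
    so a crawler can discover the whole tree from any page.
    """
    links = "".join(f'<a href="/{quote(rel)}">{rel}</a>\n' for rel in files)
    created = []
    for rel in files:
        target = Path(root) / rel
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_text(
            f"<html><body><h1>{rel}</h1>\n{links}</body></html>",
            encoding="utf-8",
        )
        created.append(target)
    return created


if __name__ == "__main__":
    root = Path(tempfile.mkdtemp(prefix="suckit-test-"))
    build_tree(root)
    print(f"Serve with: python -m http.server 8000 --directory {root}")
```

With the tree served locally (port 8000 is an arbitrary choice), the command from the issue can be pointed at `http://localhost:8000` with the same `-i` pattern, and the downloaded directories compared against the expected ones.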
@mr-bo-jangles Did it work?
Maybe we can add an option to output URL filtering information to stdout or a file, e.g., whether the include or exclude regex matched?
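A rough sketch of what such filtering output could look like, written in Python for illustration only. The function name `should_visit`, the log format, and the rule order (exclude checked first, patterns matched anywhere in the URL) are all assumptions of this sketch, not suckit's actual behavior.

```python
import re
import sys


def should_visit(url, include=None, exclude=None, log=sys.stderr):
    """Return True if url passes the filters, logging which rule decided.

    Assumptions of this sketch: exclude wins over include, and both
    patterns are searched anywhere in the URL.
    """
    if exclude is not None and exclude.search(url):
        print(f"SKIP {url} (matched exclude {exclude.pattern!r})", file=log)
        return False
    if include is not None and not include.search(url):
        print(f"SKIP {url} (did not match include {include.pattern!r})", file=log)
        return False
    print(f"KEEP {url}", file=log)
    return True


if __name__ == "__main__":
    include = re.compile(r"/Books/")
    exclude = re.compile(r"\.css$")
    for candidate in (
        "https://domain.tld/Books/English/",
        "https://domain.tld/static/style.css",
    ):
        should_visit(candidate, include, exclude)
```

Writing the decisions to a separate stream or file keeps them out of the way of normal output while still making it possible to see why a given URL was skipped.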
Good idea
What do you mean?
To download a phpBB forum, I added a hack to rewrite some URLs, namely remove a I originally planned to flesh this out into a dedicated feature / command line option, but eventually didn't. I already achieved my goal and I could not figure out a way to do it properly.
The problem with removing parameters such as
In general you are correct, but in the specific case of phpBB the content is always the same, no matter the I actually just found a different solution, namely to send session cookies, which avoids
We could imagine a solution where you would have a list of tuples, each holding a regex and a list of arguments to remove: Vec<(regex, Vec<parameter>)>. But it might be really costly.
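The list-of-tuples idea can be sketched as follows, in Python for illustration. The rule contents (the `/forum/` pattern and the `token` parameter) and the name `strip_params` are hypothetical and only show the shape of the data structure. Each URL is checked against each rule, so the work grows with the number of rules times the number of URLs, which is where the cost concern comes from.

```python
import re
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical rules: (URL pattern, query parameters to drop when it matches).
RULES = [
    (re.compile(r"/forum/"), ["token"]),
]


def strip_params(url, rules=RULES):
    """Remove the query parameters listed by every rule whose regex matches."""
    drop = set()
    for pattern, params in rules:
        if pattern.search(url):
            drop.update(params)
    if not drop:
        return url  # no rule applies, leave the URL untouched

    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in drop
    ]
    return urlunsplit(parts._replace(query=urlencode(kept)))


if __name__ == "__main__":
    print(strip_params("https://domain.tld/forum/view?t=12&token=abc"))
```

Such a rewrite is only safe when the removed parameter does not change the page content, which is the caveat raised earlier in the thread.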
So my specific use case here is attempting to mirror a site with a lot of directories in various languages, while skipping the static files at a higher level.
Example Folder Structure
I want to be sure that by running a command similar to
suckit https://domain.tld -i "/Books/[a-Z0-9]+/"
I will download the tree under /Books/ while excluding anything under ./, ../, and ===/.
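Before crawling, the pattern itself can be checked against a few sample paths. The sketch below uses Python's `re` module as a stand-in, so it says nothing definitive about suckit's own regex engine; the sample paths and helper names are hypothetical. In Python, the range `a-Z` is rejected because `a` sorts after `Z`, and an ASCII-only class such as `[a-zA-Z0-9]` does not match non-ASCII directory names, whereas `\w` does.

```python
import re


def compile_or_report(pattern):
    """Return the compiled regex, or None after printing the compile error."""
    try:
        return re.compile(pattern)
    except re.error as exc:
        print(f"invalid pattern {pattern!r}: {exc}")
        return None


def matching_paths(pattern, paths):
    """Return the paths in which the pattern is found (empty if it is invalid)."""
    regex = compile_or_report(pattern)
    if regex is None:
        return []
    return [path for path in paths if regex.search(path)]


# Hypothetical sample paths, not taken from the issue.
SAMPLE_PATHS = [
    "/Books/English/page.html",
    "/Books/Français/page.html",
    "/static/style.css",
]

if __name__ == "__main__":
    for candidate in (r"/Books/[a-Z0-9]+/", r"/Books/[a-zA-Z0-9]+/", r"/Books/\w+/"):
        print(candidate, matching_paths(candidate, SAMPLE_PATHS))
```

Running the same sample paths through the tool's own engine is still the reliable check, since character-class and Unicode rules differ between regex implementations.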