@@ -168,6 +168,20 @@ Whether to use a Chrome web browser (true/false). Defaults to false.
168
168
If true, it renders the URL via a JavaScript-enabled web browser and extracts the HTML after rendering (see
169
169
:ref: `above <use_browser >` for important information).
170
170
171
+ compared_versions
172
+ ^^^^^^^^^^^^^^^^^
173
+ Number of saved snapshots to compare against (int). Defaults to 1.
174
+
175
+ If set to a number greater than 1, instead of comparing the current data to only the very last snapshot captured, it
176
+ is matched against any of *n * snapshots. This is very useful when a webpage frequently changes between several known
177
+ stable states (e.g. they're doing A/B testing), as changes will be reported only when the content changes to a new
178
+ unknown state, in which case the differences are shown relative to the closest match.
179
+
180
+ Refer to the command line argument ``--max-snapshots `` to ensure that you are saving the number of snapshots you need
181
+ for this directive to run successfully (default is 4) (see :ref: `here<max-snapshots> `).
182
+
183
+ .. versionadded :: 3.10.2
184
+
171
185
cookies
172
186
^^^^^^^
173
187
Cookies to send with the request (a dict).
@@ -194,7 +208,7 @@ http_proxy
194
208
Proxy server to use for HTTP requests (a string). If unspecified or null/false, the system environment variable
195
209
``HTTP_PROXY ``, if defined, will be used.
196
210
197
- E.g. ``
\ http://username:[email protected] :8080``.
211
+ E.g. ``
http://username:[email protected] :8080 ``.
198
212
199
213
.. versionchanged :: 3.0
200
214
Works for all ``url `` jobs, including those with ``use_browser: true ``.
@@ -204,7 +218,7 @@ https_proxy
204
218
Proxy server to use for HTTPS (i.e. secure) requests (a string). If unspecified or null/false, the system environment
205
219
variable ``HTTPS_PROXY ``, if defined, will be used.
206
220
207
- E.g. ``
\ https://username:[email protected] :8080``.
221
+ E.g. ``
https://username:[email protected] :8080 ``.
208
222
209
223
.. versionchanged :: 3.0
210
224
Works for all ``url `` jobs, including those with ``use_browser: true ``.
@@ -332,7 +346,32 @@ The following directives are available only for ``url`` jobs without ``use_brows
332
346
333
347
no_redirects
334
348
^^^^^^^^^^^^
335
- Disable GET/OPTIONS/POST/PUT/PATCH/DELETE/HEAD redirection (true/false). Defaults to false.
349
+ Disables GET, OPTIONS, POST, PUT, PATCH, DELETE, HEAD redirection (true/false). Defaults to false (i.e. redirection
350
+ is enabled) for all methods except HEAD. See more `here
351
+ <https://requests.readthedocs.io/en/latest/user/quickstart/#redirection-and-history> `__. Redirection takes place
352
+ whenever an HTTP status code of 301, 302, 303, 307 or 308 is returned.
353
+
354
+ Example:
355
+
356
+ .. code-block :: yaml
357
+
358
+ url : " https://donneespubliques.meteofrance.fr/donnees_libres/bulletins/BCM/203001.pdf"
359
+ no_redirects : true
360
+ filter :
361
+ - html2text :
362
+
363
+ Returns:
364
+
365
+ .. code-block ::
366
+
367
+ 302 Found
368
+ ---------
369
+
370
+ # Found
371
+ The document has moved [here](https://donneespubliques.meteofrance.fr/?fond=donnee_indisponible).
372
+ * * *
373
+ Apache/2.2.15 (CentOS) Server at donneespubliques.meteofrance.fr Port 80
374
+
336
375
337
376
.. versionadded :: 3.2.7
338
377
0 commit comments