- bugs: Fix
traverse_nested
when value is not compatible withItemAdapter
(PR#450) - bugs: Fix default templates non-subscriptable issue (PR#451)
- chore: Update codecov action (PR#453)
- docs: Fix broken link (PR#452)
- feature: Allow to filter jobs in ZyteJobsComparisonMonitor by close_reason (PR#440)
- feature: Apply fingerprint rules to group issues in sentry alerts (PR#455)
- bugs: Fix
_get_jobs
to use the proper parameter for tag filtering inZyteJobsComparisonMonitor
(PR#446) - bugs: Fix
_get_jobs
method returning an incorrect number of jobs inZyteJobsComparisonMonitor
(PR#444) - docs: Add
SPIDERMON_MONITOR_SKIPPING_RULES
documentation and examples on settings page (PR#447) - chore: Add Python 3.12 support (PR#443) (PR#448)
- bug: Fix CI/CD pipelines not working due to changes on Scrapy (PR#426)
- bug: Fix TypeError in
PeriodicItemCountMonitor
when item_scraped_count was not defined the first time it was checked (PR#436) - bug: Fix
calculate_field_coverage
incorrect handling of field names ending with _items (PR#437) - bug: Fix monitor failing when operating offset-naive and offset-aware datetimes (PR#439)
- bug: Fix tests for minimum properties and items in JSON Schema validations (PR#432)
- feature: Add the
SPIDERMON_DICT_FIELDS_COVERAGE_LEVELS
setting to limit nested dict stats computation (PR#433) - feature: Add
SendSNSNotification
to Spidermon Notifications (PR#413) - feature: Change the format of the content of _validation from defaultdict to dict (PR#431)
- feature: Add support for nested fields in the
SPIDERMON_VALIDATION_ERRORS_FIELD
setting (PR#416) - bug: Fix ItemValidationPipeline using the
__setitem__
method without ItemAdaper (PR#415) - feature: Improve test coverage for the item validation pipeline (PR#414)
- bug: Fix
spidermon.utils.zyte.Client
not following crawler settings (PR#411) - chore: Migrate ReadTheDocs to config file v2 (PR#408)
- chore: Refactor package requirements (PR#407)
- The
SPIDERMON_VALIDATION_ERRORS_FIELD
setting no longer supports a field name that contains a dot (.
). Dots are now be treated as field separators, to support specifying a nested field.
- bug: Fix
JobTagsAction
failing due toJobMetadata
not longer supporting__setitem__
method (PR#404) - feature: Add
PeriodicItemCountMonitor
to check for increase in item count. Also includedPeriodicItemCountMonitorSuite
suite (PR#402) - chore: Deprecate support for data validation using
schematics
(PR#399) - chore: Drop support for Python 3.6 and Python 3.7. Added support for Python 3.11 (PR#398)
- feature: Add ability to pass kwargs to Slack APIs (PR#397)
- feature: Support setting the
Return-Path
for Amazon SES emails (PR#381) - bug: Fix
BaseStatMonitor
failing in Scrapy Cloud when settings were provided as strings (PR#378) - feature: Add setting
SPIDERMON_FIELD_COVERAGE_SKIP_IF_NO_ITEM
to allow skippingFieldCoverageMonitor
if no items were scraped (PR#372) - feature: Add
Fallback Actions
.Action
now allows to definefallback
field that may contain an additionalAction
to be executed if an exception is raised during the main action (PR#365) - feature: Use
ItemAdapter
when working with items to support the same types of item as Scrapy (PR#358) - chore: Refactor code from
spider.contrib.scrapy
intobase
,monitors
andsuites
subpackages (PR#386) - chore: Replace
tox pep8
functionality withpre-commit
git hooks (PR#387) - chore: Update contributing guidelines to include reference to
pre-commit
tool (PR#392)
- bug: Fix Slack dependency name issue (PR#367)
- chore: Change the Ubuntu version on workflow settings (PR#373)
- docs: Adding PeriodicExecutionTimeMonitor to the batteries docs (PR#368)
- feature: Adding the use of ItemAdapter to prevent assumptions of item nature (PR#358)
- misc: Fix compatibility issues with jsonschema>=4 (PR#364)
- feature: Updated DownloaderExceptionMonitor and ItemValidationMonitor to inherit from BaseStatMonitor (PR#334, PR#335)
- feature: Updated Slack action to use slack-sdk as library in replacement of deprecated slackclient (PR#313)
- feature: Added new action to allow to send notification to Discord channels (PR#348)
- feature: Added Python 3.10 support (PR#349)
- feature: Added new action to allow to send email notifications using SMTP server (PR#345)
- misc: small bug fixes and documentation improvements that can be checked in the milestone summary.
- feature: Create base class to aid the creation of custom monitors that only validates against a job stat value (PR#325)
- feature: Add built-in monitor for critical errors (PR#329)
- feature: Use new base class to implement some built-in monitors (PR#326 PR#327 PR#328)
- feature: Add new built-in monitors for common validations (PR#284)
- bug: Allow Slack bot to send notification correctly even if an icon URL is not defined to the bot (PR#307)
- bug: Fix regex to match validation error message from schematics library (PR#310)
- chore: Remove six library and upgrade Python syntax (PR#270)
- chore: Remove travis and configure Github Actions (PR#291)
- chore: Add Github Actions support and remove Travis
- chore: Pin jsonschema version to 3.2.0 to avoid problems with newest version that has backward incompatible changes
- chore: Pin schematics version to 2.1.0 to avoid problems with newest version that has backward incompatible changes
- feature: Improve content of Sentry messages (PR#279)
- bug: Replace boto with boto3 for Amazon SES work correctly (issue#285)
- feature: Built-in monitor for field and item coverage (issue#253)
- feature: Add field coverage statistics (PR#262)
- chore: Update required slackclient version (PR#265)
- chore: Add Python 3.8 support (issue#255)
- chore: Drop Python 3.5 support (issue#266)
- chore: Remove test decorator that skips tests if executed in old Python versions (PR#258)
- chore: Fix deprecation warnings (PR#272, PR#274)
- docs: Fix inconsistent examples (PR#273)
- bug: Fix Telegram action error logging
- feature: Disable item validation pipeline when Spidermon is disabled
- feature: Item validation built in monitor
- chore: Removed Python 2.7 support
- docs: Improved documentation organization
- Fixed version 1.12.1 changelog
- bugfix: AttributeError when using ValidationMonitorMixin (issue)
- docs: How-To Guide - Adding required fields coverage validation (pull request)
- Dropped python 3.4 support
- Added action to send monitor reports to Telegram
- Added fallback to scrapy AWS settings
- Logged errors from Slack API calls
- Allowed to define SPIDERMON_SLACK_RECIPIENTS setting as a comma-separated string with the desired recipients
- Read SES settings with getlist
- Added documentation of Expression Monitors
- Improved Slack action documentation
- Fixed sphinx warnings when building docs
- Fixed warnings in docs build
- Validate docs build in CI
- Applied and enforced black formatting on spidermon source code
- Configured test coverage reporting in project
- Allowed per-field checking in ValidationMonitorMixin
- Added option to set AWS Region Name on SES E-Mail action
- Added default value for 'SPIDERMON_BODY_HTML_TEMPLATE' setting
- Fixed bug in logging of Slack messages when fake setting is enabled
- Enforced lxml 4.3.5 or lower for Python 3.4
- Improved stats history documentation
- Version 1.10.1 with CHANGELOG updated
- Allowed to add absolute location for custom templates
- Added new StatsCollector that access stats data from previous spider executions.
- Added new setting to define the max number of unwanted HTTP status codes allowed in built-in monitor.
- Improved validation error messages with JSON Schema when additional fields are found.
- Made possible to retrieve JSON schema files from external locations.
- Included documentation of periodic monitor suites.
- Fixed bug caused by new slackclient release.
- Other small documentation improvements.
- Add set of built-in basic monitors with the most common test methods to allow start monitoring spiders more straightforward.
- Add SendSentryMessage action to send notifications to Sentry containing the results of Spidermon execution.
- Add SPIDERMON_ENGINE_STOP_MONITORS setting to list monitors to be executed when the Scrapy engine is stopped.
- Fix bug that prevented the use of custom model-level validators in schematics models.
- Refactor JSONSchemaValidator to allow select different versions of JSON Schema.
- Refactor requirements in setup.py to include missing required dependencies.
- Fix bug caused by backward incompatible change in jsonschema 3.0.0.
- Fix example code of tutorial.
- Install documentation improvements.
- Remove CreateJobReport action.
- Include new documentation and tutorial code.
- Rename internal method in MonitorRunner to fix typo.
- Support universal wheels.
- Skip authentication and recipient settings when running in fake mode.
- Add SPIDERMON_EMAIL_CONTEXT setting to pass custom contexts to email actions.
- Add support for Schematics 2.1.0.
- Convert the job ID tag into a clickable button.
- Avoid requests to get the amount of lines in the log by default, because
they consume too much memory and they are very slow. You can still use
the old behavior adding
show_log_count
to the context before creating the email message. - Refactor the requirements in setup.py.
- Update the Sphinx configuration.
- Add support for periodic monitors in the Scrapy extension.
- Modify ItemValidationPipeline in order to support dict objects in addition to Scrapy.Item objects.
- Refactor ItemValidationPipeline to make it easier to extend this class.
- Add Schematics 2.* support. Note that Schematics 2.0.0 introduced many changes to its API and even some validation rules have a slightly different behaviour in some cases.
- ItemValidationPipeline optimisations for cases where no validators can be applied.
- Add Python 3 support.
- Run tests on Python 2 and Python 3.
- Add dependencies for optional validation features to setup.py.
- Import HubstorageClient from the scrapinghub library if available.
- Replace dash.scrapinghub.com with app.scrapinghub.com.
- Rename attachements attribute in the SendSlackMessage class to attachments.
- Add the SPIDERMON_ENABLED setting to control if the Scrapy extension should run (note that it is disabled by default).