Hey people, first of all, thanks for taking the time to help improving this project 🍻
This project needs the help of other developers, since it's impossible getting access to all relevant banks and credit cards.
Any kind of help is welcome, even if you just discover an issue and don't have the time to invest in helping to fix it.
While there's no specific template for creating a new issue, please take the time to create a clear description so that it is easy to understand the problem.
In order to run tests you need first to create test configuration file ./src/tests/tests-config.js
from template ./src/tests/tests-config.tpl.js
. This file will be used by jest
testing framework.
IMPORTANT: Under
src/tests
folder exists.gitignore
file that ignore the test configuration file thus this file will not be commited to github. Still when you create new PRs make sure that you didn't explicitly added it to the PR.
This library supports both testing against credit card companies / banks api and also against mock data. Until we will have a good coverage of scrapers test with mock data, the default configuration is set to execute real companies api tests.
Modify property options
in the test configuration file. This object is passed as-is to the scraper.
Enable any company you wish to test by providing its credetials in the test configuration file under credentials
property.
Before running any tests, make sure you created the test configuration file with relevant credentials,
To run all tests of companies that you provided credentials to:
npm test
To run specific describe
(a.k.a suite), use the testNamePattern
arg with the name of the suite. The following will run the all tests under Leumi legacy scraper
suite.
npm test -- --testNamePattern="Leumi legacy scraper"
To run specific test
, use the testNamePattern
arg with suite name following the test name. The following will run test should expose login fields in scrapers constant
that is part of Leumi legacy scraper
suite.
npm test -- --testNamePattern="Leumi legacy scraper should expose login fields in scrapers constant"
Many IDEs support running jest tests directly from the UI. In webstorm for example a small play icon automatically appears next to each describe/test.
IMPORTANT Note babel is configured to ignore tests by default. You must add an environment variable BABEL_ENV=test
to the IDE test configuration to allow the tests to work.
To save unit test scraper results provide a valid path in test configurations property excelFilesDist
, for example:
{
companyAPI: {
enabled: true,
excelFilesDist: '/Users/xyz/Downloads/Transactions',
},
}
You can use environment variables instead of a local file to provide the tests configuration.
copy and adjust the json below with relevant credentials and assign it to environment variable named TESTS_CONFIG
. Note that this must be a valid json string otherwise it will fail during json parsing.
{
"options": {
"startDate": "2019-06-01",
"combineInstallments": false,
"showBrowser": true,
"verbose": false,
"args": []
},
"credentials": {
"leumi": { "username": "demouser", "password": "demopassword" }
},
"companyAPI": {
"enabled": true,
"invalidPassword": false
}
}
If you wish to try it from cli (mac os), you should either create a one liner json configuration or use cat to provide multiline value:
TESTS_CONFIG=`cat <<EOF
{
... replace with actual json configuration ...
}
EOF
` npm run test
Make sure that you created test configuration file ./src/tests/.tests-config.js
from template ./src/tests/tests-config.tpl.js
.
- Make sure that you created test configuration file
./src/tests/.tests-config.js
from template./src/test/tests-config.tpl.js
. - Make sure that you added environment variable
BABEL_ENV=test
to the IDE test configuration.
Make sure that you uncommented the company credentials in the test configuration file.
- Make sure that you uncommented the company credentials in the test configuration file.
- Enable credit card companies / banks api tests in configuration file
companyAPI.enabled: true
The playground scripts were ok at the time and allowed us to develop and test scrapers. Since then we added new types of scrapers with different public api and we needed a better solution that will catch up with those changes.
In addition, the playground was offering an encryption of the passwords which lead to false sense of security since the private key was held in the source codes. Anyone could easily find the private key and decrypt those passwords. The new approach better reflect the standard way by providing a template file and ignoring the user specific configuration file . The developer "sees" the file and review its' PRs which should provide better understanding of what is going on.
Again, no template, but please try to create something of the form:
Changes:
* Created a new scraper for bank X
* Refactor blah blah
* etc...
- Make sure everything is committed.
- Run
npm run prepare:core
. The script will change the name insidepackage.json
toprepare-israeli-bank-scrapers-core
, change puppeteer dependency topuppeteer-core
, reinstall dependencies and rebuild the library. - Run
npm publish
. This will publishprepare-israeli-bank-scrapers-core
package. - Run
npm run reset
to reset the changes.
It is best to look at an existing example.
Most scrapers inherit from BaseScraper
, notice that you need to implement the following:
this section is relevant if you are extending class
BaseScraperWithBrowser
Unless you plan to override the entire login()
function, You can override this function to login regularly in a login form.
import { LoginResults, PossibleLoginResults } from './base-scraper-with-browser';
function getPossibleLoginResults(): PossibleLoginResults {
// checkout file `base-scraper-with-browser.ts` for available result types
const urls: PossibleLoginResults = {};
urls[LoginResults.Success] = [];
urls[LoginResults.InvalidPassword] = [];
urls[LoginResults.ChangePassword] = [];
return urls;
}
function getLoginOptions(credentials) {
return {
loginUrl: '<LOGIN_URL>',
fields: [
{ selector: '<USER_NAME_FIELD>', value: credentials.username },
{ selector: `<PASSWORD_FIELD>`, value: credentials.password },
],
submitButtonSelector: '<SUBMIT_BUTTON>',
possibleResults: getPossibleLoginResults(),
};
}
You can override this async function however way you want, as long as your return results as ScaperScrapingResult
(checkout declaration here).