All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog and this project adheres to Semantic Versioning.
- Added
fit_args
toestimate_effect()
that can be used withfit_fast()
with the default variable weight algorithm (sklearn
's MTLassoCV). Common usages would include increasingmax_iter
(if you want to improve convergence) orn_jobs
(if you want to run in parallel). - Separated CV folds from Cross-fitting folds in
estimate_effects()
.cv_folds
controls the amount of additional estimation done on the control units (this used to be controled bymax_n_pl
, but that parameter now only governs the amount of post-estimation processing that is done). This will docv_folds
extra estimations per treatment time period, though if=1
then no extra work will be done (but control residuals might be biased toward 0). - Added additional option
Y_col_block_size
toMTLassoCV_MatchSpace_factory
to estimateV
on block-averages ofY
(e.g. taking a 150 cols down to 5 by doing averages over 30 cols at a time). - Added
se_factor
toMTLassoCV_MatchSpace_factory
to use a different penalty than the MSE min. - For large data, approximate the outcomes using a normal distribution (
DescrSet
), and allow for calculating estimates.
- Added tools too to use
fit_fast()
with large datasets. This includes thesample_frac
option toMTLassoCV_MatchSpace_factory()
to estimate the match space on a subset of the observations. It also includesfit_fast()
optionavoid_NxN_mats
which will avoid making large matrices (at the expense of only returning the Synthetic controltargets
andtargets_aux
and not the full weight matrix) - Added logging in
fit_fast
via theverbose
numerical option. This can help identify out-of-memory errors. - Added a pseudo-Doubly robust match space maker
D_LassoCV_MatchSpace_factory
. It apporptions some of the normalized variable V weight to those variables that are good predictors of treatment. This should only be done if there are many treated units so that one can reasonably model this relationship. - Switched using standardized Azure Batch config library
Initial release.