Replies: 1 comment 2 replies
-
@AdrianSosic please take a look |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Here's a comparison between how different startegies are performing.
Details: I created 10
NumericalDiscreteParameter
s and oneCategoricalParameter
with 3 possible values. To change the parameter space size I tried different number of values for the numerical parameters: [5, 6, 7, 8, 10, 20, 50, 100]When the parameters' possible values reached 8 the process was killed due to memory overflow. Using Polars' Streaming API didn't solve the problem because Streaming API doesn't support
cross
operation. So my results are limited to n=5,6 values for each parameter and 3 values for a categorical parameter.Search space size for n=5 : 10 * 3 * 5 ^ 10 ~ 300M cells
Search space size for n=6 : 10 * 3 * 6 ^ 10 ~ 1.80B cells
Search space size for n=7 : 10 * 3 * 7 ^ 10 ~ 8.50B cells [Exceeded memory]
I used all types of polars-constraints (sum, product, exclusion) with these conditions:
One that could filter a large chunk of rows (more than 95%),
The effect of the second one were negligible (less than 5%),
And finally a constraint with mid-range effect (filtering ~25%)
I repeated each experiment multiple times to make sure the results are consistent. The table below shows the result of one of those experiments (they were similar):
Beta Was this translation helpful? Give feedback.
All reactions