
LR with priors initial implementation #66

Merged (18 commits, Apr 3, 2024)
Conversation

@bmramor (Collaborator) commented Mar 12, 2024

No description provided.

@wiz-inc-6f7a9d0588 (bot) commented Mar 15, 2024

Wiz Scan Summary

IaC Misconfigurations 0C 0H 0M 0L 0I
Vulnerabilities 0C 0H 0M 0L 0I
Sensitive Data 0C 0H 0M 0L 0I
Total 0C 0H 0M 0L 0I
Secrets 0🔑

@SkBlaz (Collaborator) left a comment

🥳

outrank/algorithms/importance_estimator.py (resolved)
@@ -57,17 +59,22 @@ def sklearn_surrogate(
unique_values, counts = np.unique(vector_second, return_counts=True)

# Establish min support for this type of ranking.
if counts[0] < len(unique_values) * (2**5):
estimate_feature_importance = 0
# if counts[0] < len(unique_values) * (2**5):

Let's remove such comments

estimate_feature_importance = 1 + \
np.median(estimate_feature_importance_list)
else:
X = np.concatenate((X,vector_first.reshape(-1, 1)), axis=1)

There is a space missing after the comma following X; I wonder why lint didn't catch that. @miha-jenko maybe some idea?

X = np.concatenate((X,vector_first.reshape(-1, 1)), axis=1)
X = transf.fit_transform(X)
estimate_feature_importance_list = cross_val_score(
clf, X, vector_second, scoring='neg_log_loss', cv=4,

Let's put the number of folds at the top of the file as a constant for now
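
The suggestion above could look like the following minimal sketch; NUM_CV_FOLDS is a hypothetical constant name, and the toy data stands in for the real feature vectors:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Hypothetical module-level constant, as the review suggests;
# the name actually chosen in the PR may differ.
NUM_CV_FOLDS = 4

# Toy classification data standing in for X / vector_second.
X, y = make_classification(n_samples=200, random_state=0)
clf = LogisticRegression(max_iter=1000)

# The cv= argument now references the constant instead of a magic number.
scores = cross_val_score(clf, X, y, scoring='neg_log_loss', cv=NUM_CV_FOLDS)
```

One score is returned per fold, so changing the constant changes the length of the result in one place.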

@@ -130,9 +130,13 @@ def mixed_rank_graph(
# Map the scoring calls to the worker pool
pbar.set_description('Allocating thread pool')

reference_model_features = {}
if 'prior' in args.heuristic:

You check for -prior at some point, but prior at some other point. Consider creating a helper function, is_prior_heuristic or similar, that unifies this behavior (and centralizes it)
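
One possible shape for the helper this comment asks for, shown as a hedged sketch; the real condition in the PR may also consult other fields of args:

```python
from types import SimpleNamespace
from typing import Any


def is_prior_heuristic(args: Any) -> bool:
    """Single, centralized check for prior-based heuristics.

    Hypothetical sketch: the final helper in the PR may use a
    different or stricter condition.
    """
    return 'prior' in args.heuristic


# Usage with a stand-in args object:
args = SimpleNamespace(heuristic='surrogate-LR-prior')
uses_prior = is_prior_heuristic(args)
```

Routing every '-prior' / 'prior' check through one function means the spelling of the heuristic suffix only has to be correct in one place.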

@bmramor bmramor requested a review from SkBlaz March 19, 2024 10:42
outrank/core_ranking.py (outdated; resolved)
@SkBlaz SkBlaz requested a review from miha-jenko March 22, 2024 09:56
)
if args.reference_model_JSON != '':
model_combinations = extract_features_from_reference_JSON(args.reference_model_JSON, combined_features_only=True)
model_combinations = [tuple(sorted(combination.split(','))) for combination in model_combinations]

The combination delimiter could be a constant, as it repeats
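
A small sketch of the suggested constant; COMBINATION_DELIMITER is a hypothetical name, and the sample strings are illustrative:

```python
# Hypothetical constant centralizing the ',' literal that currently
# repeats wherever combinations are split or joined.
COMBINATION_DELIMITER = ','

# Illustrative reference-model combinations.
model_combinations = ['fB,fA', 'fC']
model_combinations = [
    tuple(sorted(combination.split(COMBINATION_DELIMITER)))
    for combination in model_combinations
]
```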

random.shuffle(full_combination_space)
full_combination_space = full_combination_space[
: args.combination_number_upper_bound
]
if is_prior_heuristic(args):
full_combination_space = full_combination_space + [tuple for tuple in model_combinations if tuple not in full_combination_space]

Isn't this second part equivalent to list(set(model_combinations).difference(set(full_combination_space)))?
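
Nearly: both forms keep the same elements on toy data, but the list comprehension preserves the order of model_combinations (and any duplicates), while the set-based form does not guarantee order. A quick check with illustrative tuples:

```python
# Illustrative stand-ins for the real combination lists.
full_combination_space = [('a', 'b'), ('c',)]
model_combinations = [('a', 'b'), ('d', 'e')]

# Form used in the diff: preserves the order of model_combinations.
kept = [t for t in model_combinations if t not in full_combination_space]

# Set-based form from the review: same elements, unordered and deduplicated.
kept_set = list(set(model_combinations).difference(set(full_combination_space)))
```

If ordering of the appended combinations does not matter downstream, the set-based form is both shorter and asymptotically faster.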

@@ -225,7 +244,7 @@ def compute_combined_features(
pbar.set_description('Concatenating into final frame ..')
input_dataframe = pd.concat([input_dataframe, tmp_df], axis=1)
del tmp_df


No need for this blank line

"""Given a model's JSON, extract unique features"""

with open(json_path) as jp:
content = json.load(jp)

unique_features = set()
feature_space = content['desc'].get('features', [])
if full_feature_space:

full_feature_space sounds somewhat odd as a name for a flag that computes a set
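
The extraction the diff sketches can be reproduced in miniature as follows; the content['desc'].get('features', []) access comes from the snippet above, while the exact record format (comma-joined feature names) is an assumption for illustration:

```python
import io
import json

# Stand-in for open(json_path): a tiny in-memory reference-model JSON.
raw = '{"desc": {"features": ["f1,f2", "f3"]}}'
content = json.load(io.StringIO(raw))

# Collect unique individual features across all combinations.
unique_features = set()
feature_space = content['desc'].get('features', [])
for combined in feature_space:
    unique_features.update(combined.split(','))
```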

@@ -641,3 +644,10 @@ def summarize_rare_counts(
final_df.to_csv(
f'{args.output_folder}/feature_sparsity_summary.tsv', index=False, sep='\t',
)


def is_prior_heuristic(args: Any):

Missing return type annotation

outrank/algorithms/importance_estimator.py (outdated; resolved)
outrank/algorithms/importance_estimator.py (outdated; resolved)
outrank/core_ranking.py (outdated; resolved)
outrank/core_ranking.py (resolved)
outrank/core_utils.py (outdated; resolved)
outrank/task_selftest.py (resolved)
@SkBlaz SkBlaz merged commit d6dc5d3 into main Apr 3, 2024
9 checks passed

3 participants