Adding the option to use a new crd modules in policies #2

lieberlois · 2022-12-30T12:55:54Z

This PR introduces modules. A module can be created using a CRD like this:

apiVersion: bridgekeeper.maibornwolff.de/v1alpha1
kind: Module
metadata:
  name: test-module
spec:
  python: |
    def module_demo():
      print("Hello world!")

Now, this module can be used within a policy like this:

apiVersion: bridgekeeper.maibornwolff.de/v1alpha1
kind: Policy
metadata:
  name: test-policy
spec:
  audit: true
  enforce: false
  target:
    matches:
      - apiGroup: "apps"
        kind: "Deployment"
  modules:
  - test-module
  rule:
    python: |
      module_demo()
      def validate(request):
        return True

Upon evaluation, the modules are loaded and their code is prepended to the actual policy python code. In this case, you can see the "Hello, world" statement within the bridgekeeper logs:

Note that you can see the print statement twice. This is because the policy gets run in the validating webhook, which ensures that all references modules are present. The second print statement comes from the kubectl apply of a new deployment.

From the architecture side, this PR also attempts to generalize behavior of the PolicyStore and the PolicyEvent type using pattern matching and enums. Existing modules are stored in memory using a HashMap combined with a watcher task.

swoehrl-mw

The new CRD must be added to the ClusterRole in charts/bridgekeeper/templates/rbac.yaml.

IMO it makes more sense to put the module into the rule field in the CRD:

apiVersion: bridgekeeper.maibornwolff.de/v1alpha1
kind: Policy
metadata:
  name: test-policy
spec:
  rule:
    modules:
      - test-module
    python: |
      module_demo()
      def validate(request):
        return True

Regarding how modules are used:

I think it would be more python-idiomatic to provide the modules as python modules to be imported. This can be done with Pyo3:

Python::with_gil(|py| {
    let _module_code = PyModule::from_code(py, "def p():\n  print('abc from mymod')", "mymod.py", "mymod").unwrap();
    let rule_code = PyModule::from_code(py, "import mymod\ndef validate():\n  mymod.p()", "rule.py", "bridgekeeper").unwrap();
    let func = rule_code.getattr("validate").unwrap();
    func.call0().unwrap();
});

But one problem I see is that the python interpreter is valid for the entire bridgekeeper process and is reused. And this means any modules created are kept so another run could import a module even if it was not explicitly declared and there is no way to unload modules. We could mitigate this by giving modules random names each run and generating a import x as y statement at the top:

Python::with_gil(|py| {
    let _module_code = PyModule::from_code(py, "def p():\n  print('abc from mymod')", "mymod.py", "mymod123").unwrap();
    let rule_code = PyModule::from_code(py, "import mymod123 as mymod\ndef validate():\n  mymod.p()", "rule.py", "bridgekeeper").unwrap();
    let func = rule_code.getattr("validate").unwrap();
    func.call0().unwrap();
});

But that would preclude people doing things like from y import a.

I'm not really sure if this is worth the hassle. Opinions?

swoehrl-mw · 2023-01-13T12:31:32Z

src/api.rs

-    let (allowed, reason) = validate_policy_admission(&admission_request);
+    let mut module_code = String::new();
+
+    if let Some(policy) = &admission_request.object {


I think it would be cleaner to move the code to collect the modules into the validate_policy_admission function and to make that function into a method of the PolicyEvaluator to have access to all the needed state.

swoehrl-mw · 2023-01-13T12:45:12Z

src/util/types.rs

+}
+
+impl ObjectReference {
+    pub fn to_object_reference(&self) -> KubeObjectReference {


Maybe rename to to_k8s_object_reference to make it clearer what this method produces?

swoehrl-mw · 2023-01-13T12:56:10Z

src/evaluator.rs

+                            module_code.push_str("\n");
+                        },
+                        None => {
+                            log::warn!("Could not find module '{}'", module_name)


I think in this case it's better to directly stop the evaluation and send an error to the user.

swoehrl-mw · 2023-01-13T13:02:35Z

src/policy.rs

@@ -19,33 +20,13 @@ pub struct PolicyStore {
    pub policies: HashMap<String, PolicyInfo>,
 }

-pub type PolicyStoreRef = Arc<Mutex<PolicyStore>>;
+pub type PolicyStoreRef = Arc<Mutex<dyn ObjectStore<Policy, HashMap<String, PolicyInfo>> + Send>>;


There is no need to use the trait here.
Arc<Mutex<PolicyStore>> works as well and is faster because there is no dynamic dispatch.

swoehrl-mw · 2023-01-13T13:02:44Z

src/module.rs

+    pub modules: HashMap<String, ModuleInfo>,
+}
+
+pub type ModuleStoreRef = Arc<Mutex<dyn ObjectStore<Module, HashMap<String, ModuleInfo>> + Send>>;


There is no need to use the trait here.

swoehrl-mw · 2023-01-13T13:03:44Z

src/util/traits.rs

@@ -0,0 +1,7 @@
+use crate::util::types::ObjectReference;
+
+pub trait ObjectStore<T, V> {


Not sure if using a trait is actually beneficial. At the moment all code that uses either policies or modules is specific for that so we cannot really take advantage of the trait.

…-modules

lieberlois added 6 commits December 30, 2022 13:47

Implementing module usage in policies

04e5df2

Adding module code to validating webhook and auditing

4bd5fea

Using loaded module code in audit

350e139

Adding unit test for module usage

b06776f

Using ObjectStore trait across the project

8832169

Properly handling module events in event watcher

ce165df

swoehrl-mw requested changes Jan 13, 2023

View reviewed changes

lieberlois added 3 commits February 16, 2023 12:10

Changes from Code Review

541b8a1

Updating CRD

cde6a42

Merge branch 'main' of github.com:lieberlois/bridgekeeper into python…

1bc3a93

…-modules

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding the option to use a new crd modules in policies #2

Adding the option to use a new crd modules in policies #2

lieberlois commented Dec 30, 2022

swoehrl-mw left a comment

swoehrl-mw Jan 13, 2023

swoehrl-mw Jan 13, 2023

swoehrl-mw Jan 13, 2023

swoehrl-mw Jan 13, 2023

swoehrl-mw Jan 13, 2023

swoehrl-mw Jan 13, 2023

		@@ -0,0 +1,7 @@
		use crate::util::types::ObjectReference;

		pub trait ObjectStore<T, V> {

Adding the option to use a new crd modules in policies #2

Are you sure you want to change the base?

Adding the option to use a new crd modules in policies #2

Conversation

lieberlois commented Dec 30, 2022

swoehrl-mw left a comment

Choose a reason for hiding this comment

swoehrl-mw Jan 13, 2023

Choose a reason for hiding this comment

swoehrl-mw Jan 13, 2023

Choose a reason for hiding this comment

swoehrl-mw Jan 13, 2023

Choose a reason for hiding this comment

swoehrl-mw Jan 13, 2023

Choose a reason for hiding this comment

swoehrl-mw Jan 13, 2023

Choose a reason for hiding this comment

swoehrl-mw Jan 13, 2023

Choose a reason for hiding this comment