
Commit 46b19ad

Added documentation on usage in no_std
1 parent 21802d7 commit 46b19ad

3 files changed: +98 -1 lines changed

burn-book/book.toml

+1
@@ -6,6 +6,7 @@ authors = [
   "Dilshod Tadjibaev",
   "Guillaume Lagrange",
   "Sylvain Benner",
+  "Bjorn Beishline"
 ]
 language = "en"
 multilingual = false

burn-book/src/SUMMARY.md

+1 -1
@@ -31,4 +31,4 @@
 - [Custom WGPU Kernel](./advanced/backend-extension/custom-wgpu-kernel.md)
 - [Custom Optimizer]()
 - [WebAssembly]()
-- [No-Std]()
+- [No-Std](./advanced/no-std.md)

burn-book/src/advanced/no-std.md

+96
@@ -0,0 +1,96 @@
# No Standard Library

In this section, you will learn how to run an ONNX inference model on an embedded system with no standard library support, using a Raspberry Pi Pico as the target. The same approach should apply to other platforms. All the code can be found in the
[examples directory](https://github.com/tracel-ai/burn/tree/main/examples/onnx-inference-rp2040).

## Step-by-Step Guide

Let's walk through the process of running an embedded ONNX model:

### Setup

Follow the [embassy guide](https://embassy.dev/book/#_getting_started) for your specific environment. Once set up, you should have a project similar to the following:
```
./inference
├── Cargo.lock
├── Cargo.toml
├── build.rs
├── memory.x
└── src
    └── main.rs
```
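
If you followed the embassy setup, `src/main.rs` is already a bare-metal binary. Purely for orientation (this comes from the embassy template, not something you need to add), it starts with crate attributes along these lines:
```rs
// From the embassy template: no standard library and no conventional `main`
// entry point; the embassy executor macro provides the entry point instead.
#![no_std]
#![no_main]
```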

Some additional dependencies have to be added:
```toml
[dependencies]
embedded-alloc = "0.5.1" # Only if there is no default allocator for your chip
burn = { version = "0.14", default-features = false, features = ["ndarray"] } # Backend must be ndarray

[build-dependencies]
burn-import = { version = "0.14" } # Used to auto-generate the Rust code that imports the model
```

### Import the Model

Follow the directions to [import models](./import/README.md).

Use the following `ModelGen` config in your `build.rs`:
```rs
ModelGen::new()
    .input(my_model)
    .out_dir("model/")
    .record_type(RecordType::Bincode)
    .embed_states(true)
    .run_from_script();
```
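
The generated Rust code then needs to be included in the binary. A minimal sketch of how this is usually done (the file name `my_model.rs` is a placeholder that must match the ONNX file passed to `ModelGen::input`; the module is referred to as `your_model` later in this guide):
```rs
// Pull in the model code that burn-import generated at build time.
// The path under OUT_DIR mirrors the `out_dir("model/")` setting above;
// "my_model.rs" is a placeholder for your model's file name.
pub mod your_model {
    include!(concat!(env!("OUT_DIR"), "/model/my_model.rs"));
}
```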

### Global Allocator

First, define a global allocator (only needed if your target does not already provide a default allocator).

```rs
use embassy_executor::Spawner;
use embedded_alloc::Heap;

#[global_allocator]
static HEAP: Heap = Heap::empty();

#[embassy_executor::main]
async fn main(_spawner: Spawner) {
    // Initialize the heap before any allocation takes place.
    {
        use core::mem::MaybeUninit;
        const HEAP_SIZE: usize = 100 * 1024; // This depends on the model's size in memory.
        static mut HEAP_MEM: [MaybeUninit<u8>; HEAP_SIZE] = [MaybeUninit::uninit(); HEAP_SIZE];
        unsafe { HEAP.init(HEAP_MEM.as_ptr() as usize, HEAP_SIZE) }
    }
}
```

### Define Backend

We are using ndarray, so we just need to define the `NdArray` backend as usual:
```rs
use burn::{backend::NdArray, tensor::Tensor};

type Backend = NdArray<f32>;
type BackendDevice = <Backend as burn::tensor::backend::Backend>::Device;
```

Then, inside the `main` function, add:
```rs
use your_model::Model;

// Get a default device for the backend
let device = BackendDevice::default();

// Create a new model and load the state
let model: Model<Backend> = Model::default();
```

### Running the Model

To run the model, just call it as you would normally:
```rs
// Define the input tensor
let input = Tensor::<Backend, 2>::from_floats([[input]], &device);

// Run the model on the input
let output = model.forward(input);
```
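
To get plain numbers back out of the output tensor, one possible sketch (assuming a float output; the shape and indexing depend on your model):
```rs
// Convert the output tensor into raw data and copy it into a Vec<f32>
// allocated on the heap we set up earlier.
let output_data = output.into_data();
let values = output_data.to_vec::<f32>().unwrap();
// `values[0]` (or whichever index matches your model's output layout)
// can now be logged or used to drive the rest of the application.
```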

## Conclusion

Running a model in a no_std environment is almost identical to running it in a normal environment. All that is needed is a global allocator.
