
Implementation of CUDA accelerated passive crossbar programming simulation for the 2021 Data Driven model #125

Merged: 38 commits into coreylammie:master on Feb 10, 2022

Conversation

@Philippe-Drolet (Contributor) commented on Feb 7, 2022

Partial solution to issue #53

The added CUDA files are built alongside the other CUDA extensions if CUDA is set to True in setup.py. The C++ bindings have not yet been implemented, and only the Data Driven model is supported by the CUDA bindings so far. Crossbar.py has been modified to call the bindings when all conditions are met, and the Linear, Conv3d, etc. files have a new argument, cuda_malloc_heap_size.
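For reference, here is a rough usage sketch of the new code path. The keyword names and values are illustrative rather than a confirmed API; in particular, whether patch_model forwards cuda_malloc_heap_size to the patched layers, the value 50, and the empty memristor_model_params are assumptions made for this example only.

```python
import torch
import memtorch
from memtorch.mn.Module import patch_model
from memtorch.map.Parameter import naive_map
from memtorch.bh.crossbar.Program import naive_program

# A small toy network comparable to the ones timed below.
model = torch.nn.Sequential(torch.nn.Linear(100, 50), torch.nn.Linear(50, 10))

# transistor=False selects the passive (1R) crossbar configuration targeted by
# the new CUDA programming kernels; cuda_malloc_heap_size is the new argument
# introduced in this PR (the value 50 here is an arbitrary placeholder).
patched_model = patch_model(
    model,
    memristor_model=memtorch.bh.memristor.Data_Driven2021,
    memristor_model_params={},  # model parameters omitted; assumed defaults
    module_parameters_to_patch=[torch.nn.Linear],
    mapping_routine=naive_map,
    transistor=False,
    programming_routine=naive_program,
    tile_shape=(128, 128),
    max_input_voltage=0.3,
    cuda_malloc_heap_size=50,
)
```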

The logic behind device programming has changed slightly to reflect the way devices are programmed in our lab in practice. The amplitude of the voltage used to program a device increases (or decreases) by 0.02 V each time the desired conductance is not achieved. A variable to change the nominal value of this increment could be added in a future release. In the new routine, force_adjustment_pos_voltage_threshold is the maximum voltage that the 0.02 V increments can lead to (the applied voltage always remains below this value, even after incrementation), and force_adjustment_neg_voltage_threshold serves the same purpose for negative voltages.
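To make the routine easier to follow, here is a minimal Python sketch of the adaptive-voltage idea. The production code lives in the CUDA kernel; read_conductance, apply_pulse, the starting amplitudes, rel_tol, and max_iterations below are hypothetical placeholders rather than names from this PR.

```python
def program_device(target_g, read_conductance, apply_pulse,
                   force_adjustment_pos_voltage_threshold=1.0,
                   force_adjustment_neg_voltage_threshold=-1.0,
                   rel_tol=0.01, max_iterations=1000):
    """Adaptive-amplitude programming sketch: the pulse amplitude grows by
    0.02 V after every failed attempt, but never reaches the positive or
    negative threshold (defaults here are assumed, not taken from the PR)."""
    pos_voltage, neg_voltage = 0.5, -0.5  # assumed starting amplitudes
    for _ in range(max_iterations):
        g = read_conductance()
        if abs(g - target_g) <= rel_tol * target_g:
            return True  # device is within the target conductance window
        if g < target_g:
            apply_pulse(pos_voltage)  # SET pulse to raise conductance
            if pos_voltage + 0.02 < force_adjustment_pos_voltage_threshold:
                pos_voltage += 0.02
        else:
            apply_pulse(neg_voltage)  # RESET pulse to lower conductance
            if neg_voltage - 0.02 > force_adjustment_neg_voltage_threshold:
                neg_voltage -= 0.02
    return False  # gave up after the failure-iteration threshold
```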

For the networks I tested, I did not manage to run the naive_program.py routine to completion, so I do not have an estimate of the speedup the CUDA implementation brings. Here are some execution times with respect to network size (note that for the following networks, all tiles are initially randomized, so every ReRAM device needs to be programmed):

Patched Linear(in_features=100, out_features=50, bias=True) -> bh.Linear(in_features=100, out_features=50, bias=True)
Patched Linear(in_features=50, out_features=10, bias=True) -> bh.Linear(in_features=50, out_features=10, bias=True)
time = 123.00254679999999

Patched Linear(in_features=100, out_features=30, bias=True) -> bh.Linear(in_features=100, out_features=30, bias=True)
Patched Linear(in_features=30, out_features=10, bias=True) -> bh.Linear(in_features=30, out_features=10, bias=True)
time = 113.69156190000001

Patched Linear(in_features=25, out_features=30, bias=True) -> bh.Linear(in_features=25, out_features=30, bias=True)
Patched Linear(in_features=30, out_features=10, bias=True) -> bh.Linear(in_features=30, out_features=10, bias=True)
time = 110.4159673

Next, I would like to commit some plotting utility files (mostly heatmaps comparing device conductances before and after the programming routine), but I do not know where in MemTorch's architecture they would be most appropriate. If plots are added, it would probably be good to add a section to the tutorial guiding users through this new functionality; I could take care of this if you wish.

The version.py and setup.py files were updated by accident, and I cannot revert them to their original form because of an error caused by black and isort after setting up my environment with the suggested pre-commit commands. Please ignore them; I can create a new pull request if necessary.

Philippe-Drolet and others added 26 commits starting November 4, 2021:

  • The variable random_crossbar_init initializes the crossbar to random device conductances between 1/Ron and 1/Roff. This is useful for testing the robustness of a passive crossbar when it is reprogrammed to different conductances after previous inference. It has no effect if transistor is set to True.
  • Merge commit resolving conflicts in memtorch/bh/crossbar/Crossbar.py and memtorch/mn/Module.py.
  • Not tested, not working; backup only for now.
  • Implementation of the Data Driven simulate-no-neighbours routine.
  • Shaky accuracy with the simulate-neighbours routine (probably working as intended, but the effect of the applied V/2 is not negligible); documentation added for the CUDA kernels.
  • Changed the code of Crossbar.py based on what is currently implemented in CUDA.
  • Fixed the code so that memtorch_cuda_bindings is no longer called when the installed version is CPU-only.
@lgtm-com bot commented on Feb 7, 2022

This pull request introduces 2 alerts and fixes 2 when merging de8dc2d into aec1e25 - view on LGTM.com

new alerts:

  • 1 for Testing equality to None
  • 1 for Unreachable code

fixed alerts:

  • 2 for Unreachable code

@coreylammie (Owner) left a comment

Thank you for your contribution! This functionality looks great.

I have made a number of minor comments/suggestions for your consideration. Your pull request can be updated by pushing new commit(s) to 3it-nano:master directly. The --no-verify flag can be used alongside git commit to bypass pre-commit workflows. It would be greatly appreciated if you could update setup.py so that the automated CI workflow can run in full.

I'm definitely happy to add input arguments to specify the maximum increment/decrement of the programming voltage amplitude, and the failure iteration threshold, once this is merged to master. This, alongside (#123), can make up the 1.1.6 release. I can sort out the newly required documentation, changelog, and other miscellaneous changes on my end.

Regarding plotting utility files: In memtorch.bh.memristor.Memristor, I originally defined two abstract methods for plotting IV characteristics and bipolar switching behaviour: plot_hysteresis_loop, and plot_bipolar_switching_behaviour.

I would suggest developing common plotting functions/utilities under a new submodule entitled memtorch.plot (to more easily standardize things such as the font style and other properties), which can be called from other methods. For example, memtorch.bh.crossbar.Crossbar.plot_conductance_matrices could be defined, which would call memtorch.plot.plot_heatmap. I've tentatively created a new issue (#126) detailing this. Feel free to reach out if you need any help with this. I'm happy to develop the associated documentation if you would like!
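To make the proposed structure concrete, here is a minimal sketch of what such a submodule could look like. memtorch.plot does not exist yet; plot_heatmap and plot_conductance_matrices are only the hypothetical names suggested above, and the styling choices are placeholders.

```python
# Hypothetical memtorch/plot.py -- sketch only, this submodule does not exist yet.
import matplotlib.pyplot as plt
import torch


def plot_heatmap(matrix, title="", xlabel="Column", ylabel="Row", ax=None):
    """Plot a 2D torch tensor as a heatmap using a shared, standardized style."""
    if ax is None:
        _, ax = plt.subplots()

    im = ax.imshow(matrix.detach().cpu().numpy(), aspect="auto")
    ax.set_title(title)
    ax.set_xlabel(xlabel)
    ax.set_ylabel(ylabel)
    plt.colorbar(im, ax=ax, label="Conductance (S)")
    return ax


def plot_conductance_matrices(target, programmed):
    """Compare target and programmed conductance matrices side by side."""
    fig, axes = plt.subplots(1, 2, figsize=(10, 4))
    plot_heatmap(target, title="Target conductances", ax=axes[0])
    plot_heatmap(programmed, title="Programmed conductances", ax=axes[1])
    fig.tight_layout()
    return fig
```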

(Five review threads on memtorch/bh/crossbar/Crossbar.py were marked as outdated and resolved.)
In memtorch/cu/simulate_passive_kernels.cu, one review thread discussed the following fragment:
if (instruction_array[k] != 0)
{
//Check to ensure that the resistance remains within possible range
if (resistance_ > r_off_global)
@coreylammie (Owner): This logic can be moved outside of the if/else if clause, i.e., it is used when (i == current_i && j == current_j) || (i == current_i || j == current_j).

@Philippe-Drolet (Contributor, Author): I adapted the code according to this suggestion.

(Three further review threads on memtorch/cu/simulate_passive_kernels.cu were marked as resolved.)
Philippe-Drolet and others added 2 commits on February 7, 2022:

  • Set CUDA = False for CI (co-authored by Corey Lammie <[email protected]>)
Philippe-Drolet and others added 6 commits on February 7, 2022:

  • Remove memristor_model_params random comment oversight (co-authored by Corey Lammie <[email protected]>)
@lgtm-com bot commented on Feb 8, 2022

This pull request introduces 1 alert when merging 054e5dc into aec1e25 - view on LGTM.com

new alerts:

  • 1 for Testing equality to None

@codecov bot commented on Feb 8, 2022

Codecov Report

Merging #125 (e742b7b) into master (aec1e25) will decrease coverage by 0.32%.
The diff coverage is 44.00%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #125      +/-   ##
==========================================
- Coverage   84.53%   84.20%   -0.33%     
==========================================
  Files          54       54              
  Lines        2276     2292      +16     
==========================================
+ Hits         1924     1930       +6     
- Misses        352      362      +10     
Flag        Coverage Δ
unittests   84.20% <44.00%> (-0.33%) ⬇️

Flags with carried forward coverage won't be shown.

Impacted Files                             Coverage Δ
memtorch/bh/memristor/Data_Driven2021.py   82.71% <ø> (ø)
memtorch/mn/Conv1d.py                      94.50% <ø> (ø)
memtorch/mn/Conv2d.py                      93.68% <ø> (ø)
memtorch/mn/Conv3d.py                      93.68% <ø> (ø)
memtorch/mn/Linear.py                      91.95% <ø> (ø)
memtorch/bh/crossbar/Crossbar.py           87.21% <41.66%> (-3.92%) ⬇️
memtorch/version.py                        100.00% <100.00%> (ø)

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@Philippe-Drolet (Contributor, Author) commented on Feb 8, 2022

Thank you for your swift response and for these comments! I will address them now and commit my plotting functions later on (following your suggestions). It's great that you do not mind taking care of the documentation; do not hesitate to reach out if you have any questions or if you want me to take charge of it. I have resolved some of the comments; please feel free to reopen them if you feel the solution provided is unsatisfactory.

I am sorry for the many commits to the branch that trigger the bot verification every time!

- Removed ideal_crossbar matrix
- Made list of implemented models global
- Added exception thrown when a model is not implemented
- Refactored a part of the CUDA kernel
- Removed function from CUDA header (.cuh file)
- Removed redundant build_g_tensor function
@coreylammie (Owner) commented

No problem at all! All of your changes look good to me, thank you.

I'll merge this to master now and make a new branch for the 1.1.6 release. In it, I will add input arguments to specify the maximum increment/decrement of the programming voltage amplitude and the failure iteration threshold. I'll also update the required documentation, CHANGELOG, and version number.

The new plotting functionality can be included in a future release. Don't hesitate to reach out if you have any questions!

@coreylammie merged commit 4ce968b into coreylammie:master on Feb 10, 2022