Finish corrigenda on static init post

dmiller · Jun 24, 2024 · 0a6158a
1 parent 09207c9
commit 0a6158a
Showing 1 changed file with 94 additions and 4 deletions.
diff --git a/_posts/2024-06-24-corrigendum-static-initialization.md b/_posts/2024-06-24-corrigendum-static-initialization.md
@@ -5,12 +5,102 @@ date: 2024-06-18 00:00:00 -0500
 categories: general
 ---
 
-I believe I made an error in one of benchmarks mentioned in an earlier post.  Here I do a little more analysis and provide a correction to the code.
+I must have made an error in one of the benchmarks mentioned in an earlier post.  Here I do a little more analysis and provide a correction to my comments and to the code.
 
-In  [A mega-dose of micro-benchmarks, Part 2 -- By the numbers]({{site.baseurl}}{% post_url 2024-06-18-mega-dose-of-micro-benchmarks-part-2 %}), there was a section toward the end that discussed the performance hit of static initialization. I believe that analysis is incorrect.
+In  [A mega-dose of micro-benchmarks, Part 2 -- By the numbers]({{site.baseurl}}{% post_url 2024-06-18-mega-dose-of-micro-benchmarks-part-2 %}), there was a section toward the end that discussed the performance hit of static initialization. I believe the analysis given there is incorrect.
 
 Did I mention that micro-benchmarking is hard?
 
-I made a claim that a static initialization being done in the `Numbers` package was causing a performance hit compared to the C# code. Something was very wrong in those numbers, but there was an element of truth.  And in the long run, it really doesn't matter.
+__TL;DR__: I made a claim that a static initialization being done in the `Numbers` package was causing a performance hit compared to the C# code. Something was very wrong in those numbers, but there was an element of truth.  And in the long run, it really doesn't matter.
 
-Here is a very reduced model of the kind of situation one gets into.
+Here is a very reduced model of the kind of situation I was running into.
+
+
+```F#
+type C(v:int) = 
+    member this.V = v
+
+type B(x:int ) =
+    static member val EmptyC = C(0)
+    member this.X = x
+```
+Compiling to IL and decompiling to C# (thanks, [sharplab.io](https://sharplab.io)), we get (with some editing):
+
+ ```C#
+    public class C
+    {
+        internal int v;
+        public int V { get => v; }
+        public C(int v) { this.v = v; }
+    }
+
+    public class B
+    {
+        internal int x;
+        internal static C EmptyC@;
+        internal static int init@7;
+
+        public static C EmptyC
+        {
+            get
+            {
+                if (init@7 < 1)
+                {
+                    LanguagePrimitives.IntrinsicFunctions.FailStaticInit();
+                }
+                return EmptyC@;
+            }
+        }
+
+        public int X { get => x; }
+
+        public B(int x) { this.x = x; }
+
+        static B()
+        {
+            $_.init@ = 0;
+            int init@8 = $_.init@;
+        }
+    }
+namespace <StartupCode$_>
+{
+    internal static class $_
+    {
+        internal static int init@;
+
+        public static void main@()
+        {
+            B.EmptyC@ = new C(0);
+            B.init@7 = 1;
+        }
+    }
+}
+```
+
+Variables such as `$_.init@` and `B.init@7` are used to detect circularity conditions in static field initialization.  It appears that one pays a small price on every static field reference: a test that initialization has completed properly.
+
+I had read in a few places online that the tiered compilation of the modern JITter would get rid of this overhead eventually.  But I wasn't seeing it.  
+
+It takes patience.
+
+I thought surely all the warmup that BenchmarkDotNet does on the code before doing the actual benchmarking runs would be enough.  Not so.
+
+I discovered this by accident.  I was benchmarking something else (still involving a static field reference) and accidentally compared the same code to itself three times.  The first run was considerably slower than the second and third runs.  The second and third runs were essentially identical.
+
+For the classes above, I benchmarked accessing `B.EmptyC.V`.  Here are three successive runs showing exactly this behavior:
+
+| Method | Mean     | Error    | StdDev   | Ratio | RatioSD |
+|------- |---------:|---------:|---------:|------:|--------:|
+| BC     | 40.11 ns | 0.769 ns | 1.052 ns |  1.00 |    0.00 |
+| BC2    | 34.65 ns | 0.240 ns | 0.200 ns |  0.86 |    0.02 |
+| BC3    | 35.11 ns | 0.238 ns | 0.211 ns |  0.87 |    0.02 |
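+
+For reference, a harness of roughly the following shape would produce a table like this one.  It is only a minimal sketch, not the exact benchmark code: the class name `StaticFieldBenchmarks` is made up for illustration, the three method bodies are deliberately identical, and I am assuming the access goes through the public property `V`.  It will not necessarily reproduce the timings shown here.
+
+```F#
+module Program
+
+open BenchmarkDotNet.Attributes
+open BenchmarkDotNet.Running
+
+type C(v:int) =
+    member this.V = v
+
+type B(x:int) =
+    static member val EmptyC = C(0)
+    member this.X = x
+
+// Hypothetical benchmark class: three identical methods, so any
+// difference between the rows comes from run order, not from the code.
+type StaticFieldBenchmarks() =
+
+    // The baseline row is the one reported with Ratio = 1.00.
+    [<Benchmark(Baseline = true)>]
+    member this.BC() = B.EmptyC.V
+
+    [<Benchmark>]
+    member this.BC2() = B.EmptyC.V
+
+    [<Benchmark>]
+    member this.BC3() = B.EmptyC.V
+
+[<EntryPoint>]
+let main _ =
+    BenchmarkRunner.Run<StaticFieldBenchmarks>() |> ignore
+    0
+```
+
+BenchmarkDotNet expects an optimized build, so a project like this would be run with `dotnet run -c Release`.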
+
+In the earlier post, I described a technique to get rid of the static initialization checks by having the consumer of the numerics package do an initialization step.  I went back to my original benchmarks and ran them twice: once with static initializations and the checks you see above, and once with the user initialization code that got rid of the checks.
+
+No essential difference.
+
+I thought it better not to put the burden of remembering to call an initialization function on the user of the package, so I reverted that change and went back to the code using static initialization.
+
+I'm sure I learned some lesson here.  Not sure what it is.