
Add Speculator Architecture #6

Merged (13 commits, Feb 23, 2024)

Conversation

daviswer (Collaborator) commented Feb 22, 2024:

Add support for a speculative decoding architecture, with distinct forward-pass implementations for parallel pretraining and generative inference.
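As background for reviewers, the speculative-decoding loop this architecture serves can be sketched with toy stand-ins (`draft`, `base_next`, and the deterministic toy "models" below are illustrative assumptions, not fms APIs):

```python
def base_next(prefix):
    # Toy stand-in for the base model: deterministically emits prev + 1.
    return prefix[-1] + 1

def draft(prefix, n):
    # Toy stand-in for the speculator: cheaply guesses the next n tokens.
    return [prefix[-1] + i + 1 for i in range(n)]

def speculative_decode(prompt, steps, n):
    seq = list(prompt)
    for _ in range(steps):
        cands = draft(seq, n)
        accepted = []
        for tok in cands:
            # In practice the base model scores all candidates in one parallel pass.
            if tok == base_next(seq + accepted):
                accepted.append(tok)
            else:
                break
        if len(accepted) < len(cands):
            # On the first disagreement, fall back to the base model's own token,
            # so at least one token is emitted per verification step.
            accepted.append(base_next(seq + accepted))
        seq.extend(accepted)
    return seq
```

With a perfect toy speculator, each verification step emits n tokens at once; the parallel pretraining pass in this PR trains the speculator so that its drafts agree with the base model as often as possible.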

import torch.nn.functional as F
from fms.modules.layernorm import LayerNormParameterized

class Speculator(nn.Module):
Contributor:

I wonder if we might end up with multiple speculators, and if there's something more specific we might want to name this

Contributor:

might be helpful to have an explanation of our speculation strategy in the comments too

daviswer (Author):

Good point, I'll add an explanation. We could call this MLP_Speculator

Contributor:

FYI, snake_case class names aren't conventional in Python: https://visualgit.readthedocs.io/en/latest/pages/naming_convention.html

We might want to rename the dataset classes at some point too.

Contributor:

Should the path be models instead of modules? At least in the main fms repo, we distinguish between the two. I think the speculator runs standalone, not as a child of another model.

daviswer (Author):

Haha, I was using the exact opposite logic: I had this in modules, following the main repo, because it's a standalone object and not built from any dedicated sub-modules. But I can move it into models; at least for now that would make the organization of this repo slightly simpler.


class MLP_Speculator(nn.Module):
"""
This is a simple MLP-based speculator that functions similarly to Medusa, ingesting context via
Contributor:

might want to link to the medusa paper

The architecture is as flat and simple as possible: for each prediction head, the current
state vector is projected into a new latent space and added to the previous token's embedding.
This sum goes through layernorm and activation, forming the new state vector. This state predicts
the next token (or set of candidate tokens) for the current head, and then is passed on to the next.
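The recurrence described in the docstring above can be sketched as a toy module (class name, sizes, and the greedy feed-forward of each head's prediction are assumptions for illustration, not the PR's actual code):

```python
import torch
import torch.nn as nn

class TinySpeculatorSketch(nn.Module):
    # Illustrative re-implementation of the per-head recurrence:
    # project state, add previous token's embedding, layernorm + activation,
    # predict head i's token, pass the new state to head i + 1.
    def __init__(self, emb_dim=32, inner_dim=32, vocab=100, n_predict=3):
        super().__init__()
        self.n_predict = n_predict
        self.emb = nn.ModuleList(nn.Embedding(vocab, inner_dim) for _ in range(n_predict))
        self.proj = nn.ModuleList(
            nn.Linear(emb_dim if i == 0 else inner_dim, inner_dim, bias=False)
            for i in range(n_predict)
        )
        self.ln = nn.ModuleList(nn.LayerNorm(inner_dim) for _ in range(n_predict))
        self.head = nn.ModuleList(nn.Linear(inner_dim, vocab, bias=False) for _ in range(n_predict))
        self.act = nn.GELU()

    def forward(self, state, prev_token):
        # state: (b, emb_dim) base-model hidden state; prev_token: (b,) last emitted token
        logits = []
        for i in range(self.n_predict):
            z = self.proj[i](state) + self.emb[i](prev_token)  # project state, add token embedding
            state = self.act(self.ln[i](z))                    # layernorm + activation -> new state
            step_logits = self.head[i](state)                  # predict head i's token
            prev_token = step_logits.argmax(-1)                # greedy pick feeds the next head
            logits.append(step_logits)
        return torch.stack(logits, dim=1)  # (b, n_predict, vocab)
```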
Contributor:

nit: in diffs on GitHub, the text will be easier to read if each line is shorter (short enough not to need wrapping in a side-by-side window).

Lines long enough that whole paragraphs wrap work too, though then they become harder to comment on.

import torch.nn.functional as F
from fms.modules.layernorm import LayerNormParameterized

class MLP_Speculator(nn.Module):
Contributor:

nit: snake_case for class names is unconventional in Python

)
# Weights ensure that state_0 accounts for 50% of state magnitude by final head in expectation
self.state_weight = .5**(.5/n_predict)
self.emb_weight = math.sqrt(1-self.state_weight**2)
Contributor:

will need to run black; it expects spaces around `-`
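The variance arithmetic behind the weights in this hunk can be checked numerically (a sketch; `speculator_weights` is a hypothetical helper name, not code from the PR):

```python
import math

def speculator_weights(n_predict):
    # Recomputes the weighting from the diff above: after n_predict heads,
    # state_0's contribution to the state's variance is
    # state_weight ** (2 * n_predict) = 0.5, and each step preserves total
    # variance since state_weight**2 + emb_weight**2 == 1.
    state_weight = 0.5 ** (0.5 / n_predict)
    emb_weight = math.sqrt(1 - state_weight ** 2)
    return state_weight, emb_weight

sw, ew = speculator_weights(4)
assert abs(sw ** (2 * 4) - 0.5) < 1e-12     # state_0 keeps 50% of variance by the final head
assert abs(sw ** 2 + ew ** 2 - 1.0) < 1e-12  # unit-variance-preserving mix
```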

daviswer (Author):

I made the Blacking and Snake_Casing changes but it looks like a maintainer needs to approve the automated tests again

JRosenkranz (Collaborator) left a comment:

just comments regarding type hints and docstrings

m.weight.data.fill_(1)
m.bias.data.zero_()

def generate_suffixes(self, state, ind, topk=[5, 4, 3], n=5):
Collaborator:

can we add type hints and docstrings for this
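For illustration, hints along the requested lines might look like this sketch (the tensor shapes and docstring are guesses from the surrounding diff, not the PR's actual annotations):

```python
import torch
from typing import List

def generate_suffixes(
    self,
    state: torch.Tensor,  # (b, d) hidden states for the latest tokens (shape assumed)
    ind: torch.Tensor,    # (b,) indices of the tokens just emitted (shape assumed)
    topk: List[int] = [5, 4, 3],  # candidates kept per prediction head
    n: int = 5,           # number of candidate suffixes to return
) -> torch.Tensor:        # (b, n, n_predict) candidate token suffixes
    """Generate top-probability candidate token suffixes for each batch entry."""
    ...
```

A tuple default (`topk=(5, 4, 3)`) would also sidestep Python's shared-mutable-default pitfall.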

1, best_guesses.unsqueeze(2).expand(-1, -1, self.n_predict)
) # b n h

def forward(self, state, inds):
Collaborator:

can we add type hints and docstrings for this

daviswer merged commit 71e5600 into foundation-model-stack:main on Feb 23, 2024.
3 checks passed
daviswer deleted the speculator-v2 branch on Feb 23, 2024, 20:00.
3 participants