Skip to content

Commit

Permalink
default normalization to rmsnorm
Browse files Browse the repository at this point in the history
  • Loading branch information
lucidrains committed Jul 5, 2022
1 parent 02a985c commit 3032cf4
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 1 deletion.
2 changes: 2 additions & 0 deletions retro_pytorch/retro_pytorch.py
Original file line number Diff line number Diff line change
Expand Up @@ -492,6 +492,8 @@ def __init__(
# for deepnet, residual scales
# follow equation in Figure 2. in https://arxiv.org/abs/2203.00555

norm_klass = default(norm_klass, RMSNorm)

if use_deepnet:
enc_scale_residual = default(enc_scale_residual, 0.81 * ((enc_depth ** 4) * dec_depth) ** .0625)
dec_scale_residual = default(dec_scale_residual, (3 * dec_depth) ** 0.25)
Expand Down
3 changes: 2 additions & 1 deletion setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -3,9 +3,10 @@
setup(
name = 'retro-pytorch',
packages = find_packages(exclude=[]),
version = '0.3.7',
version = '0.3.8',
license='MIT',
description = 'RETRO - Retrieval Enhanced Transformer - Pytorch',
long_description_content_type = 'text/markdown',
author = 'Phil Wang',
author_email = '[email protected]',
url = 'https://github.com/lucidrains/RETRO-pytorch',
Expand Down

0 comments on commit 3032cf4

Please sign in to comment.