You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi Phil, and many thanks for the new Falcon-180B tutorial!
I am trying to replicate your experiment but when I run it I get the error forward() got an unexpected keyword argument 'position_ids'. I saw that, while llama_patch takes position_ids as a parameter, falcon_patch redefines forward() without it. I saw the patch is used only when one tries to runt the code with flash attention so I disabled it, but I am not sure it was supposed to run without it (first it is in the post's title 😅, plus I get a Signal 7 (SIGBUS) but I am not sure it is necessarily related to it...). What are your thoughts about this?
Ah I forgot to say: I am running it from command line as shown in the jupyter notebook:
Hi Phil, and many thanks for the new Falcon-180B tutorial!
I am trying to replicate your experiment but when I run it I get the error
forward() got an unexpected keyword argument 'position_ids'
. I saw that, whilellama_patch
takesposition_ids
as a parameter,falcon_patch
redefines forward() without it. I saw the patch is used only when one tries to runt the code with flash attention so I disabled it, but I am not sure it was supposed to run without it (first it is in the post's title 😅, plus I get a Signal 7 (SIGBUS) but I am not sure it is necessarily related to it...). What are your thoughts about this?Ah I forgot to say: I am running it from command line as shown in the jupyter notebook:
The text was updated successfully, but these errors were encountered: