Code for computing the right vector for the rank-1 update #32

QZH-777 · 2023-03-17T13:07:02Z

In the compute_v function of rome/compute_v.py, you use the get_module_input_output_at_word function to get cur_input and cur_output.

In detail, cur_input and cur_output are obtained by feeding “Steve Jobs was the founder of” to gpt2-xl. So cur_input and cur_output are not equal to k* and W_{proj} k*, yet you seem to use them as k* and W_{proj} k* when calculating the right vector for the rank-1 update, which differs slightly from equation (2) in the paper. Why do you use this method to approximate k* and W_{proj} k*?
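To make the algebra behind the question concrete, here is a minimal pure-Python sketch. It is not the code in rome/compute_v.py: the helpers `matvec`, `dot`, and `rank_one_update`, the toy matrix, and the `left` vector (standing in for the left vector of the update) are all hypothetical. It only illustrates that, whichever vector is plugged in as the key (here `k`, playing the role of cur_input), choosing the right vector as (target - W k) / (k · left) makes the updated matrix map that exact key to the target.

```python
def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def rank_one_update(W, k, v_target, left):
    """Return (W + outer(right, left), right), where right is chosen
    so that the updated matrix maps k exactly to v_target."""
    cur_output = matvec(W, k)   # what W currently produces for k
    scale = dot(k, left)        # must be non-zero for the solve to work
    right = [(t - o) / scale for t, o in zip(v_target, cur_output)]
    W_new = [[w + r * l for w, l in zip(row, left)]
             for row, r in zip(W, right)]
    return W_new, right


# Toy values, chosen only for illustration.
W = [[1.0, 2.0, 0.0],
     [0.0, 1.0, -1.0]]
k = [1.0, 0.5, 2.0]          # stands in for cur_input
left = [0.5, 1.0, 0.25]      # stands in for the left vector
v_target = [3.0, -4.0]       # stands in for the target value

W_new, right = rank_one_update(W, k, v_target, left)
result = matvec(W_new, k)    # equals v_target up to rounding
```

Under these assumptions the constraint is satisfied for the particular `k` that was used to compute `right`. If that vector differs from the k* of equation (2), the constraint holds for the vector used in the solve rather than for k*, which is the discrepancy the question asks about.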

LuoXiaoxi-cxq · 2024-10-27T23:14:27Z

I have the same question.
