You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for your great work. I notice that during the casual tracing, when the kind is None, the “trace_important_states ” works layer by layer, but when the kind is attention or mlp, the “trace_important_window” works in layer windows. Why the layer range is different when the kind changes.
The text was updated successfully, but these errors were encountered:
Thank you for your great work. I notice that during the casual tracing, when the kind is None, the “trace_important_states ” works layer by layer, but when the kind is attention or mlp, the “trace_important_window” works in layer windows. Why the layer range is different when the kind changes.
The text was updated successfully, but these errors were encountered: