You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
With FSPD2, it should be applied to all the modules present in the highest level module. Currently it's only applied to the encoder, layer norms, token embeddings and decoder, the projector (simple MLP) should have the function called on it.