You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, I note that in Table 1, the performance on SST2 without task alignment is 79.2, but in the original MeZO paper, the performance is 51.9 (almost random, i.e., the model can hardly be trained). Why the reported number is much higher in this paper?
The text was updated successfully, but these errors were encountered:
Hi, I note that in Table 1, the performance on SST2 without task alignment is 79.2, but in the original MeZO paper, the performance is 51.9 (almost random, i.e., the model can hardly be trained). Why the reported number is much higher in this paper?
The text was updated successfully, but these errors were encountered: