You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
So far we've based our evaluation schemes on how Kaggle expected us to evaluate. What else can we measure? Can we find references to support our decisions?
Think about using different thresholds when calculating the macro F1 score
Think about the utility of evaluating performance on the tensors?
The text was updated successfully, but these errors were encountered:
So far we've based our evaluation schemes on how Kaggle expected us to evaluate. What else can we measure? Can we find references to support our decisions?
The text was updated successfully, but these errors were encountered: