Batch NDCG #342

donglihe-hub · 2024-01-01T11:11:32Z

What does this PR do?

The original NDCG metric calculates scores for one instance at a time, which is inefficient. The new NDCG metric calculate scores in batch.

Performance Test Settings:

Number of labels = 100

Batch size = 40
Number of batchs = 100
Effective number of validation samples = 4000

Results:

Test CLI & API (`bash tests/autotest.sh`)

Test APIs used by main.py.

Test Pass
- (Copy and paste the last outputted line here.)
Not Applicable (i.e., the PR does not include API changes.)

Check API Document

If any new APIs are added, please check if the description of the APIs is added to API document.

API document is updated (linear, nn)
Not Applicable (i.e., the PR does not include API changes.)

Test quickstart & API (`bash tests/docs/test_changed_document.sh`)

If any APIs in quickstarts or tutorials are modified, please run this test to check if the current examples can run correctly after the modified APIs are released.

Sinacam

We should review the use of plurals vs singulars.

libmultilabel/nn/metrics.py

donglihe-hub · 2024-01-03T11:05:48Z

We should review the use of plurals vs singulars.

A doubt that has haunted me for years is that why it is preds and target, rather than preds and targets or pred and target. Therefore, I prefer using singular form for all variables except for those concepts that have been accepted by the general public.

libmultilabel/nn/metrics.py

cjlin1 · 2024-01-06T08:39:20Z

libmultilabel/nn/metrics.py

@@ -45,6 +45,7 @@ class NDCG(Metric):
    https://scikit-learn.org/stable/modules/generated/sklearn.metrics.ndcg_score.html
    Please find the formal definition here:
    https://nlp.stanford.edu/IR-book/html/htmledition/evaluation-of-ranked-retrieval-results-1.html
+    The target has to be a binary tensor.


add

We do not use NDCG in ?? because of ??

I'm not pretty sure what should be filled in the ??. As Li-Chung only mention this in function-level comments, I will rewrite it and put it under _idcg() to align with Li-Chung's changes and to not confuse anyone.

donglihe-hub requested review from cjlin1, Eleven1Liu, henryyang42, JamesLYC88 and Gordon119 as code owners January 1, 2024 11:11

donglihe-hub force-pushed the OptimizeNDCG branch from c41e976 to a26302c Compare January 1, 2024 14:15

optimize ndcg

198a58e

donglihe-hub force-pushed the OptimizeNDCG branch from a26302c to 198a58e Compare January 1, 2024 14:49

donglihe-hub added 2 commits January 1, 2024 19:22

deal with instances without labels

c2e8c2a

better time complexty

481c981

Sinacam reviewed Jan 3, 2024

View reviewed changes

libmultilabel/nn/metrics.py Outdated Show resolved Hide resolved

libmultilabel/nn/metrics.py Outdated Show resolved Hide resolved

libmultilabel/nn/metrics.py Outdated Show resolved Hide resolved

libmultilabel/nn/metrics.py Outdated Show resolved Hide resolved

donglihe-hub added 3 commits January 3, 2024 12:38

reverse changes to discount

90b07f2

specify argument "dim"

cb5977c

hacking to the idcg

cb35327

add reference for the best practice of batch dot product

c96657c

donglihe-hub force-pushed the OptimizeNDCG branch from e8e4b67 to c96657c Compare January 3, 2024 11:31

Sinacam reviewed Jan 5, 2024

View reviewed changes

libmultilabel/nn/metrics.py Outdated Show resolved Hide resolved

donglihe-hub force-pushed the OptimizeNDCG branch from 979b865 to 30701c7 Compare January 5, 2024 13:09

donglihe-hub changed the title ~~Optimize NDCG~~ Batch NDCG Jan 5, 2024

cjlin1 reviewed Jan 6, 2024

View reviewed changes

improve readability

b08dd9b

donglihe-hub force-pushed the OptimizeNDCG branch from 30701c7 to b08dd9b Compare January 6, 2024 08:52

illustrate internal mechanisms of idcg

a6b124d

donglihe-hub force-pushed the OptimizeNDCG branch from 30c9d54 to a6b124d Compare January 6, 2024 08:59

explain self-implemented NDCG

fb2a71c

donglihe-hub force-pushed the OptimizeNDCG branch from 8c36c4d to fb2a71c Compare January 7, 2024 13:48

cjlin1 approved these changes Jan 8, 2024

View reviewed changes

cjlin1 merged commit a3f296d into ASUS-AICS:master Jan 8, 2024
1 check passed

donglihe-hub deleted the OptimizeNDCG branch January 8, 2024 08:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Batch NDCG #342

Batch NDCG #342

donglihe-hub commented Jan 1, 2024 •

edited

Loading

Sinacam left a comment

donglihe-hub commented Jan 3, 2024 •

edited

Loading

cjlin1 Jan 6, 2024

donglihe-hub Jan 6, 2024 •

edited

Loading

Batch NDCG #342

Batch NDCG #342

Conversation

donglihe-hub commented Jan 1, 2024 • edited Loading

What does this PR do?

Performance Test Settings:

Results:

Test CLI & API (bash tests/autotest.sh)

Check API Document

Test quickstart & API (bash tests/docs/test_changed_document.sh)

Sinacam left a comment

Choose a reason for hiding this comment

donglihe-hub commented Jan 3, 2024 • edited Loading

cjlin1 Jan 6, 2024

Choose a reason for hiding this comment

donglihe-hub Jan 6, 2024 • edited Loading

Choose a reason for hiding this comment

donglihe-hub commented Jan 1, 2024 •

edited

Loading

Test CLI & API (`bash tests/autotest.sh`)

Test quickstart & API (`bash tests/docs/test_changed_document.sh`)

donglihe-hub commented Jan 3, 2024 •

edited

Loading

donglihe-hub Jan 6, 2024 •

edited

Loading