Batch suggest in Omikuji backend #669

osma · 2023-02-03T14:49:29Z

This PR clarifies that backends only have to implement either one of _suggest and _suggest_batch, then implements batched suggest in the Omikuji backend. In practice, only the text vectorization is performed on the whole batch at once; the Omikuji implementation only supports a predict method for a single document at a time so it has to be done within a for loop.

There seems to be a small performance benefit. I tested this using annif eval the Finto AI yso-parabel-fi project/model, with the kirjaesittelyt2021/fin/test corpus. The evaluation results were unchanged, only the amount of time spent was slightly different. Memory usage remained pretty much the same.

With 1 job

	user time	wall time	max rss
before (master)	86.69	1:29.94	6322624
after (PR)	78.66	1:22.79	6336584

With 4 jobs

	user time	wall time	max rss
before (master)	121.33	1:22.33	6293804
after (PR)	96.55	1:18.78	6293640

Fixes #665

…n backends is enough

sonarqubecloud · 2023-02-03T14:50:09Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
0 Code Smells

No Coverage information
0.0% Duplication

codecov · 2023-02-03T14:54:38Z

Codecov Report

Base: 99.56% // Head: 99.56% // Increases project coverage by +0.00% 🎉

Coverage data is based on head (f4b55cd) compared to base (a7e3b4b).
Patch coverage: 100.00% of modified lines in pull request are covered.

Additional details and impacted files

@@           Coverage Diff           @@
##           master     #669   +/-   ##
=======================================
  Coverage   99.56%   99.56%           
=======================================
  Files          87       87           
  Lines        6143     6145    +2     
=======================================
+ Hits         6116     6118    +2     
  Misses         27       27

Impacted Files	Coverage Δ
annif/backend/backend.py	`100.00% <ø> (ø)`
annif/backend/omikuji.py	`97.53% <100.00%> (+0.09%)`	⬆️

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report at Codecov.
📢 Do you have feedback about the report comment? Let us know in this issue.

juhoinkinen

LGTM

osma added 2 commits February 3, 2023 15:48

Clarify that implementing either one of _suggest and _suggest_batch i…

599cd33

…n backends is enough

Support _suggest_batch operation in Omikuji backend

f4b55cd

osma added the enhancement label Feb 3, 2023

osma added this to the 0.61 milestone Feb 3, 2023

osma self-assigned this Feb 3, 2023

osma requested a review from juhoinkinen February 3, 2023 14:49

juhoinkinen approved these changes Feb 3, 2023

View reviewed changes

osma merged commit cc6dfcf into master Feb 3, 2023

osma deleted the issue663-suggest-batch-omikuji branch February 3, 2023 14:59

osma mentioned this pull request Feb 3, 2023

Support batch suggest in Omikuji backend #665

Closed

juhoinkinen mentioned this pull request Feb 23, 2023

Batch processing in training of NN ensemble - base project suggest calls #676

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Batch suggest in Omikuji backend #669

Batch suggest in Omikuji backend #669

osma commented Feb 3, 2023 •

edited

Loading

sonarqubecloud bot commented Feb 3, 2023

codecov bot commented Feb 3, 2023 •

edited

Loading

juhoinkinen left a comment

Batch suggest in Omikuji backend #669

Batch suggest in Omikuji backend #669

Conversation

osma commented Feb 3, 2023 • edited Loading

With 1 job

With 4 jobs

sonarqubecloud bot commented Feb 3, 2023

codecov bot commented Feb 3, 2023 • edited Loading

Codecov Report

juhoinkinen left a comment

Choose a reason for hiding this comment

osma commented Feb 3, 2023 •

edited

Loading

codecov bot commented Feb 3, 2023 •

edited

Loading