Train command option --prepare-only #583

osma · 2022-03-21T15:34:59Z

For many backends, training consists of two distinctive steps: preparing the training data, then training a ML model. It is possible to reuse the prepared training data using the --cached option and just retrain the ML model. But there is no support for the inverse operation: just preparing the data without training the model. This could be useful for DVC workflows (allowing more granular pipeline stages) and in situations where you intend to perform hyperparameter optimization.

The proposal is to add a --prepare-only option to the annif train command, which skips the model training part in those backends where there is a separate preparation step (i.e. those backends that currently support the --cached option).

The text was updated successfully, but these errors were encountered:

osma added enhancement DVC labels Mar 21, 2022

osma added this to the Short term milestone Mar 21, 2022

osma self-assigned this Mar 28, 2022

juhoinkinen mentioned this issue Aug 4, 2023

Fix train state and modification time for unfinished project training #722

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Train command option --prepare-only #583

Train command option --prepare-only #583

osma commented Mar 21, 2022 •

edited

Loading

Train command option --prepare-only #583

Train command option --prepare-only #583

Comments

osma commented Mar 21, 2022 • edited Loading

osma commented Mar 21, 2022 •

edited

Loading