This contains a python package and associated Google Colab notebooks for the project.
See notebooks/train
- there are 3 phases:
- Fine Tune distilbert to ONET tasks
- Construct PacMAP embedding of distilbert to reduce dimensionality
- Fine Tune distilbert to Embeddings to simplify code path / prediction
See notebooks/score
- two use cases
- Compute pairwise distances between a set of course objectives and occupational tasks
- Index all tasks, and use nearest neighbors to search for relevant occupations.