Hey, so basically the main things that would be needed are:
a) a giant dataset of PHP code - the original CodeGen models were trained on the GitHub BigQuery dataset, and there is a PHP subset, so that could be a good source (see the BigQuery sketch after this list).
b.i) a large enough machine to fine-tune the model - I'm a hobbyist, and to train a model with billions of parameters you need commercial GPUs (even on a 4090 Ti you would struggle to fine-tune the 2B or larger models).
OR
b.ii) use a low-resource fine-tuning approach like LoRA, which adapts the model by adding small low-rank adapter weights alongside the existing layers, specialised for the new language you want to target (a rough fine-tuning sketch follows below). This changes the shape/architecture of the model, making it incompatible with the current implementation of codegen in ggml, so I would need to make some changes to the C++ side. llama.cpp recently added LoRA support, so it is feasible that LoRA support could be added here too.
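
For (a), here is a minimal sketch of pulling PHP sources from the public GitHub dataset on BigQuery, assuming the standard `bigquery-public-data.github_repos` tables and the `google-cloud-bigquery` client are available; the row limit, path filter, and output file are illustrative, not a tested pipeline:

```python
# Sketch: extract PHP files from the public GitHub dataset on BigQuery.
# Assumes the bigquery-public-data.github_repos schema (files + contents tables)
# and authenticated google-cloud-bigquery credentials; limits/paths are illustrative.
from google.cloud import bigquery

client = bigquery.Client()

query = """
SELECT c.content
FROM `bigquery-public-data.github_repos.files` AS f
JOIN `bigquery-public-data.github_repos.contents` AS c
  ON f.id = c.id
WHERE f.path LIKE '%.php'
  AND c.binary = FALSE
LIMIT 100000
"""

with open("php_corpus.txt", "w", encoding="utf-8") as out:
    for row in client.query(query):          # iterating the query job yields result rows
        if row["content"]:
            out.write(row["content"])
            out.write("\n<|endoftext|>\n")   # simple document separator for LM training
```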
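
For (b.ii), a rough sketch of what LoRA fine-tuning could look like on the Hugging Face side using the `peft` library; the checkpoint name, the `qkv_proj` target module, and the hyperparameters are assumptions for illustration, not a verified recipe:

```python
# Sketch: attach LoRA adapters to a CodeGen checkpoint with Hugging Face PEFT.
# Checkpoint, target modules, and hyperparameters are assumptions for illustration.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base = "Salesforce/codegen-350M-multi"   # small checkpoint; larger ones need more VRAM
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

lora_cfg = LoraConfig(
    r=8,                          # rank of the low-rank update matrices
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["qkv_proj"],  # CodeGen packs q/k/v into one projection (assumption)
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, lora_cfg)
model.print_trainable_parameters()   # only the adapter weights are trainable

# From here the model can be trained with the usual transformers Trainer on a PHP corpus,
# then either exported as a separate adapter or merged back into the base weights.
```

If the adapter were merged back into the base weights after training (peft has a merge utility for this), the architecture would stay the same as the original model, which might sidestep the ggml incompatibility mentioned above.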
I use PHP, could this be added?