How to run ONNX model of GPT-2 ? #3563
Replies: 4 comments 1 reply
-
@JeeDevUser |
Beta Was this translation helpful? Give feedback.
-
DJL is a library allows you load the model and do NDArray operations. You should be implement any model logic in Java. |
Beta Was this translation helpful? Give feedback.
-
@frankfliu , thanks for yuor valuable insight. Or, for example, how did you know all this information that you used to implement this? I have a feeling that Java developers have a lot of work to do to be able to use pre-trained models, like Python developers can. |
Beta Was this translation helpful? Give feedback.
-
I think most Java developers just want to use a pre-trained model in a simple way to do some work in their applications. Just like Python developers can do it now, with a few lines of code. We're a long way from that now. This work should probably involve the wider Java community, otherwise Java will be excluded from the future of AI development. |
Beta Was this translation helpful? Give feedback.
-
Hi all,
I am just playing with ONNX version of GPT-2
by using following Python script:
Wondering, Is it possible to employ this model in DJL, and how? Where to start?
Beta Was this translation helpful? Give feedback.
All reactions