We use ONNX as the model format, which is supported by langchain4j. The other common formats are GGUF and safetensors. The market mostly uses safetensors, but there is no Java library for it, and we did not want to run a third-party server alongside the application.
For the embedding model, we opted for Deep Java Library (DJL), which uses ONNX under the hood, because a) it is supported by the Java ecosystem and b) models can be converted from other formats. We accept that the ONNX models are much smaller and faster than the safetensors originals, but may respond with slightly lower quality.
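To make the setup concrete, here is a minimal sketch of loading an ONNX embedding model through langchain4j's `OnnxEmbeddingModel` (from the `langchain4j-embeddings` module, which runs on ONNX Runtime). The model and tokenizer paths are placeholders, and the exact constructor overloads may differ between langchain4j versions, so treat this as an assumption-laden sketch rather than verified project code:

```java
import dev.langchain4j.data.embedding.Embedding;
import dev.langchain4j.model.embedding.onnx.OnnxEmbeddingModel;
import dev.langchain4j.model.embedding.onnx.PoolingMode;
import dev.langchain4j.model.output.Response;

public class OnnxEmbeddingExample {

    public static void main(String[] args) {
        // Placeholder paths: point these at a model that was exported to ONNX
        // (e.g. via the DJL converter), plus its Hugging Face tokenizer file.
        OnnxEmbeddingModel model = new OnnxEmbeddingModel(
                "/models/all-MiniLM-L6-v2/model.onnx",      // ONNX weights
                "/models/all-MiniLM-L6-v2/tokenizer.json",  // HF tokenizer
                PoolingMode.MEAN);                          // mean-pool token vectors

        Response<Embedding> response = model.embed("Hello, ONNX embeddings");
        float[] vector = response.content().vector();
        System.out.println("dimensions: " + vector.length);
    }
}
```

Running this requires the `langchain4j-embeddings` dependency on the classpath and an already-converted model on disk; no external server is involved, which was the point of choosing this route.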
I found out that DJL supports multiple formats.
They use this converter: https://docs.djl.ai/master/extensions/tokenizers/index.html#use-djl-huggingface-model-converter.
It apparently allows converting a Hugging Face transformer model to TorchScript, OnnxRuntime, or Rust.
I assume that by "Hugging Face transformer model" they mean safetensors.
I used the script and completed two conversions successfully, but the conversion failed for two other models, so there appear to be model-architecture-specific requirements for the conversion to work.
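For reference, once a model has been converted, it can also be loaded directly with DJL's `Criteria` API instead of going through langchain4j. The sketch below assumes a local directory produced by the converter (the path is a placeholder) and uses DJL's Hugging Face text-embedding translator from the tokenizers extension; I have not verified it against the two models that failed to convert:

```java
import ai.djl.huggingface.translator.TextEmbeddingTranslatorFactory;
import ai.djl.inference.Predictor;
import ai.djl.repository.zoo.Criteria;
import ai.djl.repository.zoo.ZooModel;
import java.nio.file.Paths;

public class DjlEmbeddingExample {

    public static void main(String[] args) throws Exception {
        // Placeholder path: a directory produced by the DJL model converter,
        // containing the ONNX graph plus tokenizer files.
        Criteria<String, float[]> criteria = Criteria.builder()
                .setTypes(String.class, float[].class)
                .optModelPath(Paths.get("/models/converted/all-MiniLM-L6-v2"))
                .optEngine("OnnxRuntime")
                .optTranslatorFactory(new TextEmbeddingTranslatorFactory())
                .build();

        try (ZooModel<String, float[]> model = criteria.loadModel();
             Predictor<String, float[]> predictor = model.newPredictor()) {
            float[] embedding = predictor.predict("Hello, DJL");
            System.out.println("dimensions: " + embedding.length);
        }
    }
}
```

This needs the `onnxruntime-engine` and `tokenizers` DJL artifacts on the classpath; the translator factory handles tokenization and pooling so the predictor maps a `String` straight to a `float[]` embedding.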