FEAT: Support OpenHermes 2.5 #776

Bojun-Feng · 2023-12-18T01:28:58Z

Update Model Family JSON
Update README
Update Docs

I have tested the GGUF model locally with Llama.cpp but did not test the PyTorch ones due to the lack of CUDA support.

I played around with OpenHermes 2.5 on my laptop and generally believe it to be the best 7B local model we have so far. Here are some outputs from the Q2_K quantization (I'm sure other quantizations will perform even better) with 0 temperature for deterministic output, if anyone is interested:

screenshots