Request: smaller dependency surface on Transformers model types? #408
Comments
That said, if this is something you go after, here are a couple of relevant …
Actually, the pythonic branch they are developing is taking a long time, but it is still active; they just haven't communicated their progress to the community. For controlled generation you have LMQL, but personally I find it quite hard to use compared to guidance. It hides most of what happens, which makes it hard to debug when things go wrong (and that is often the case; I have an unexplainable memory leak with LMQL). Also, you have …
The way it will be handled seems promising when you look at the commits (at least it will be much more pythonic, but I feel they are still not sure how to expose things to be "pythonic").
From my last test it was not working for my use case, but this is a work-in-progress branch, so it is to be expected that things can be broken.
On Tue, Nov 7, 2023 at 9:26 PM, Josh Freckleton ***@***.***> wrote:
… @paucodeici <https://github.com/paucodeici> I agree with everything you've said about lmql. It's a pain to use and debug. I still haven't found anything better yet though. Guidance *was* way better, but alas, it seems pretty dead. Have you tried the pythonic branch?
Please check out our current attempt in the new release, where we integrate llama.cpp as well.
Hi, this is a very vague feature request :D
There are some parts of the code (in the main branch; I haven't looked at the upcoming branches mentioned in #395) that depend on private/internal functionality of Transformers types, mostly around the cache and the logits history.
The problem is that this makes it difficult to create wrappers for other model providers (GPTQ, AWQ, llama.cpp, etc.), because you have to replicate some fairly specific private code from the Transformers types that Guidance expects to see.
So my request is this: is it possible to define a smaller interface/surface, not specific to Transformers (or at least not relying on anything internal to Transformers), that could be used to host models from other sources?
No idea how difficult this is or if it's realistic, just something that I would personally love to see.
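To make the request concrete, here is a minimal sketch of what such a backend surface might look like. All names here (`GenerationBackend`, `EchoBackend`, `next_token_logits`) are hypothetical, not Guidance's actual API: the point is that a provider would only need to supply tokenization and next-token logits, with caching kept as an internal detail of each backend rather than a replicated piece of Transformers internals.

```python
from typing import Protocol, Sequence


class GenerationBackend(Protocol):
    """Hypothetical minimal surface a model provider would implement."""

    def tokenize(self, text: str) -> list[int]: ...
    def detokenize(self, tokens: Sequence[int]) -> str: ...
    def next_token_logits(self, tokens: Sequence[int]) -> list[float]: ...


class EchoBackend:
    """Toy character-level backend showing how little a provider must supply."""

    # Printable ASCII as a stand-in vocabulary.
    VOCAB = [chr(i) for i in range(32, 127)]

    def tokenize(self, text: str) -> list[int]:
        return [self.VOCAB.index(c) for c in text]

    def detokenize(self, tokens: Sequence[int]) -> str:
        return "".join(self.VOCAB[t] for t in tokens)

    def next_token_logits(self, tokens: Sequence[int]) -> list[float]:
        # Uniform logits as a placeholder: a real provider (llama.cpp,
        # GPTQ, AWQ, ...) would run its model here, managing any KV cache
        # behind this call instead of exposing Transformers-specific state.
        return [0.0] * len(self.VOCAB)
```

A library built against `GenerationBackend` could then drive any such object without ever touching a Transformers cache or logits history, which is the decoupling this request is asking about.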