Unify LogitsProcessors and `outlines.generate` Dispatchers #957

lapp0 · 2024-06-11T19:14:25Z

What behavior of the library made you think about the improvement?

Currently we implement the same code in multiple places in the repo.

For each inference engine / model there are distinct Outlines integrations (good).
For each each model integration there is a distinct set of logits processor (addressed here)
For each model integration there is a distinct outlines.generate.* dispatch function (addressed here)

Having a distinct set of logits processors has resulted in some models lacking features they would otherwise have for free, and bugs due to discrepancies in implementation.

How would you like it to behave?

To avoid bugs, and make development easier we should handle any quirks of specific models implementations encapsulated within outlines.models, and allow the rest of the code base to be model agnostic.

#956 re-introduces generic logits processors. They are designed to ensure any logits type (mx.array for mlx, np.array for llama-cpp, and torch.tensor for everything else) is efficiently cast to a torch.tensor allowing one torch-based logits processor to handle all logits processing work.

Resolving this issue involves updating outlines.generate such that all other models use these generic logits processors. This change should result in a major version update per https://github.com/outlines-dev/outlines/blob/main/docs/community/versioning.md as old logits processors will be removed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unify LogitsProcessors and `outlines.generate` Dispatchers #957

Unify LogitsProcessors and `outlines.generate` Dispatchers #957

lapp0 commented Jun 11, 2024 •

edited

Loading

Unify LogitsProcessors and outlines.generate Dispatchers #957

Unify LogitsProcessors and outlines.generate Dispatchers #957

Comments

lapp0 commented Jun 11, 2024 • edited Loading

What behavior of the library made you think about the improvement?

How would you like it to behave?

Related

Unify LogitsProcessors and `outlines.generate` Dispatchers #957

Unify LogitsProcessors and `outlines.generate` Dispatchers #957

lapp0 commented Jun 11, 2024 •

edited

Loading