Implement prompt/generation alignment

[Guidance](https://github.com/microsoft/guidance) implements a method called token healing, which consists in correcting for the quirks introduced by modern encodings like BPE. See [this notebook](https://github.com/microsoft/guidance/blob/main/notebooks/token_healing.ipynb) for a thorough explanation of why this is necessary. The implementation for `Transformers` models is [here](https://github.com/microsoft/guidance/blob/e3c6fe93fa00cb86efc130bbce22aa29100936d4/guidance/llms/_transformers.py#L368).

This consists in backtracking one or several tokens and start generation by imposing that we reproduce the text that corresponds to the removed tokens. This can be integrated in the `__call__` method of the `Sequence` class.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement prompt/generation alignment #161

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Implement prompt/generation alignment #161

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions