While we added critical protection against failure when inputs are too long, if the prompt still happens to be too long, tokenization will truncate with truncation=True to some max_length (left, right, or otherwise) and can drop prompt markers like `<human>:` etc.
Right now we form the prompt, then tokenize. Instead we need to tokenize first ourselves (even for h2oai_pipeline), separate the human and bot parts using split or similar, remove whole non-essential chunks to get the token length down, and then join back up.
Otherwise, a `<human>:` that got truncated out confuses the model: it just sees `<bot>:` and will repeat answers etc. with another `<bot>:`. The output is cleaned of that, but stopping doesn't trigger on it, so the generation keeps running in the background and lags the user.
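A minimal sketch of that turn-aware approach, assuming `<human>:`/`<bot>:` style markers (the actual strings depend on the model's prompt_type) and a Hugging Face tokenizer; `truncate_prompt` is a hypothetical helper here, not existing repo API:

```python
from transformers import AutoTokenizer

# Hypothetical marker strings; the real ones depend on the prompt_type.
HUMAN = "<human>:"
BOT = "<bot>:"

def truncate_prompt(prompt: str, tokenizer, max_length: int) -> str:
    """Drop the oldest (human, bot) turns until the prompt fits within
    max_length tokens, keeping marker pairs intact so the model never
    sees a dangling <bot>: without its <human>:."""
    def num_tokens(text: str) -> int:
        return len(tokenizer(text)["input_ids"])

    if num_tokens(prompt) <= max_length:
        return prompt

    # Split into turns on the human marker; the first chunk is any
    # system/context preamble, which we always keep.
    preamble, *turns = prompt.split(HUMAN)
    turns = [HUMAN + t for t in turns]

    # Drop whole turns oldest-first; the last turn holds the current question.
    while len(turns) > 1 and num_tokens(preamble + "".join(turns)) > max_length:
        turns.pop(0)

    # If a single remaining turn still overflows, the caller has to fall
    # back to hard truncation, but the markers stay well-formed.
    return preamble + "".join(turns)


if __name__ == "__main__":
    # Tokenizer name is illustrative only.
    tokenizer = AutoTokenizer.from_pretrained("h2oai/h2ogpt-oig-oasst1-512-6.9b")
    prompt = "<human>: hi\n<bot>: hello\n<human>: what is 2+2?\n<bot>:"
    print(truncate_prompt(prompt, tokenizer, max_length=16))
```

Dropping whole turns oldest-first keeps every remaining `<human>:` paired with its `<bot>:`, so the model never sees the dangling marker that triggers the repeated-answer loop described above.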