Update of MLX-LM generate_step to support repetition_penalty #1134

ea167 · 2024-09-06T04:14:38Z

To fix #1131, this PR updates mlxlm.py so that the generate_step function supports the repetition_penalty and repetition_context_size parameters.

This helps keep the model from falling into an infinite loop in which it generates the same group of tokens endlessly.

Both parameters, repetition_penalty and repetition_context_size, can therefore be passed directly as arguments to the generator. Here is an example:

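from outlines import generate, samplers

# model, JSON_SCHEMA and my_prompt are assumed to be defined earlier
sampler = samplers.multinomial(top_p=0.1)
generator = generate.json(model, JSON_SCHEMA, sampler)
json_answer = generator(my_prompt, max_tokens=1000, repetition_penalty=1.1, repetition_context_size=20)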
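For intuition about what the two parameters control, here is a minimal, dependency-free sketch of a repetition penalty applied to one step's logits. It is not the code from this PR: the function name `apply_repetition_penalty` and the plain-list representation are hypothetical, and the scaling rule (divide positive logits, multiply negative ones) is the commonly used convention, assumed here rather than taken from the diff.

```python
def apply_repetition_penalty(logits, generated_tokens, penalty, context_size):
    """Return a copy of `logits` with recently generated tokens made less likely.

    Hypothetical helper for illustration only; not the PR's implementation.
    """
    penalized = list(logits)
    # Only the last `context_size` generated tokens are penalized.
    recent = generated_tokens[-context_size:] if context_size > 0 else []
    for token_id in set(recent):
        value = penalized[token_id]
        # With penalty > 1, both branches lower the logit:
        # negative values are multiplied, non-negative values are divided.
        penalized[token_id] = value * penalty if value < 0 else value / penalty
    return penalized


logits = [2.0, -1.0, 0.5, 3.0]   # one score per vocabulary entry
history = [3, 0, 3, 1]           # token ids generated so far
result = apply_repetition_penalty(logits, history, penalty=2.0, context_size=2)
print(result)
```

With `context_size=2`, only tokens 3 and 1 are in the window, so token 0 keeps its score even though it appeared earlier. A larger repetition_context_size widens that window, and a repetition_penalty closer to 1.0 weakens the effect.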


ea167 mentioned this pull request Sep 6, 2024

Infinite repetitions and invalid JSON - Outlines with MLX #1131

Open

Update of generate_step to support repetition_penalty and repetition_context_size params

515c197

rlouf force-pushed the main branch from 13fd9a3 to 515c197 Compare September 17, 2024 13:22

Merge branch 'dottxt-ai:main' into main

930565c
