🚀 The feature, motivation and pitch
--rope-scaling '{"rope_type":"yarn","factor":4.0,"original_max_position_embeddings":32768}' --max-model-len 131072
Extend the Qwen3 context length from 32768 to 131072 tokens (4.0 × 32768) using YaRN RoPE scaling.
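To show how the two flags relate, here is a minimal sketch that derives the `--max-model-len` value from the scaling factor and builds the same JSON string as above. The helper name `yarn_rope_args` is hypothetical and not part of any library. The sketch only assembles the arguments exactly as they are written in this request and does not check whether a given server version accepts them.

```python
import json


def yarn_rope_args(original_max_len: int, factor: float) -> list[str]:
    """Build the CLI arguments from this request (hypothetical helper)."""
    rope_scaling = {
        "rope_type": "yarn",
        "factor": factor,
        "original_max_position_embeddings": original_max_len,
    }
    # The extended context length is the original length times the factor.
    max_model_len = int(original_max_len * factor)
    return [
        "--rope-scaling",
        # Compact separators reproduce the JSON formatting used in the request.
        json.dumps(rope_scaling, separators=(",", ":")),
        "--max-model-len",
        str(max_model_len),
    ]


args = yarn_rope_args(32768, 4.0)
print(" ".join(args))
```

Keeping the factor and the original length in one place avoids the two flags drifting apart, for example when trying a smaller factor such as 2.0.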
Alternatives
No response
Additional context
No response