[Feature]: Batch Invariant Feature and Performance Optimization #27433

@yewentao256

🚀 The feature, motivation and pitch

We now have basic support for batch-invariant inference, based on https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/

Batch-invariant Inference
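For anyone new to the topic, here is a toy sketch of why batch invariance needs dedicated work. Floating-point addition is not associative, so splitting the same reduction at a different point can change the result in the last bits. This is plain Python for illustration only, not vLLM code; `reduce_in_order` and `split_reduce` are hypothetical names made up for this example.

```python
def reduce_in_order(values):
    """Accumulate left to right with plain float addition."""
    total = 0.0
    for v in values:
        total += v
    return total


def split_reduce(values, split):
    """Reduce values[:split] and values[split:] separately, then add the two partials.

    Changing `split` stands in for a kernel choosing a different
    reduction strategy when the batch size changes.
    """
    return reduce_in_order(values[:split]) + reduce_in_order(values[split:])


values = [0.1, 0.2, 0.3]

same_split_a = split_reduce(values, 2)
same_split_b = split_reduce(values, 2)
other_split = split_reduce(values, 1)

# Same inputs, same split: results match exactly.
print(same_split_a == same_split_b)
# Same inputs, different split: results differ in the last bits.
print(same_split_a == other_split)
```

A batch-invariant kernel fixes the reduction order for each request regardless of what else is in the batch, which is also part of why it can cost performance.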

There is still some work to be done, so this issue tracks it.

TODOs:

Nice to have:

Also, the performance of batch-invariant mode is still not good. Let's optimize it together if you have a free hand!

Project status: In Progress
