At this point I think that it would only make sense to benchmark the CFG-guided generation. Regex-guided generation is only a dictionary call at each step, so there really isn't anything we could do that would move the needle.
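To illustrate why the regex-guided path leaves little to optimize, here is a minimal sketch of a per-step lookup. The `INDEX` table, the three-token vocabulary, and the helper names are all hypothetical and made up for illustration; this is not Outlines' actual data structure or API.

```python
# Hypothetical toy index, NOT Outlines' actual data structure.
# Maps FSM state -> {allowed token id -> next state} for the regex "ab*c"
# over a made-up vocabulary {0: "a", 1: "b", 2: "c"}.
INDEX = {
    0: {0: 1},        # start: only "a" is allowed
    1: {1: 1, 2: 2},  # after "a": any number of "b", then "c"
    2: {},            # accepting state: nothing more is allowed
}


def allowed_tokens(state: int) -> list[int]:
    """Return the allowed token ids; the per-step cost is one dict lookup."""
    return list(INDEX[state])


def next_state(state: int, token_id: int) -> int:
    """Advance the FSM after a token has been sampled."""
    return INDEX[state][token_id]


# Walk the token sequence for "abbc" through the index.
state = 0
for token_id in (0, 1, 1, 2):
    assert token_id in allowed_tokens(state)
    state = next_state(state, token_id)
```

If the real index has this shape, the per-token work does not depend on how complex the regex is, which is consistent with the point above.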
Initialization benchmarks are introduced in #542
We should extend these benchmarks to measure the performance of inference.
Goal
Outlines shouldn't be a bottleneck for most inference. A reasonable goal can be set based on
Benchmarks will help us achieve and maintain that goal.
What must be benchmarked
and Regex-guided generation
Proposed method
It's annoying to need a GPU to run tests. We shouldn't do actual inference in performance benchmarks.
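One way to do this, sketched below under stated assumptions, is to replace the model's forward pass with randomly generated logits and time only the guidance step. `fake_logits`, `mask_logits`, `benchmark`, and `VOCAB_SIZE` are hypothetical names chosen for this example, and the masking function is a stand-in for whatever Outlines does per step, not its real implementation.

```python
import random
import time

VOCAB_SIZE = 1_000  # assumed vocabulary size, for illustration only


def fake_logits(rng: random.Random) -> list[float]:
    """Stand-in for a model forward pass: random scores, no GPU needed."""
    return [rng.random() for _ in range(VOCAB_SIZE)]


def mask_logits(logits: list[float], allowed: list[int]) -> list[float]:
    """Hypothetical guide step: keep allowed token ids, set the rest to -inf."""
    masked = [float("-inf")] * len(logits)
    for token_id in allowed:
        masked[token_id] = logits[token_id]
    return masked


def benchmark(n_steps: int = 200, seed: int = 0) -> float:
    """Return the mean number of seconds per step spent in masking alone."""
    rng = random.Random(seed)
    allowed = rng.sample(range(VOCAB_SIZE), 50)
    elapsed = 0.0
    for _ in range(n_steps):
        logits = fake_logits(rng)  # generating fake logits is not timed
        start = time.perf_counter()
        mask_logits(logits, allowed)
        elapsed += time.perf_counter() - start
    return elapsed / n_steps


mean_step = benchmark()
print(f"mean masking overhead: {mean_step * 1e6:.1f} us/step")
```

Because the fake logits are produced outside the timed region, the reported number reflects only the guidance overhead, and it could then be compared against a typical per-token model latency.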