Fix for triton dynamic batching requirement #890

mengdong · 2020-11-09T06:39:50Z

according to the definition of batch size in bert demo: parser.add_argument("-b", "--batch-size", default=[], action="append", help="Batch size(s) to optimize for. The engine will be usable with any batch size below this, but may not be optimal for smaller sizes. Can be specified multiple times to optimize for more than one batch size.", type=int)

The min batch size in the optimization profile should always start with 1, instead of increasing by 1 for each separate optimization profile such as [1, 2, 3]
There is no point to output a static batch optimization profile based on the batch size definition.

mengdong · 2020-11-09T06:42:49Z

Hello @rajeevsrao, I ran into an issue when apply converted TRT engine to Triton and apply dynamic batching. I think the code could use a simple fix. Original code always output a static engine when only 1 optimization profile is provided.

according to the definition of batch size in bert demo: `parser.add_argument("-b", "--batch-size", default=[], action="append", help="Batch size(s) to optimize for. The engine will be usable with any batch size below this, but may not be optimal for smaller sizes. Can be specified multiple times to optimize for more than one batch size.", type=int)` The min batch size in the optimization profile should always start with 1, instead of increasing by 1 for each separate optimization profile such as [1, 2, 3] There is no point to output a static batch optimization profile based on the batch size definition. Signed-off-by: DougM <mengdong0427@gmail.com>

rajeevsrao · 2020-11-10T22:21:03Z

Thanks @mengdong - will review.

mengdong · 2021-02-03T01:57:36Z

@rajeevsrao could you merge this so that we can use bert engine with bs>1 and multiple optimization profiles in Triton correctly? Thanks!

mengdong · 2021-02-03T02:03:04Z

thanks!

mengdong force-pushed the patch-1 branch from 839313e to f5dc5b0 Compare November 10, 2020 05:36

rajeevsrao merged commit 23adb1a into NVIDIA:release/7.1 Feb 3, 2021

mengdong mentioned this pull request Feb 4, 2021

Fix for triton dynamic batching requirement in master #1044

Closed

mengdong deleted the patch-1 branch February 4, 2021 07:31

mengdong mentioned this pull request Feb 8, 2021

Update demoBERT input dimensions to match Triton requirement #1051

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for triton dynamic batching requirement #890

Fix for triton dynamic batching requirement #890

mengdong commented Nov 9, 2020

mengdong commented Nov 9, 2020

rajeevsrao commented Nov 10, 2020

mengdong commented Feb 3, 2021

mengdong commented Feb 3, 2021

Fix for triton dynamic batching requirement #890

Fix for triton dynamic batching requirement #890

Conversation

mengdong commented Nov 9, 2020

mengdong commented Nov 9, 2020

rajeevsrao commented Nov 10, 2020

mengdong commented Feb 3, 2021

mengdong commented Feb 3, 2021