https://github.com/NVIDIA/TensorRT-LLM/blob/11e14500f35dd781b535ba009c906f55ecfee3b5/tensorrt_llm/runtime/generation.py#L311-L313 If the `stream` keyword argument is specified when creating the `GenerationSession`, `self.stream` will not be initialized and the code will crash on L313.