[BFCL] Fix Hanging Inference for OSS Models on GPU Platforms #663
This PR addresses issues encountered when running locally-hosted models on GPU-renting platforms (e.g., Lambda Cloud). Specifically, output from `vllm` was not displayed correctly because these models are launched via subprocesses. Additionally, some multi-turn functions (such as `xargs`) rely on subprocesses themselves, which caused inference on certain test entries (such as `multi_turn_36`) to hang indefinitely, halting the pipeline. To fix this, the terminal logging logic has been updated to use a separate thread for reading from the subprocess pipe and printing to the terminal.
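For illustration, here is a minimal sketch of the threaded pipe-reading approach. The function names (`_stream_subprocess_output`, `launch_with_live_logging`) are hypothetical and not taken from the PR; the point is that a dedicated daemon thread keeps the child process's output pipe drained so it can never block on a full buffer:

```python
import subprocess
import sys
import threading


def _stream_subprocess_output(pipe):
    # Read the subprocess pipe line by line and echo to the terminal.
    # Draining the pipe continuously prevents the child process from
    # blocking once the OS pipe buffer fills up.
    for line in iter(pipe.readline, ""):
        sys.stdout.write(line)
        sys.stdout.flush()
    pipe.close()


def launch_with_live_logging(cmd):
    # Launch the model-serving subprocess (e.g., a vllm server) with
    # line-buffered text pipes, merging stderr into stdout.
    process = subprocess.Popen(
        cmd,
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,
        text=True,
        bufsize=1,
    )
    # Daemon thread: it exits automatically with the main process,
    # so a hung reader cannot keep the pipeline alive.
    reader = threading.Thread(
        target=_stream_subprocess_output,
        args=(process.stdout,),
        daemon=True,
    )
    reader.start()
    return process, reader
```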
Also, for readability, the `_format_prompt` function has been moved to the "Prompting methods" section; this does not change the leaderboard score.