-
-
Notifications
You must be signed in to change notification settings - Fork 11.1k
Closed
Closed
Copy link
Description
I used the hermes_tool_parser.py as tool-parser-plugin and registered the parser as hermes_patched, but still have the same problem.
Already referred to #9874 #10395 #10398
Traceback (most recent call last):
File "/app/hermes_tool_parser.py", line 228, in extract_tool_calls_streaming
function_name: Union[str, None] = current_tool_call.get("name")
^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'NoneType' object has no attribute 'get'
Error trying to handle streaming tool call.
Traceback (most recent call last):
File "/app/hermes_tool_parser.py", line 292, in extract_tool_calls_streaming
args_delta_start_loc = cur_arguments_json.index(delta_text) \
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
ValueError: substring not found
Here is how I start vllm service with the latest package:
python3 -m vllm.entrypoints.openai.api_server \
--model /app/Qwen2.5-72B-Instruct-AWQ \
--port 7415 \
--tensor-parallel-size 2 \
--gpu-memory-utilization 0.95 \
--max-model-len 64000 \
--enforce-eager \
--disable_custom_all_reduce \
--enable-auto-tool-choice \
--tool-parser-plugin /app/hermes_tool_parser.py \
--tool-call-parser hermes_patched \
--chat-template /app/qwen.jinja
I also tried using Docker image v0.6.3.post1 v0.6.4 v0.6.4.post1
Originally posted by @Sala8888 in #10398 (comment)
rmalde
Metadata
Metadata
Assignees
Labels
No labels