How to fine-tune a Whisper model with 'initial_prompt'

After fine-tuning the Whisper v2 model on my own data, decoding with 'initial_prompt' gives bad results, while decoding without it gives good results.
However, the base Whisper v2 model (not fine-tuned) also decodes well with 'initial_prompt'. Does this mean that if I want to use 'initial_prompt' during decoding, I must also include a prompt during training?
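
One possible reading of "adding the prompt during training" is sketched below. This is only an illustration, not a confirmed recipe: it assumes the prompt tokens are placed before the usual start-of-transcript sequence behind a "previous text" marker, and that the loss is masked on the prompt so only the transcript is learned. The function name `build_prompted_example`, the prompt-length cap of 223 tokens, and all token ids in the usage example are hypothetical placeholders; real ids would come from the Whisper tokenizer.

```python
from typing import List, Tuple

# Label value that cross-entropy losses commonly skip (PyTorch's default ignore_index).
IGNORE_INDEX = -100


def build_prompted_example(
    prompt_ids: List[int],
    text_ids: List[int],
    sot_prev_id: int,
    sot_sequence_ids: List[int],
    eot_id: int,
    max_prompt_len: int = 223,  # assumed cap: roughly half the decoder context
) -> Tuple[List[int], List[int]]:
    """Build (decoder_input, labels) for one training example with a prompt.

    Assumed layout: [sot_prev] prompt... [sot, lang, task] text... [eot]
    """
    if max_prompt_len <= 0:
        raise ValueError("max_prompt_len must be positive")

    # Keep only the most recent prompt tokens if the prompt is too long.
    prompt_ids = prompt_ids[-max_prompt_len:]
    # Omit the marker entirely when there is no prompt.
    prefix = [sot_prev_id] + prompt_ids if prompt_ids else []

    tokens = prefix + list(sot_sequence_ids) + list(text_ids) + [eot_id]

    # Teacher forcing: the decoder sees tokens[:-1] and predicts tokens[1:].
    decoder_input = tokens[:-1]
    labels = tokens[1:]

    # Mask every target that is a prompt token or part of the start sequence,
    # so the loss covers only the transcript and the end-of-text token.
    # (Masking the start sequence as well is a design choice made here.)
    n_masked = len(prefix) + len(sot_sequence_ids) - 1
    labels = [IGNORE_INDEX] * n_masked + labels[n_masked:]
    return decoder_input, labels


# Usage with placeholder ids (not real Whisper token ids).
decoder_input, labels = build_prompted_example(
    prompt_ids=[11, 12],
    text_ids=[21, 22],
    sot_prev_id=1,
    sot_sequence_ids=[2, 3, 4],
    eot_id=9,
)
print(decoder_input)
print(labels)
```

If the fine-tuning data never contains examples laid out like this, the fine-tuned model may have drifted away from handling a prompt prefix, which would be consistent with the behaviour described above; mixing prompted and unprompted examples during training is one thing that could be tried.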