-
Notifications
You must be signed in to change notification settings - Fork 607
Closed
Labels
questionFurther information is requestedFurther information is requested
Description
I have a text-generator language model that is compressed and is in .bin format, and it can accessed by command line terminal. It generates about one word per second and prints out every word in realtime when accessed through command line. I would like to deploy my model using cortex but I'm struggling to get the output in realtime word-by-word. Right now my code prints out only one line a time.
import subprocess
def run_command(text):
command = ['./mycommand', text]
process = subprocess.Popen(command, stdout=subprocess.PIPE, universal_newlines=True, bufsize=-1)
while True:
output = process.stdout.readline()
if output == '' and process.poll() is not None:
break
if output:
print(output.strip())
run_command('TEXT')
One line may include about 20 words, so that means 1 line will be displayed every 20 seconds (since my model outputs roughly 1 word/ second). I would really like the output to be more dynamic and output one word at a time (as it does through command line) instead of just one line. Is there a way this can be achieved?
Metadata
Metadata
Assignees
Labels
questionFurther information is requestedFurther information is requested