Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LlamaSpeak cannot run with Llama-3.1-70B-Instruct #646

Open
SuWeipeng opened this issue Sep 26, 2024 · 2 comments
Open

LlamaSpeak cannot run with Llama-3.1-70B-Instruct #646

SuWeipeng opened this issue Sep 26, 2024 · 2 comments

Comments

@SuWeipeng
Copy link

I'm trying to run a 70B model on my Jetson AGX Orin(64x64GB), but it automatically interrupts when I simply replace the 8B model. How can I get the 70B model to run?

When I run the command below, something interrupt the process automatically.

jetson-containers run --env HUGGINGFACE_TOKEN=hf_xxxxx  \
  dustynv/nano_llm:r36.3.0   \
  python3 -m nano_llm.agents.web_chat --api=mlc  --debug   \
    --model meta-llama/Meta-Llama-3.1-70B-Instruct     \
    --asr=whisper --tts=piper

2024-09-26 163215

If I run with 8B model, it works very well, for example:

jetson-containers run --env HUGGINGFACE_TOKEN=hf_xxxxx  \
  dustynv/nano_llm:r36.3.0   \
  python3 -m nano_llm.agents.web_chat --api=mlc  --debug   \
    --model meta-llama/Meta-Llama-3.1-8B-Instruct     \
    --asr=whisper --tts=piper
@dusty-nv
Copy link
Owner

@SuWeipeng can you test Llama-3.1-70B with the baseline nano_llm.chat first? How much memory is it using? I can't recall explicitly testing Llama-3.1-70B, but have done so with Llama-2-70B

@SuWeipeng
Copy link
Author

SuWeipeng commented Sep 27, 2024

@SuWeipeng can you test Llama-3.1-70B with the baseline nano_llm.chat first? How much memory is it using? I can't recall explicitly testing Llama-3.1-70B, but have done so with Llama-2-70B

@dusty-nv I'm a brand-new man, could you tell me how can I do this?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants