Commit

Merge pull request #30 from anandhu-eng/amdenable
WIP - amd llama2 implementation
arjunsuresh authored Oct 21, 2024
2 parents fa0eeca + 7eee072 commit 2bf7aac
Showing 2 changed files with 12 additions and 0 deletions.
7 changes: 7 additions & 0 deletions docs/benchmarks/language/llama2-70b.md
@@ -25,3 +25,10 @@ hide:
{{ mlperf_inference_implementation_readme (4, "llama2-70b-99", "neuralmagic") }}

{{ mlperf_inference_implementation_readme (4, "llama2-70b-99.9", "neuralmagic") }}

=== "AMD"
    ## AMD MLPerf Implementation

{{ mlperf_inference_implementation_readme (4, "llama2-70b-99", "amd") }}

{{ mlperf_inference_implementation_readme (4, "llama2-70b-99.9", "amd") }}
5 changes: 5 additions & 0 deletions main.py
@@ -43,6 +43,11 @@ def mlperf_inference_implementation_readme(spaces, model, implementation, *, imp
        devices = [ "CUDA" ]
        frameworks = [ "TensorRT" ]

    elif implementation == "amd":
        devices = [ "cuda" ]
        frameworks = [ "pytorch" ]
        execution_envs.remove("Docker")

    elif implementation == "neuralmagic":
        devices = [ "CUDA" ]
        frameworks = [ "pytorch" ]
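
Note on the new branch: the hunk above only shows part of `mlperf_inference_implementation_readme`, so the following is a minimal standalone sketch of the dispatch it appears to perform, not the actual function. The helper name `select_targets` and the default `execution_envs` value of `("Docker", "Native")` are assumptions made for illustration; only the per-implementation device and framework values and the `remove("Docker")` call come from the diff.

```python
def select_targets(implementation, execution_envs=("Docker", "Native")):
    """Hypothetical helper mirroring the branch structure shown in the diff.

    The default execution_envs is an assumption; the real default lives in
    a part of main.py that this commit view does not show.
    """
    envs = list(execution_envs)  # copy so the caller's sequence is untouched

    if implementation == "nvidia":
        devices = ["CUDA"]
        frameworks = ["TensorRT"]
    elif implementation == "amd":
        devices = ["cuda"]
        frameworks = ["pytorch"]
        # list.remove raises ValueError if "Docker" is absent, so this
        # relies on "Docker" being present in the starting list.
        envs.remove("Docker")
    elif implementation == "neuralmagic":
        devices = ["CUDA"]
        frameworks = ["pytorch"]
    else:
        raise ValueError(f"unsupported implementation: {implementation}")

    return devices, frameworks, envs


if __name__ == "__main__":
    print(select_targets("amd"))
    print(select_targets("neuralmagic"))
```

Under these assumptions, the AMD branch differs from the Neural Magic branch in two ways: the device string is lowercase `"cuda"`, and Docker is dropped from the execution environments.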