enable float8 CI on sm89 #575

vkuzo · 2024-07-30T18:49:41Z

Float8 was moved to torchao in #551. The CI we currently have for float8 runs on:
a. CPU nightly (skips all CUDA-related tests)
b. CUDA nightly (skips all CUDA-related tests that require torch._scaled_mm, because the default machines used for this do not have a high enough CUDA capability version)

We should enable float8 CI on sm89 machines, which have cuda capability 8.9. The performance will not be representative, but we can at least test correctness.

Pointers:

"Try running some periodic jobs on L4" (pytorch#129608) is an example of adding GitHub CI workflows on sm89.

float8 tests currently check for H100s with capability 9.0 (see ao/test/float8/test_base.py, line 57 in 00b76c4). We should update that everywhere to check for capability 8.9, something like below:

# old
is_H100 = torch.cuda.is_available() and torch.cuda.get_device_capability() >= (9, 0)

# new
is_cuda_8_9 = torch.cuda.is_available() and torch.cuda.get_device_capability() >= (8, 9)
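
To illustrate why lowering the threshold is enough: torch.cuda.get_device_capability() returns a (major, minor) tuple, and Python compares tuples element by element. The sketch below is only an illustration and does not touch torch; the helper name `meets` and the hard-coded capability values (A10G as 8.6, L4 as 8.9, H100 as 9.0) are assumptions added for this example, not code from the repository.

```python
# Illustrative (major, minor) capability values; the names exist only in this sketch.
A10G = (8, 6)
L4 = (8, 9)
H100 = (9, 0)


def meets(capability, minimum):
    """Return True if `capability` is at least `minimum`.

    Tuples compare element by element, so (9, 0) >= (8, 9) is True
    and (8, 6) >= (8, 9) is False.
    """
    return capability >= minimum


for name, cap in [("A10G", A10G), ("L4", L4), ("H100", H100)]:
    # Old check uses (9, 0); the proposed check uses (8, 9).
    print(name, "old check:", meets(cap, (9, 0)), "new check:", meets(cap, (8, 9)))
```

With the new threshold, an sm89 machine passes the check while still-older GPUs keep skipping, and H100s continue to pass.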


vkuzo · 2024-07-30T18:50:01Z

cc @drisspg @msaroufim , @jainapurva

msaroufim · 2024-07-30T18:52:00Z

@seemethere alternatively, we could move our CI jobs to L4 instances, which are cheaper than A10G and also support fp8. Last time I tried to move our CI to L4 I got timeouts while looking for runners, so I suspect the L4 pool isn't big enough, but this feels like a free efficiency win. wdyt?

drisspg · 2024-07-30T19:35:54Z

One small note: we will still need the skips on any rowwise tests, since they currently require capability 9.0+.
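
A minimal sketch of how the two thresholds could coexist, assuming unittest-style skips. The constants, the `requires_capability` helper, and the example test class are hypothetical names made up for this illustration; they are not taken from the torchao test suite, which may gate its tests differently.

```python
import unittest

# Hypothetical names for this sketch; not taken from the torchao test suite.
FP8_MIN_CAPABILITY = (8, 9)      # tests that only need torch._scaled_mm
ROWWISE_MIN_CAPABILITY = (9, 0)  # rowwise tests, per the note above


def detect_capability():
    """Return the current device's (major, minor), or None without CUDA."""
    try:
        import torch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    return tuple(torch.cuda.get_device_capability())


def requires_capability(minimum, capability=None):
    """Skip the decorated test unless `capability` is at least `minimum`.

    `capability` defaults to the detected device; passing it explicitly
    makes the helper easy to exercise on a machine without a GPU.
    """
    if capability is None:
        capability = detect_capability()
    supported = capability is not None and capability >= minimum
    return unittest.skipUnless(supported, f"requires CUDA capability >= {minimum}")


class ExampleFloat8Tests(unittest.TestCase):
    @requires_capability(FP8_MIN_CAPABILITY)
    def test_tensorwise(self):
        pass  # placeholder body

    @requires_capability(ROWWISE_MIN_CAPABILITY)
    def test_rowwise(self):
        pass  # placeholder body
```

On an sm89 runner, a test gated on (8, 9) would run while one gated on (9, 0) would still be reported as skipped, so the rowwise skips stay in place.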

vkuzo added the float8 label Jul 30, 2024

vkuzo assigned jainapurva Jul 30, 2024

jainapurva mentioned this issue Aug 1, 2024

Enable float8 CI on sm89 #587

Merged

jainapurva closed this as completed Aug 12, 2024
