Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Performance report - 2023 M2 Max 96GB #6

Closed
explorigin opened this issue Aug 17, 2024 · 10 comments
Closed

Performance report - 2023 M2 Max 96GB #6

explorigin opened this issue Aug 17, 2024 · 10 comments

Comments

@explorigin
Copy link
Contributor

explorigin commented Aug 17, 2024

(Using my CLI options here )

time python main.py --prompt "Luxury food photograph" --steps 2 --seed 2 3.94s user 10.78s system 56% cpu 26.109 total
image

time python main.py --prompt "detailed cinematic dof render of an old dusty detailed CRT monitor on a wooden desk in a dim room with items around, messy dirty room. On the screen are the letters "FLUX" glowing softly. High detail hard surface render" --steps 2 --seed 2 3.94s user 11.75s system 61% cpu 25.565 total
image

So about 25-26 seconds.

@filipstrand
Copy link
Owner

@explorigin Thanks for the report (and nice picture with the food for --seed 2 :) ). I have looked at your PR and will include this performance result and some other numbers in the README once this PR is merged.

@7enChan
Copy link

7enChan commented Aug 20, 2024

(Using my CLI options here )

time python main.py --prompt "Luxury food photograph" --steps 2 --seed 2 3.94s user 10.78s system 56% cpu 26.109 total
image

time python main.py --prompt "detailed cinematic dof render of an old dusty detailed CRT monitor on a wooden desk in a dim room with items around, messy dirty room. On the screen are the letters "FLUX" glowing softly. High detail hard surface render" --steps 2 --seed 2 3.94s user 11.75s system 61% cpu 25.565 total
image

So about 25-26 seconds.

On the same machine (M2 Max 96GB MBP), the same prompt takes about 8 seconds longer to complete. I'm not sure what the reason is. I'm running this within a conda virtual environment with python 3.12.4

time python main.py --prompt "Luxury food photograph" --steps 2 --seed 2
Fetching 8 files: 100%|███████████████████████| 8/8 [00:00<00:00, 124737.67it/s]
Fetching 7 files: 100%|█████████████████████████| 7/7 [00:00<00:00, 9670.66it/s]
100%|█████████████████████████████████████████████| 2/2 [00:27<00:00, 13.83s/it]
python main.py --prompt "Luxury food photograph" --steps 2 --seed 2 4.19s user 12.84s system 49% cpu 34.180 total

time python main.py --prompt "detailed cinematic dof render of an old dusty detailed CRT monitor on a wooden desk in a dim room with items around, messy dirty room. On the screen are the letters "FLUX" glowing softly. High detail hard surface render" --steps 2 --seed 2
Fetching 8 files: 100%|███████████████████████| 8/8 [00:00<00:00, 127100.12it/s]
Fetching 7 files: 100%|████████████████████████| 7/7 [00:00<00:00, 74518.09it/s]
100%|█████████████████████████████████████████████| 2/2 [00:28<00:00, 14.00s/it]
python main.py --prompt --steps 2 --seed 2 4.26s user 13.21s system 49% cpu 35.015 total

@explorigin
Copy link
Contributor Author

There could be a loading time difference. Let's compare a few more things:

  • virtualenv environment, Python 3.12.1
  • Sonoma 14.6.1
  • 4TB harddrive (APFS Encrypted)

@7enChan
Copy link

7enChan commented Aug 20, 2024

There could be a loading time difference. Let's compare a few more things:

  • virtualenv environment, Python 3.12.1
  • Sonoma 14.6.1
  • 4TB harddrive (APFS Encrypted)

Thanks for the reply. Here is mine:

  • virtualenv environment, Python 3.12.4
  • Sonoma 14.6.1
  • 1TB harddrive (APFS Not encrypted)

@7enChan
Copy link

7enChan commented Aug 20, 2024

There could be a loading time difference. Let's compare a few more things:

  • virtualenv environment, Python 3.12.1
  • Sonoma 14.6.1
  • 4TB harddrive (APFS Encrypted)

Thanks for the reply. Here is mine:

  • virtualenv environment, Python 3.12.4
  • Sonoma 14.6.1
  • 1TB harddrive (APFS Not encrypted)

Forgot one thing, I'm using the latest version of mflux

@explorigin
Copy link
Contributor Author

Encryption also includes compression. For loading large models into memory, that might be the difference if the drive is the bottleneck. I'm speculating.

@7enChan
Copy link

7enChan commented Aug 20, 2024

Encryption also includes compression. For loading large models into memory, that might be the difference if the drive is the bottleneck. I'm speculating.

Interesting. If the drive is the bottleneck, then the time difference for FLUX.1 dev, which takes 20 steps, should be less.

@7enChan
Copy link

7enChan commented Aug 20, 2024

Encryption also includes compression. For loading large models into memory, that might be the difference if the drive is the bottleneck. I'm speculating.

Interesting. If the drive is the bottleneck, then the time difference for FLUX.1 dev, which takes 20 steps, should be less.

I tried turning on encryption and it didn't take significantly less time, it was still around 34s 😂

@explorigin
Copy link
Contributor Author

I'm speculating

That leaves drive size as the remaining differentiator.

@7enChan
Copy link

7enChan commented Aug 21, 2024

I'm speculating

That leaves drive size as the remaining differentiator.

Yeah, maybe the 4T harddrive doesn't have the same read/write speeds as 1T...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants