
[tests] enable test_mixed_adapter_batches_lora_opt_timing on XPU #2021

Merged
4 commits merged into huggingface:main on Aug 21, 2024

Conversation

faaany
Contributor

@faaany faaany commented Aug 20, 2024

After Fix:

====================================== short test summary info ======================================
PASSED tests/test_custom_models.py::TestMixedAdapterBatches::test_mixed_adapter_batches_lora_opt_timing
=================================== 1 passed, 1 warning in 7.59s ====================================

Like the other tests in this file, this test should not be limited to NVIDIA GPUs, so we can simply remove the test marker.

@faaany
Contributor Author

faaany commented Aug 20, 2024

@BenjaminBossan

Member

@BenjaminBossan BenjaminBossan left a comment

The issue with this change is that this would mean the test also runs on CPU. As the comment further below indicates, we want to avoid this to prevent flakiness:

# Measure timing of running base and adapter separately vs using a mixed batch. Note that on CPU, the
# differences are quite small, so this test requires GPU to avoid flakiness.

I tried the test again just now on CPU and the time differences are indeed much smaller (~25%) compared to GPU (~150%), so this is still true. One solution would be to check if either GPU or XPU is being used.
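
One way such a GPU-or-XPU check might look, as a rough sketch (the helper name and the exact XPU detection call are assumptions rather than code from this PR):

    import torch


    def gpu_or_xpu_available():
        """Return True if a CUDA GPU or an Intel XPU can be used for the timing test."""
        if torch.cuda.is_available():
            return True
        # torch.xpu only exists in recent PyTorch builds or with
        # intel_extension_for_pytorch installed, so guard the attribute access.
        return hasattr(torch, "xpu") and torch.xpu.is_available()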

@faaany
Contributor Author

faaany commented Aug 20, 2024

The issue with this change is that this would mean the test also runs on CPU. As the comment further below indicates, we want to avoid this to prevent flakiness:

# Measure timing of running base and adapter separately vs using a mixed batch. Note that on CPU, the
# differences are quite small, so this test requires GPU to avoid flakiness.

I tried the test again just now on CPU and the time differences are indeed much smaller (~25%) compared to GPU (~150%), so this is still true. One solution would be to check if either GPU or XPU is being used.

Sure, should I add a marker called "require_torch_accelerator" just like in accelerate?

@BenjaminBossan
Member

Sure, should I add a marker called "require_torch_accelerator" just like in accelerate?

Hmm, I think let's not go that far yet. A simple device check at the start of the test function is more explicit; we can add an extra decorator later if we need the same check in many places.
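
A sketch of the kind of explicit in-test device check meant here, assuming the test can read the configured device as a string (the torch_device attribute is hypothetical):

    import pytest


    class TestMixedAdapterBatches:
        # Hypothetical attribute; in the real suite the target device comes from
        # the shared test setup.
        torch_device = "cuda"

        def test_mixed_adapter_batches_lora_opt_timing(self):
            # Explicit check at the start of the test instead of a GPU-only marker:
            # only run where the timing gap is large enough to be stable.
            if self.torch_device not in ("cuda", "xpu"):
                pytest.skip("timing differences are too small on CPU; test would be flaky")
            ...  # rest of the timing test body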

@faaany
Contributor Author

faaany commented Aug 20, 2024

Sure, should I add a marker called "require_torch_accelerator" just like in accelerate?

Hmm, I think let's not go that far yet. A simple device check at the start of the test function is more explicit; we can add an extra decorator later if we need the same check in many places.

Sure, let me update!

@faaany
Contributor Author

faaany commented Aug 21, 2024

@BenjaminBossan how about the require_non_cpu solution I introduced in PR #2026?

@BenjaminBossan
Member

how about the require_non_cpu solution I introduced in PR #2026?

Sounds good, I merged that PR.
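
For context, a minimal sketch of what such a require_non_cpu marker could look like; the real implementation was added in PR #2026 and may differ in detail:

    import pytest
    import torch


    def _non_cpu_device_available():
        # A CUDA GPU or an Intel XPU counts as a non-CPU device.
        return torch.cuda.is_available() or (hasattr(torch, "xpu") and torch.xpu.is_available())


    # Reusable marker: skip the decorated test when only the CPU is available.
    require_non_cpu = pytest.mark.skipif(
        not _non_cpu_device_available(), reason="test requires a non-CPU device"
    )

Applied as @require_non_cpu directly above test_mixed_adapter_batches_lora_opt_timing, it would replace the in-test check while still keeping the test off CPU-only runners.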

@faaany
Contributor Author

faaany commented Aug 21, 2024

Rebase done.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Member

@BenjaminBossan BenjaminBossan left a comment

Thanks for extending this test, LGTM.

Btw. is there some public place where I can check the XPU tests?

@BenjaminBossan BenjaminBossan merged commit fa218e1 into huggingface:main Aug 21, 2024
14 checks passed
@faaany
Contributor Author

faaany commented Aug 22, 2024

Thanks for extending this test, LGTM.

Btw. is there some public place where I can check the XPU tests?

do you mean the test summary on XPU?

@BenjaminBossan
Member

do you mean the test summary on XPU?

Yes, so that I and others can check the current state.

@faaany
Contributor Author

faaany commented Aug 23, 2024

do you mean the test summary on XPU?

Yes, so that I and others can check the current state.

Sure, currently we don't upload the test results to a public repo, but let me come up with a solution. I'll get back to you next Monday.

@BenjaminBossan
Member

Sure, currently we don't upload the test results to a public repo, but let me come up with a solution. I'll get back to you next Monday.

Thanks. It's not super high priority, but right now if we ever break something in PEFT for XPU, we won't know until someone comes to us to report it.

@yao-matrix

Sure, currently we don't upload the test results to a public repo, but let me come up with a solution. I'll get back to you next Monday.

Thanks. It's not super high priority, but right now if we ever break something in PEFT for XPU, we won't know until someone comes to us to report it.

We have a two-step plan:

  1. Screen the unit tests of the HF libraries (transformers, accelerate, peft, diffusers, trl), run them on XPU, and fix or extend the tests we want to run on XPU.
  2. Work with Hugging Face to provide CI machines and integrate them into the CI flow.

We are currently in step 1: transformers, accelerate, and peft are done, while diffusers and trl are still in progress. Once this step is done, we would be glad to work with you to enable CI. :)

@BenjaminBossan
Member

Thanks for the update on the plan.
