new version Bone #2233

JL-er · 2024-11-25T08:19:42Z

https://arxiv.org/abs/2409.15371
Bone has been updated to a new version. The current Bone is faster, more memory-efficient, and performs better than the Lora series, making it a foundational structure comparable to Lora. The previous version of Bone has now been renamed to Bat, and users can use it by including "bat" in init_weight.

BenjaminBossan

Thanks for the update, addressing some of the shortcomings of the initial Bone implementation. Just to be clear: What was previously called "Bone" now corresponds to "Bat" and the current "Bone" is something different (but similar). Am I right in my understanding that the new Bone is more memory efficient and faster than Bat but Bat performs better (tables 4 and 7)?

I added a few smaller comments to the PR. On top of that, let's ensure that both variants are covered by testing. For this, I think we need to add new rows with init_weights="bat" here:

peft/tests/test_custom_models.py

Lines 301 to 304 in eaaf03c

    
           ("Vanilla MLP 1 Bone", "MLP", BoneConfig, {"target_modules": "lin0", "r": 2}), 
        
           ("Vanilla MLP 2 Bone", "MLP", BoneConfig, {"target_modules": ["lin0"], "r": 2}), 
        
           ("Vanilla MLP 3 Bone", "MLP", BoneConfig, {"target_modules": ["lin0", "lin1"], "r": 2}), 
        
           ("Vanilla MLP 5 Bone", "MLP", BoneConfig, {"target_modules": ["lin0"], "modules_to_save": ["lin1"], "r": 2}),

Finally, WDYT about updating the Bone finetuning example to explicitly show results for Bone and Bat?

docs/source/conceptual_guides/adapter.md

src/peft/tuners/bone/layer.py

JL-er · 2024-11-25T10:40:24Z

ddressing some of the shortcomings of the initial Bone implementation. Just to be clear: What was previously called "Bone" now corresponds to "Bat" and the current "Bone" is something different (but similar). Am I right in my understanding that the new Bone is more memory efficient and faster than Bat but Bat performs better (tables 4 and 7)?

Yes, your understanding is correct. Bone is now more like a foundational method comparable to Lora. Bat is more like an improved method compared to Pissa. However, Bone itself has already surpassed Pissa.

JL-er · 2024-11-25T11:15:19Z

Finally, WDYT about updating the Bone finetuning example to explicitly show results for Bone and Bat?

I don't quite understand this issue. Currently, Bat is slower, so I think people would prefer to use Bone for training or research.
The other parts have been updated.

BenjaminBossan · 2024-11-25T11:46:00Z

I don't quite understand this issue. Currently, Bat is slower, so I think people would prefer to use Bone for training or research.

That's true, but Bat has slightly better scores, right? So some users might be willing to trade memory+speed for better scores. If you think it's not really worth it, at least the example could mention that this is something that users can try out?

JL-er · 2024-11-25T13:59:15Z

I don't quite understand this issue. Currently, Bat is slower, so I think people would prefer to use Bone for training or research.

That's true, but Bat has slightly better scores, right? So some users might be willing to trade memory+speed for better scores. If you think it's not really worth it, at least the example could mention that this is something that users can try out?

That's indeed the case, so where should I include this explanation?

BenjaminBossan · 2024-11-25T17:18:31Z

That's indeed the case, so where should I include this explanation?

How about adding a sentence or two here: https://github.com/huggingface/peft/tree/main/examples/bone_finetuning#advanced-usage. It could also be a good idea to add a flag the args of the script to enable Bat. WDYT?

JL-er · 2024-11-26T03:47:59Z

That's indeed the case, so where should I include this explanation?

How about adding a sentence or two here: https://github.com/huggingface/peft/tree/main/examples/bone_finetuning#advanced-usage. It could also be a good idea to add a flag the args of the script to enable Bat. WDYT?

Okay, I have added instructions to guide users on how to use Bat.

BenjaminBossan · 2024-11-26T15:12:22Z

Thanks for this update @JL-er. Could you please merge the latest main branch of PEFT, as there is a fix there required for CI.

JL-er · 2024-11-26T15:14:22Z

Thanks for this update @JL-er. Could you please merge the latest main branch of PEFT, as there is a fix there required for CI.

Completed.

HuggingFaceDocBuilderDev · 2024-11-26T15:19:09Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

BenjaminBossan · 2024-11-26T15:40:20Z

@JL-er Thanks for merging. Running the tests leads to a lot of errors related to Bone, such as

FAILED tests/test_custom_models.py::PeftCustomModelTester::test_disable_adapters_with_merging_087_Vanilla_MLP_1_Bone - RuntimeError: shape '[9, 10, 5, 2]' is invalid for input of size 90

Could you please check what happened there?

JL-er · 2024-11-26T16:26:24Z

@JL-er Thanks for merging. Running the tests leads to a lot of errors related to Bone, such as

FAILED tests/test_custom_models.py::PeftCustomModelTester::test_disable_adapters_with_merging_087_Vanilla_MLP_1_Bone - RuntimeError: shape '[9, 10, 5, 2]' is invalid for input of size 90

Could you please check what happened there?

fix all

BenjaminBossan

Thanks for this update to the Bone method. These improvements should hopefully help with the adoption of this method.

Note that technically, this is a backwards incompatible change, since the underlying algorithm for Bone was changed. This is fine as Bone is not part of any PEFT release. But after the next PEFT release (could be next week), this type of change will no longer be possible (we can still add new variations of Bone, but the default one needs to stay the same).

JL-er · 2024-11-27T12:04:27Z

Thanks for this update to the Bone method. These improvements should hopefully help with the adoption of this method.

Note that technically, this is a backwards incompatible change, since the underlying algorithm for Bone was changed. This is fine as Bone is not part of any PEFT release. But after the next PEFT release (could be next week), this type of change will no longer be possible (we can still add new variations of Bone, but the default one needs to stay the same).

Thank you for your reminder. Currently, Bone is a very basic method that is better than LoRA in terms of speed and memory usage, so I will not make any modifications to it.

JL-er · 2024-12-20T08:03:28Z

@BenjaminBossan My paper has undergone significant revisions. Can I update the citations and explanations?

BenjaminBossan · 2024-12-20T11:38:30Z

@JL-er Yes sure, go ahead. Did anything on the implementation side change? This would be more difficult to amend now.

JL-er · 2024-12-21T08:29:45Z

@JL-er Yes sure, go ahead. Did anything on the implementation side change? This would be more difficult to amend now.

The structure remains unchanged, only the paper section has been modified to provide more reasonable explanations.

new version

611138b

BenjaminBossan requested changes Nov 25, 2024

View reviewed changes

update

eb9cc72

useage Bat

950ce78

Merge branch 'huggingface:main' into main

d5a494f

fix bone input shape

bfb970e

BenjaminBossan approved these changes Nov 27, 2024

View reviewed changes

BenjaminBossan merged commit 60978d7 into huggingface:main Nov 27, 2024
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

new version Bone #2233

new version Bone #2233

JL-er commented Nov 25, 2024

BenjaminBossan left a comment

JL-er commented Nov 25, 2024

JL-er commented Nov 25, 2024

BenjaminBossan commented Nov 25, 2024

JL-er commented Nov 25, 2024

BenjaminBossan commented Nov 25, 2024

JL-er commented Nov 26, 2024

BenjaminBossan commented Nov 26, 2024

JL-er commented Nov 26, 2024

HuggingFaceDocBuilderDev commented Nov 26, 2024

BenjaminBossan commented Nov 26, 2024

JL-er commented Nov 26, 2024

BenjaminBossan left a comment

JL-er commented Nov 27, 2024

JL-er commented Dec 20, 2024

BenjaminBossan commented Dec 20, 2024

JL-er commented Dec 21, 2024

	("Vanilla MLP 1 Bone", "MLP", BoneConfig, {"target_modules": "lin0", "r": 2}),
	("Vanilla MLP 2 Bone", "MLP", BoneConfig, {"target_modules": ["lin0"], "r": 2}),
	("Vanilla MLP 3 Bone", "MLP", BoneConfig, {"target_modules": ["lin0", "lin1"], "r": 2}),
	("Vanilla MLP 5 Bone", "MLP", BoneConfig, {"target_modules": ["lin0"], "modules_to_save": ["lin1"], "r": 2}),

new version Bone #2233

new version Bone #2233

Conversation

JL-er commented Nov 25, 2024

BenjaminBossan left a comment

Choose a reason for hiding this comment

JL-er commented Nov 25, 2024

JL-er commented Nov 25, 2024

BenjaminBossan commented Nov 25, 2024

JL-er commented Nov 25, 2024

BenjaminBossan commented Nov 25, 2024

JL-er commented Nov 26, 2024

BenjaminBossan commented Nov 26, 2024

JL-er commented Nov 26, 2024

HuggingFaceDocBuilderDev commented Nov 26, 2024

BenjaminBossan commented Nov 26, 2024

JL-er commented Nov 26, 2024

BenjaminBossan left a comment

Choose a reason for hiding this comment

JL-er commented Nov 27, 2024

JL-er commented Dec 20, 2024

BenjaminBossan commented Dec 20, 2024

JL-er commented Dec 21, 2024