This repository has been archived by the owner on Sep 18, 2024. It is now read-only.

[Model Compression] transformer pruner #4180

Closed
wants to merge 12 commits into from

Conversation

J-shang
Contributor

@J-shang J-shang commented Sep 14, 2021

Migrate the transformer pruner to compression v2.

The test is heavy, so it will not be added.

@J-shang J-shang force-pushed the compression_v2_transformers branch from ad5a4b0 to dea72b0 Compare November 1, 2021 06:00
@liuzhe-lz liuzhe-lz mentioned this pull request Nov 5, 2021
86 tasks
@J-shang J-shang marked this pull request as ready for review November 9, 2021 02:42
@J-shang
Contributor Author

J-shang commented Nov 16, 2021

According to the latest measurements, pruning Q and K at the same positions while pruning V freely gives better results in most cases than pruning Q, K, and V all at the same positions. This implementation may need to be redesigned.
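
To make the two schemes concrete, here is a minimal sketch of how the masks might differ. It is not the code in this PR: the helper names, the toy importance scores, and the choice to combine scores by summing them are all assumptions made for illustration, and "position" is taken to mean an index along the projection's output dimension.

```python
# Hypothetical illustration only; none of these helpers come from the PR.

def top_k_mask(scores, keep):
    """Return a 0/1 mask that keeps the `keep` highest-scoring positions."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    kept = set(order[:keep])
    return [1 if i in kept else 0 for i in range(len(scores))]


def shared_qkv_masks(q, k, v, keep):
    """Q, K, and V are all pruned at the same positions."""
    combined = [a + b + c for a, b, c in zip(q, k, v)]  # assumed: sum of scores
    mask = top_k_mask(combined, keep)
    return mask, mask, mask


def shared_qk_free_v_masks(q, k, v, keep):
    """Q and K share one mask; V gets its own mask from its own scores."""
    qk_mask = top_k_mask([a + b for a, b in zip(q, k)], keep)
    v_mask = top_k_mask(v, keep)
    return qk_mask, qk_mask, v_mask


# Toy per-position importance scores (made up for this example).
q_scores = [0.9, 0.1, 0.8, 0.2]
k_scores = [0.7, 0.2, 0.9, 0.1]
v_scores = [0.1, 0.9, 0.2, 0.8]

print(shared_qkv_masks(q_scores, k_scores, v_scores, keep=2))
print(shared_qk_free_v_masks(q_scores, k_scores, v_scores, keep=2))
```

With these toy scores, the shared scheme forces V to keep positions that matter little to it, while the second scheme lets V keep its own most important positions. Q and K still share a mask in both schemes, since their product requires matching dimensions.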

@J-shang J-shang marked this pull request as draft November 22, 2021 04:32
@liuzhe-lz
Contributor

Deferred to next release.

@J-shang J-shang mentioned this pull request Jan 10, 2022
51 tasks
@J-shang J-shang closed this Jan 26, 2022