Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ Docs ] Overhaul accelerate user guide #76

Merged
merged 39 commits into from
Aug 14, 2024

Conversation

robertgshaw2-neuralmagic
Copy link
Collaborator

SUMMARY:

  • update accelerate examples with README
  • update accelerate cpu offloading example to use fp8 + be consistent with other examples
  • update accelerate calibration example to highlight multi-gpu setup

TEST PLAN:

  • manually running examples

@robertgshaw2-neuralmagic robertgshaw2-neuralmagic changed the title Switch big model example [ DOCS ] Overhaul big-model user guide Aug 11, 2024
@robertgshaw2-neuralmagic robertgshaw2-neuralmagic changed the title [ DOCS ] Overhaul big-model user guide [ DOCS ] Overhaul accelerate user guide Aug 11, 2024
@robertgshaw2-neuralmagic robertgshaw2-neuralmagic changed the title [ DOCS ] Overhaul accelerate user guide [ Docs ] Overhaul accelerate user guide Aug 11, 2024
@Satrat Satrat self-requested a review August 14, 2024 20:57
README.md Outdated Show resolved Hide resolved
@robertgshaw2-neuralmagic robertgshaw2-neuralmagic merged commit d1d3d23 into main Aug 14, 2024
7 of 12 checks passed
markmc pushed a commit to markmc/llm-compressor that referenced this pull request Nov 13, 2024
Allow any future versions of transformers since we are just using it for `AutoConfig` at the moment and would like to support new models.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants