Intel Gpu support initialization #4340
Looking very good! |
Added preliminary support. Certain AMX-related ISAs are still to be added in llama.cpp, so I have not added those yet. This can now be tested on Intel Arc devices. |
@abhilash1910 I have reviewed the changes and they look good. The level of detail in this PR is extremely impressive -- you have accelerated something that would have taken me several months or indefinitely to do, given that I do not have an Intel Arc GPU and they are not readily available on GPU rental platforms. I think that the best way to get it tested is to get it merged and let users test it on main, so let's do that. |
commit e18a046 Author: kabachuha <artemkhrapov2001@yandex.ru> Date: Sat Nov 4 22:12:51 2023 +0300 fix openai extension not working because of absent new defaults (oobabooga#4477)
commit b7a409e Merge: b5c5304 fb3bd02 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Sat Nov 4 15:04:43 2023 -0300 Merge pull request oobabooga#4476 from oobabooga/dev Merge dev branch
commit fb3bd02 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Sat Nov 4 11:02:24 2023 -0700 Update docs
commit 1d8c7c1 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Sat Nov 4 11:01:15 2023 -0700 Update docs
commit b5c5304 Merge: 262f8ae 40f7f37 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Sat Nov 4 14:19:55 2023 -0300 Merge pull request oobabooga#4475 from oobabooga/dev Merge dev branch
commit 40f7f37 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Sat Nov 4 10:12:06 2023 -0700 Update requirements
commit 2081f43 Author: Orang <51061118+Soefati@users.noreply.github.com> Date: Sun Nov 5 00:00:24 2023 +0700 Bump transformers to 4.35.* (oobabooga#4474)
commit 4766a57 Author: feng lui <3090641@qq.com> Date: Sun Nov 5 00:59:33 2023 +0800 transformers: add use_flash_attention_2 option (oobabooga#4373)
commit add3593 Author: wouter van der plas <2423856+wvanderp@users.noreply.github.com> Date: Sat Nov 4 17:41:42 2023 +0100 fixed two links in the ui (oobabooga#4452)
commit cfbd108 Author: Casper <casperbh.96@gmail.com> Date: Sat Nov 4 17:09:41 2023 +0100 Bump AWQ to 0.1.6 (oobabooga#4470)
commit aa5d671 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Sat Nov 4 13:09:07 2023 -0300 Add temperature_last parameter (oobabooga#4472)
commit 1ab8700 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Fri Nov 3 17:38:19 2023 -0700 Change frequency/presence penalty ranges
commit 45fcb60 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Fri Nov 3 11:29:31 2023 -0700 Make truncation_length_max apply to max_seq_len/n_ctx
commit 7f9c1cb Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Fri Nov 3 08:25:22 2023 -0700 Change min_p default to 0.0
commit 4537853 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Fri Nov 3 08:13:50 2023 -0700 Change min_p default to 1.0
commit 367e5e6 Author: kalomaze <66376113+kalomaze@users.noreply.github.com> Date: Thu Nov 2 14:32:51 2023 -0500 Implement Min P as a sampler option in HF loaders (oobabooga#4449)
commit fcb7017 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Thu Nov 2 12:24:09 2023 -0700 Remove a checkbox
commit fdcaa95 Author: Julien Chaumond <julien@huggingface.co> Date: Thu Nov 2 20:20:54 2023 +0100 transformers: Add a flag to force load from safetensors (oobabooga#4450)
commit c065547 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Thu Nov 2 11:23:04 2023 -0700 Add cache_8bit option
commit 42f8163 Merge: 77abd9b a56ef2a Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Thu Nov 2 11:09:26 2023 -0700 Merge remote-tracking branch 'refs/remotes/origin/dev' into dev
commit 77abd9b Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Thu Nov 2 08:19:42 2023 -0700 Add no_flash_attn option
commit a56ef2a Author: Julien Chaumond <julien@huggingface.co> Date: Thu Nov 2 18:07:08 2023 +0100 make torch.load a bit safer (oobabooga#4448)
commit deba039 Author: deevis <darren.hicks@gmail.com> Date: Tue Oct 31 22:51:00 2023 -0600 (fix): OpenOrca-Platypus2 models should use correct instruction_template and custom_stopping_strings (oobabooga#4435)
commit aaf726d Author: Mehran Ziadloo <mehranziadloo@gmail.com> Date: Tue Oct 31 21:29:57 2023 -0700 Updating the shared settings object when loading a model (oobabooga#4425)
commit 9bd0724 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Tue Oct 31 20:57:56 2023 -0700 Change frequency/presence penalty ranges
commit 6b7fa45 Author: Orang <51061118+Soefati@users.noreply.github.com> Date: Wed Nov 1 05:12:14 2023 +0700 Update exllamav2 version (oobabooga#4417)
commit 41e159e Author: Casper <casperbh.96@gmail.com> Date: Tue Oct 31 23:11:22 2023 +0100 Bump AutoAWQ to v0.1.5 (oobabooga#4410)
commit 0707ed7 Author: Meheret <101792782+senadev42@users.noreply.github.com> Date: Wed Nov 1 01:09:05 2023 +0300 updated wiki link (oobabooga#4415)
commit 262f8ae Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Fri Oct 27 06:49:14 2023 -0700 Use default gr.Dataframe for evaluation table
commit f481ce3 Author: James Braza <jamesbraza@gmail.com> Date: Thu Oct 26 21:02:28 2023 -0700 Adding `platform_system` to `autoawq` (oobabooga#4390)
commit af98587 Author: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Date: Fri Oct 27 00:46:16 2023 -0300 Update accelerate requirement from ==0.23.* to ==0.24.* (oobabooga#4400)
commit 839a87b Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Thu Oct 26 20:26:25 2023 -0700 Fix is_ccl_available & is_xpu_available imports
commit 778a010 Author: Abhilash Majumder <30946547+abhilash1910@users.noreply.github.com> Date: Fri Oct 27 08:09:51 2023 +0530 Intel Gpu support initialization (oobabooga#4340)
commit 317e2c8 Author: GuizzyQC <86683381+GuizzyQC@users.noreply.github.com> Date: Thu Oct 26 22:03:21 2023 -0400 sd_api_pictures: fix Gradio warning message regarding custom value (oobabooga#4391)
commit 92b2f57 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Thu Oct 26 18:57:32 2023 -0700 Minor metadata bug fix (second attempt)
commit 2d97897 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Wed Oct 25 11:21:18 2023 -0700 Don't install flash-attention on windows + cuda 11
commit 0ced78f Author: LightningDragon <lightningdragon96@gmail.com> Date: Wed Oct 25 09:15:34 2023 -0600 Replace hashlib.sha256 with hashlib.file_digest so we don't need to load entire files into ram before hashing them. (oobabooga#4383)
commit 72f6fc6 Author: tdrussell <6509934+tdrussell@users.noreply.github.com> Date: Wed Oct 25 10:10:28 2023 -0500 Rename additive_repetition_penalty to presence_penalty, add frequency_penalty (oobabooga#4376)
commit ef1489c Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Mon Oct 23 20:45:43 2023 -0700 Remove unused parameter in AutoAWQ
commit 1edf321 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Mon Oct 23 13:09:03 2023 -0700 Lint
commit 280ae72 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Mon Oct 23 13:07:17 2023 -0700 Organize
commit 49e5eec Merge: 82c11be 4bc4113 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Mon Oct 23 12:54:05 2023 -0700 Merge remote-tracking branch 'refs/remotes/origin/main'
commit 82c11be Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Mon Oct 23 12:49:07 2023 -0700 Update 04 - Model Tab.md
commit 306d764 Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Mon Oct 23 12:46:24 2023 -0700 Minor metadata bug fix
commit 4bc4113 Author: adrianfiedler <adrian_fiedler@msn.com> Date: Mon Oct 23 19:09:57 2023 +0200 Fix broken links (oobabooga#4367) --------- Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
commit 92691ee Author: oobabooga <112222186+oobabooga@users.noreply.github.com> Date: Mon Oct 23 09:57:44 2023 -0700 Disable trust_remote_code by default
I have an A770 and have not succeeded in using this so far. The problems are all environmental. @abhilash1910 how do you start an environment to have all the right Intel libraries and modules? I'm stumped; it seems like all roads lead to an ABI mismatch. Are you using Conda or something else? |
@naptastic, try |
Ok, that got me most of the way there. It was still necessary to do |
@naptastic, I suggest continuing here |
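For the environment problems discussed above, a small diagnostic can help narrow down where the Intel stack breaks. The sketch below is not part of this PR. The function name `xpu_environment_report` is hypothetical. The sketch assumes the relevant packages are `torch` and `intel-extension-for-pytorch`, and that a working install exposes `torch.xpu.is_available()`. It only reports what it finds and does not fix anything.

```python
from importlib import metadata


def xpu_environment_report():
    """Report installed versions and whether an XPU device is visible.

    Hypothetical helper for illustration; safe to run when nothing is installed.
    """
    # Map import names to the distribution names assumed on PyPI.
    dists = {
        "torch": "torch",
        "intel_extension_for_pytorch": "intel-extension-for-pytorch",
    }
    report = {}
    for module_name, dist_name in dists.items():
        try:
            report[module_name] = metadata.version(dist_name)
        except metadata.PackageNotFoundError:
            report[module_name] = None

    report["xpu_available"] = False
    if report["torch"] is not None:
        try:
            import torch

            if report["intel_extension_for_pytorch"] is not None:
                # A version or ABI mismatch between the two packages
                # would most likely raise during this import.
                import intel_extension_for_pytorch  # noqa: F401

            xpu = getattr(torch, "xpu", None)
            report["xpu_available"] = bool(xpu is not None and xpu.is_available())
        except Exception as exc:
            # Keep the error text so it can be pasted into a bug report.
            report["error"] = f"{type(exc).__name__}: {exc}"
    return report


if __name__ == "__main__":
    for key, value in xpu_environment_report().items():
        print(f"{key}: {value}")
```

If the report contains an `error` entry, posting it together with the two version strings gives maintainers a concrete starting point.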
From thread #1575
WIP (Draft mode)
Checklist: