Skip to content
This repository was archived by the owner on Oct 25, 2024. It is now read-only.
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
38 commits
Select commit Hold shift + click to select a range
d2aa176
adapt smoothquant,static,woq autooround
changwangss May 7, 2024
4fa9555
Merge branch 'main' into wangchang/inc3.x
changwangss May 16, 2024
e804b7b
add GPTQ API
changwangss May 16, 2024
cbfd1be
migrated RTN to use INC3.x API.
XinyuYe-Intel May 17, 2024
44d0ced
migrate sq with INC 3.x
changwangss May 20, 2024
c66f984
ssupport moothquant with fix alpha
changwangss May 22, 2024
0d064a0
migrate restore sq from json
changwangss May 23, 2024
785be96
Merge branch 'main' into wangchang/inc3.x
changwangss May 23, 2024
4dc368a
added HQQ for WOQ in ITREX.
XinyuYe-Intel May 24, 2024
231c80a
Merge branch 'wangchang/inc3.x' of https://github.com/intel/intel-ext…
XinyuYe-Intel May 24, 2024
a7cf2c1
support chatglm,qwen,baichuan sq
changwangss May 28, 2024
2a60dfb
adapt inc fixed gptq
changwangss May 29, 2024
eb63429
migraate awq teq
changwangss May 29, 2024
3c2bec9
Merge branch 'main' into wangchang/inc3.x
changwangss May 29, 2024
0e40185
rebase autoround
changwangss May 29, 2024
7ae4aec
adapt inc 3.x weightonlylinear
changwangss May 30, 2024
f6f0ffc
support autoround 0.2 and sq with alpha auto
changwangss May 31, 2024
02d5751
fix awq folding setting to True
changwangss Jun 3, 2024
72035a2
Merge branch 'main' into wangchang/inc3.x
changwangss Jun 11, 2024
e8c0946
rebase
changwangss Jun 11, 2024
658e129
fix extension
changwangss Jun 11, 2024
fe06a84
support extension
changwangss Jun 11, 2024
17287f8
fix benchmark
changwangss Jun 11, 2024
91c973b
fix pylint
changwangss Jun 14, 2024
27148eb
fix pylint
changwangss Jun 14, 2024
a375b6f
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jun 14, 2024
6b429c0
install requirements_pt.txt of inc3.x
XuehaoSun Jun 14, 2024
ce93c14
Merge branch 'main' into wangchang/inc3.x
XuehaoSun Jun 14, 2024
81d0b35
Merge branch 'main' into wangchang/inc3.x
changwangss Jun 26, 2024
35b24f1
Merge branch 'main' into wangchang/inc3.x
changwangss Jul 3, 2024
ed478c1
rebase
changwangss Jul 3, 2024
634ef3e
update sq
changwangss Jul 4, 2024
565c377
remove woq hqq
changwangss Jul 4, 2024
33e15d8
Merge branch 'main' into wangchang/inc3.x
changwangss Jul 4, 2024
54c8157
fix pylint
changwangss Jul 4, 2024
b5537bc
Merge branch 'main' into wangchang/inc3.x
changwangss Jul 10, 2024
f77c8f8
fit to the latest inc
changwangss Jul 10, 2024
daa485c
remove engine ci and neuralchat ci
changwangss Jul 11, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
34 changes: 0 additions & 34 deletions .github/checkgroup.yml
Original file line number Diff line number Diff line change
Expand Up @@ -30,40 +30,6 @@ subprojects:
- "optimize-unit-test-PR-test"
- "Genreate-OptimizeUT-Report"

- id: "NeuralChat Unit Test"
paths:
- ".github/workflows/unit-test-neuralchat.yml"
- ".github/workflows/script/unitTest/run_unit_test_neuralchat.sh"
- "intel_extension_for_transformers/neural_chat/**"
- "requirements.txt"
- "setup.py"
- "intel_extension_for_transformers/transformers/llm/finetuning/**"
- "intel_extension_for_transformers/transformers/llm/quantization/**"
- "intel_extension_for_transformers/transformers/**"
- "intel_extension_for_transformers/langchain/**"
- "!intel_extension_for_transformers/neural_chat/docs/**"
- "!intel_extension_for_transformers/neural_chat/examples/**"
- "!intel_extension_for_transformers/neural_chat/assets/**"
- "!intel_extension_for_transformers/neural_chat/README.md"
checks:
- "neuralchat-unit-test-baseline"
- "neuralchat-unit-test-PR-test"
- "Generate-NeuralChat-Report"

- id: "Engine Unit Test workflow"
paths:
- ".github/workflows/unit-test-engine.yml"
- "requirements.txt"
- "setup.py"
- intel_extension_for_transformers/transformers/**
- "intel_extension_for_transformers/transformers/runtime/**"
- "!intel_extension_for_transformers/transformers/runtime/kernels/**"
- "!intel_extension_for_transformers/transformers/runtime/third_party/**"
- "!intel_extension_for_transformers/transformers/runtime/docs/**"
checks:
- "engine-unit-test-baseline"
- "engine-unit-test-PR-test"
- "Genreate-Engine-Report"

# - id: "Windows Binary Test"
# paths:
Expand Down
1 change: 1 addition & 0 deletions .github/workflows/script/unitTest/env_setup.sh
Original file line number Diff line number Diff line change
Expand Up @@ -13,6 +13,7 @@ until [ "$n" -ge 5 ]; do
git clone https://github.com/intel/neural-compressor.git /neural-compressor
cd /neural-compressor
pip install -r requirements.txt
pip install -r requirements_pt.txt
python setup.py install && break
n=$((n + 1))
sleep 5
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -36,21 +36,18 @@ OMP_NUM_THREADS=<physical cores num> numactl -m <node N> -C <cpu list> python ru
--model <MODEL_NAME_OR_PATH> \
--sq \
--output_dir <SQ_MODEL_SAVE_PATH> \ # Default is "./saved_results."
--int8 \
--benchmark \
--batch_size 1
# load SQ model quantied by itrex and do benchmark.
OMP_NUM_THREADS=<physical cores num> numactl -m <node N> -C <cpu list> python run_generation_sq.py \
--model <SQ_MODEL_SAVE_PATH> \
--int8 \
--benchmark \
--batch_size 1
# load SQ model quantied configure.json and do benchmark.
python run_generation_sq.py \
--model <MODEL_NAME_OR_PATH> \
--output_dir <SQ_MODEL_SAVE_PATH> \
--int8 \
--restore \
--restore_sq_model_from_json \
--benchmark \
--batch_size 1
```
Expand All @@ -68,23 +65,20 @@ python run_generation_sq.py \
--model <MODEL_NAME_OR_PATH> \
--sq \
--output_dir <SQ_MODEL_SAVE_PATH> \ # Default is "./saved_results."
--int8 \
--accuracy \
--batch_size 56

# load SQ model quantied by itrex and do benchmark.
python run_generation_sq.py \
--model <SQ_MODEL_SAVE_PATH> \
--int8 \
--accuracy \
--batch_size 56

# load SQ model quantied configure.json and do benchmark.
python run_generation_sq.py \
--model <MODEL_NAME_OR_PATH> \
--output_dir <SQ_MODEL_SAVE_PATH> \
--int8 \
--restore \
--restore_sq_model_from_json \
--accuracy \
--batch_size 56

Expand Down
Loading