add base model qlora fintuning config file and optimize deduplicate.py #128

chg0901 · 2024-03-23T06:31:12Z

optimize deduplicate.py

Add time print information
save duplicate dataset as well
remove print(content)

Terminal print information in seconds

(tf2_9) D:\github_repos\EmoLLM\datasets>python deduplicate.py
file name: ./FatherLikeBF/qwen9747_zhipuai8857_final_merge_all.json
print_len_seen_hashes=500 Time:  15.8351 15.8351
print_len_seen_hashes=1000 Time:  40.83554 56.67064
print_len_duplicate=500 Time:  89.68914 89.68914
print_len_seen_hashes=1500 Time:  87.62722 144.29787
print_len_duplicate=1000 Time:  105.06385 194.75299
print_len_duplicate=1500 Time:  120.15144 314.90443
print_len_seen_hashes=2000 Time:  199.31009 343.60795
print_len_duplicate=2000 Time:  146.22297 461.1274

add base model qlora fintuning config file: internlm2_7b_base_qlora_e10

add full finetune code from internlm2

other 2 configs for base model

update cli_internlm2.py with three methods to download or load model

download model in openxlab (6-8M/s)
download model in modelscope (20-30M/s)
offline model

create upload_modelscope.py

add base model and update personal contributions

Create README_internlm2_7b_base_qlora.md

InternLM2 7B Base QLoRA fine-tuning guide

Add time print information save duplicate dataset as well remove print(content)

…10_M_1e4_32_64.py

three methods to load model 1. download model in openxlab 2. download model in modelscope 3. offline model

InternLM2 7B Base QLoRA 微调指南

chg0901 added 8 commits March 23, 2024 15:24

optimize deduplicate.py

950cab0

Add time print information save duplicate dataset as well remove print(content)

add base model qlora fintuning config file: internlm2_7b_base_qlora_e…

252adc7

…10_M_1e4_32_64.py

add full finetune code from internlm2

df81a99

other 2 configs for base model

0124001

update cli_internlm2.py

a22ec59

three methods to load model 1. download model in openxlab 2. download model in modelscope 3. offline model

create upload_modelscope.py

affd90b

add base model and update personal contributions

6e0042a

Create README_internlm2_7b_base_qlora.md

383789e

InternLM2 7B Base QLoRA 微调指南

aJupyter changed the base branch from main to dev March 23, 2024 11:17

aJupyter merged commit a12a7ef into SmartFlowAI:dev Mar 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add base model qlora fintuning config file and optimize deduplicate.py #128

add base model qlora fintuning config file and optimize deduplicate.py #128

chg0901 commented Mar 23, 2024 •

edited

Loading

add base model qlora fintuning config file and optimize deduplicate.py #128

add base model qlora fintuning config file and optimize deduplicate.py #128

Conversation

chg0901 commented Mar 23, 2024 • edited Loading

optimize deduplicate.py

Terminal print information in seconds

add base model qlora fintuning config file: internlm2_7b_base_qlora_e10

add full finetune code from internlm2

other 2 configs for base model

update cli_internlm2.py with three methods to download or load model

create upload_modelscope.py

add base model and update personal contributions

Create README_internlm2_7b_base_qlora.md

chg0901 commented Mar 23, 2024 •

edited

Loading