第四节：XTuner 大模型单卡低成本微调实战

# 修改import部分
- from xtuner.dataset.map_fns import oasst1_map_fn, template_map_fn_factory
+ from xtuner.dataset.map_fns import template_map_fn_factory

# 修改模型为本地路径
- pretrained_model_name_or_path = 'internlm/internlm-chat-7b'
+ pretrained_model_name_or_path = './internlm-chat-7b'

# 修改训练数据为 MedQA2019-structured-train.jsonl 路径
- data_path = 'timdettmers/openassistant-guanaco'
+ data_path = 'MedQA2019-structured-train.jsonl'

# 修改 train_dataset 对象
train_dataset = dict(
    type=process_hf_dataset,
-   dataset=dict(type=load_dataset, path=data_path),
+   dataset=dict(type=load_dataset, path='json', data_files=dict(train=data_path)),
    tokenizer=tokenizer,
    max_length=max_length,
-   dataset_map_fn=alpaca_map_fn,
+   dataset_map_fn=None,
    template_map_fn=dict(
        type=template_map_fn_factory, template=prompt_template),
    remove_unused_columns=True,
    shuffle_before_pack=True,
    pack_to_max_length=pack_to_max_length)

开始训练

xtuner train internlm_chat_7b_qlora_medqa2019_e3.py --deepspeed deepspeed_zero2

将训练完成的Adapter转为Huggingface格式

export MKL_SERVICE_FORCE_INTEL=1
xtuner convert pth_to_hf internlm_chat_7b_qlora_medqa2019_e3.py ${训练保存的模型路径} ${转换要保存的路径}

合并模型，将Adapter与预训练模型合并一起

xtuner convert merge ./internlm-chat-7b ./hf ./merged --max-shard-size 2GB

# xtuner convert merge \
#     ${NAME_OR_PATH_TO_LLM} \
#     ${NAME_OR_PATH_TO_ADAPTER} \
#     ${SAVE_PATH} \
#     --max-shard-size 2GB

测试模型:

参考代码
```
streamlit run web_demo.py
```

3. XTuner项目代码分析

为什么输入xtuner train 命令就能运行起来了呢？

这里关键的代码主要有：setup.py、xtuner/entry_point.py

## setup.py 文件中
entry_points={'console_scripts': ['xtuner = xtuner:cli']})

这一句说明了，当在终端输入xtuner命令的时候执行xtuner.cli方法

modes = {
    'list-cfg': list_cfg.__file__,
    'copy-cfg': copy_cfg.__file__,
    'log-dataset': log_dataset.__file__,
    'check-custom-dataset': check_custom_dataset.__file__,
    'train': train.__file__,
    'test': test.__file__,
    'chat': chat.__file__,
    'convert': {
        'pth_to_hf': pth_to_hf.__file__,
        'merge': merge.__file__,
        'split': split.__file__,
        '--help': lambda: print_log(CONVERT_HELP_MSG, 'current'),
        '-h': lambda: print_log(CONVERT_HELP_MSG, 'current')
    },
    'preprocess': {
        'arxiv': arxiv_preprocess.__file__,
        '--help': lambda: print_log(PREPROCESS_HELP_MSG, 'current'),
        '-h': lambda: print_log(PREPROCESS_HELP_MSG, 'current')
    }
}


def cli():
    args = sys.argv[1:]
        return

cli方法首先会解析参数，然后根据参数从modes字典中取对应的方法。比如输入：xtuner train，其实后端会执行train.__file__就是train.py文件了。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

coursenote.md

coursenote.md

第四节：XTuner 大模型单卡低成本微调实战

目录

1. 环境配置

2. 模型微调

3. XTuner项目代码分析

Files

coursenote.md

Latest commit

History

coursenote.md

File metadata and controls

第四节：XTuner 大模型单卡低成本微调实战

目录

1. 环境配置

2. 模型微调

3. XTuner项目代码分析