Issues: THUDM/GLM-130B

Issues list

BIG-Bench evaluation?
#5 opened Aug 8, 2022 by Randl updated Aug 8, 2022
Hugging Face transformers integration
#48 opened Dec 20, 2022 by asolano updated Dec 20, 2022
[Question] Can I finetune GLM-130B with SAT framework?
#55 opened Jan 8, 2023 by smeyerhot updated Jan 8, 2023
Can I have only the encoder of the model?
#66 opened Jan 13, 2023 by MohamedAliRashad updated Jan 19, 2023
"Server Error" output on the huggingface demo
#77 opened Feb 3, 2023 by ogkalu2 updated Feb 3, 2023
Evaluating on My Own Datasets
#83 opened Feb 12, 2023 by lyy1994 updated Feb 12, 2023
zero-shot or few-shot for summarization task question.
#84 opened Feb 14, 2023 by siyuanxue updated Feb 14, 2023
will you release DCU Ascend train and inference code?
#86 opened Feb 16, 2023 by clockfly updated Feb 16, 2023
left-right generate in Chinese
#87 opened Feb 16, 2023 by lyzKF updated Feb 16, 2023
Why is there a backward method in the W8A16Linear class?
#90 opened Feb 21, 2023 by Ant0082 updated Feb 21, 2023
How to eval the glm-130b loss? I got a loss 6 on wudao corpus
#91 opened Feb 23, 2023 by Syno8 updated Feb 23, 2023
Can we add code data to continue train ?
#93 opened Mar 1, 2023 by mx8435 updated Mar 1, 2023
GLM-130B model evaluation on the 4 x RTX 3090 GPU machine
#94 opened Mar 1, 2023 by Tomas0413 updated Mar 1, 2023
continue pretrain and fine-tune
#79 opened Feb 8, 2023 by Porraio updated Mar 3, 2023
[Discussion] Can we align GLM-130B to human like chatgpt?
#43 opened Dec 10, 2022 by AnShengqiang updated Mar 10, 2023
machine specification for pretraining
#99 opened Mar 12, 2023 by wlike updated Mar 12, 2023
The evaluation datasets can't seem to be downloaded
#101 opened Mar 16, 2023 by cingtiye updated Mar 16, 2023
Error when decompressing the model
#107 opened Mar 29, 2023 by EasyLuck updated Mar 29, 2023
Question about the implementation of GLM in fasttransformer
#111 opened Mar 31, 2023 by shiqingzhangCSU updated Mar 31, 2023
Example prompts for Chinese inference
#114 opened Apr 3, 2023 by chuckhope updated Apr 3, 2023
Training data
#116 opened Apr 4, 2023 by joan126 updated Apr 4, 2023