Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

THUDM / GLM-130B Public

Notifications
Fork 608
Star 7.7k

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Security
Insights

Issues: THUDM/GLM-130B

Labels 9 Milestones 0

Labels 9 Milestones 0

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clear current search query, filters, and sorts

119 Open 80 Closed

119 Open 80 Closed

Author

Filter by author

Loading

Label

Filter by label

Loading

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Loading

Milestones

Filter by milestone

Loading

Assignee

Filter by who’s assigned

Assigned to nobody

Loading

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

BIG-Bench evaluation?

#5 opened Aug 8, 2022 by Randl updated Aug 8, 2022

1

Hugging Face transformers integration

#48 opened Dec 20, 2022 by asolano updated Dec 20, 2022

[Question] Can I finetune GLM-130B with SAT framework?

#55 opened Jan 8, 2023 by smeyerhot updated Jan 8, 2023

TASK_NAME/validation.jsonl used for tuning hyperparameters for zero-shot testing?

#64 opened Jan 12, 2023 by tomyoung903 updated Jan 12, 2023

will the GLM generation add some common text generation penalties like GPT-3?

#65 opened Jan 13, 2023 by cyente updated Jan 13, 2023

1

Can i have only the encoder of the model ?

#66 opened Jan 13, 2023 by MohamedAliRashad updated Jan 19, 2023

1

"Server Error" output on the huggingface demo

#77 opened Feb 3, 2023 by ogkalu2 updated Feb 3, 2023

Evaluating on My Own Datasets

#83 opened Feb 12, 2023 by lyy1994 updated Feb 12, 2023

zero-shot or few-shot for summarization task question.

#84 opened Feb 14, 2023 by siyuanxue updated Feb 14, 2023

1

will you release DCU Ascend train and inference code?

#86 opened Feb 16, 2023 by clockfly updated Feb 16, 2023

left-right generate in Chinese

#87 opened Feb 16, 2023 by lyzKF updated Feb 16, 2023

Why is there a backward method in the W8A16Linear class?

#90 opened Feb 21, 2023 by Ant0082 updated Feb 21, 2023

How to eval the glm-130b loss? I got a loss 6 on wudao corpus

#91 opened Feb 23, 2023 by Syno8 updated Feb 23, 2023

Can we add code data to continue train ?

#93 opened Mar 1, 2023 by mx8435 updated Mar 1, 2023

GLM-130B model evaluation on the 4 x RTX 3090 GPU machine

#94 opened Mar 1, 2023 by Tomas0413 updated Mar 1, 2023

continue pretrain and fine-tune

#79 opened Feb 8, 2023 by Porraio updated Mar 3, 2023

5

你好，big-bench好像不支持pytorch，请问如何测试big-bench

#98 opened Mar 10, 2023 by haiqizhang updated Mar 10, 2023

[Disscussion] Can we align GLM-130B to human like chatgpt?

#43 opened Dec 10, 2022 by AnShengqiang updated Mar 10, 2023

7

machine specification for pretraining

#99 opened Mar 12, 2023 by wlike updated Mar 12, 2023

评估数据集好像下载不了

#101 opened Mar 16, 2023 by cingtiye updated Mar 16, 2023

单机离线状态下无法运行，报错[errno 11001]getaddrinfo failed

#103 opened Mar 24, 2023 by gsxy456 updated Mar 24, 2023

模型解压出错

#107 opened Mar 29, 2023 by EasyLuck updated Mar 29, 2023

关于GLM在fasttransformer中的实现问题

#111 opened Mar 31, 2023 by shiqingzhangCSU updated Mar 31, 2023

1

中文推理prompt样例

#114 opened Apr 3, 2023 by chuckhope updated Apr 3, 2023

训练数据

#116 opened Apr 4, 2023 by joan126 updated Apr 4, 2023

Previous 1 2 3 4 5 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.