Skip to content

Issues: open-compass/opencompass

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[Feature] dataset for humaneval-multipl
#1673 opened Nov 9, 2024 by jyshee
1 task
[Feature] Support Math23k
#1667 opened Nov 7, 2024 by 00INDEX
1 task
[Bug] mmlupro 正则提取错误
#1661 opened Nov 4, 2024 by bittersweet1999
2 tasks done
[Bug] Error in downloading GSM8k dataset
#1654 opened Oct 31, 2024 by acse-ym722
2 tasks done
[Bug] 多种测试方法测试结果差异大
#1649 opened Oct 29, 2024 by DietDietDiet
2 tasks done
[Bug] TruthfulQA同时使用多个评估metric时报错
#1646 opened Oct 28, 2024 by XEric7
2 tasks done
SafetyBench数据集评测bug
#1622 opened Oct 18, 2024 by shutttttdown
2 tasks done
[Bug] hf推理与vllm推理评测结果不一致
#1594 opened Oct 9, 2024 by luhairong11
2 tasks done
[Bug] Silent GPU failures, works with --debug
#1585 opened Oct 4, 2024 by anuragprat1k
2 tasks done
[Bug] pid_params.py dumps not successfully.
#1580 opened Sep 29, 2024 by tonysy
2 tasks done
[Bug] debug不显示日志
#1578 opened Sep 29, 2024 by HaltonJiang
2 tasks done
[Bug] mtbench101:batch_size设置
#1571 opened Sep 27, 2024 by zhang-junjian
2 tasks done
[Feature] Support MMMLU backlog
#1560 opened Sep 24, 2024 by tonysy
1 task
[Bug] HuggingFacewithChatTemplate
#1557 opened Sep 24, 2024 by ZCzzzzzz
2 tasks done
[Bug] 试图添加mlc后端遇到问题,找不到cuda
#1551 opened Sep 23, 2024 by XJY990705
2 tasks done
[Bug] SciCode miss with_background Bug Something isn't working
#1540 opened Sep 18, 2024 by tonysy
2 tasks done
ProTip! Mix and match filters to narrow down what you’re looking for.