-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
关于retriever训练数据的构建 #7
Comments
是的,主要是怕 datasets 下载遇到网络问题。但我们用的都是原始数据集,所以理论上用对应数据集的名称也可以加载 |
噢,原始的数据集有整理好的有开源出来吗 |
原始数据集放在了这里的 |
还有两个问题想请教下: |
|
您好,想请问下,在运行python3 tools/process_retriever_train_data.py --save retriever_data --data-names TRAIN 构建retriever训练数据的时候,是不是数据集必须提前下载到本地,通过本地加载,而不支持通过datasets下载,因为我看data_files参数的路径都是本地路径
The text was updated successfully, but these errors were encountered: